llama.cpp
ec428b02
- llama : add --n-cpu-moe option (#15077)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
94 days ago
llama : add --n-cpu-moe option (#15077) * llama : add --n-cpu-moe option Keeps the MoE weights of the first N layers in the CPU
References
#15077 - llama : add --n-cpu-moe option
Author
slaren
Parents
19f68fa5
Loading