llama.cpp
ec428b02 - llama : add --n-cpu-moe option (#15077)

Commit
94 days ago
llama : add --n-cpu-moe option (#15077) * llama : add --n-cpu-moe option Keeps the MoE weights of the first N layers in the CPU
Author
Parents
Loading