llama : add --n-cpu-moe option (#15077)

202 days ago

* llama : add --n-cpu-moe option

Keeps the MoE weights of the first N layers in the CPU
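
To illustrate what "the first N layers" selects, here is a minimal Python sketch, not the code from this commit. It assumes that MoE expert tensors follow a `blk.<layer>.ffn_(up|down|gate)_exps` naming scheme and that layers are counted from 0; neither detail is stated in the commit message. The helpers `cpu_moe_patterns` and `stays_on_cpu` are hypothetical names introduced only for this example.

```python
import re


def cpu_moe_patterns(n_cpu_moe: int) -> list[str]:
    """Return one regex per layer for layers 0 .. n_cpu_moe - 1.

    Hypothetical helper: the tensor naming scheme is an assumption,
    not something the commit message specifies.
    """
    return [rf"blk\.{i}\.ffn_(up|down|gate)_exps" for i in range(n_cpu_moe)]


def stays_on_cpu(tensor_name: str, n_cpu_moe: int) -> bool:
    """True if the tensor is an expert weight of one of the first N layers."""
    return any(re.search(p, tensor_name) for p in cpu_moe_patterns(n_cpu_moe))


if __name__ == "__main__":
    # With N = 2, only the expert tensors of layers 0 and 1 are selected.
    for name in (
        "blk.0.ffn_up_exps.weight",
        "blk.1.ffn_gate_exps.weight",
        "blk.2.ffn_down_exps.weight",
        "blk.0.attn_q.weight",
    ):
        print(name, stays_on_cpu(name, 2))
```

Under these assumptions, attention tensors and the expert tensors of later layers are not selected, so only the expert weights of the first N layers stay in CPU memory.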