llama.cpp
7f323a58 - Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)

Commit
177 days ago