llama.cpp
7f323a58
- Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
177 days ago
Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)
References
#13386 - Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B
Author
hjc4869
Parents
3eac2093
Loading