whisper.cpp
0b1962a1
- Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (llama/13386)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
231 days ago
Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (llama/13386)
References
#3148 - sync : ggml
Author
hjc4869
Committer
ggerganov
Parents
86dece9c
Loading