llama.cpp
d498af3d
- graph : avoid huge warm-up graphs for MoE models (#14753)
Commit
56 days ago
graph : avoid huge warm-up graphs for MoE models (#14753)

* graph : avoid huge warm-up graphs for MoE models

ggml-ci

* cont : bump max nodes to 8x model tensors
References
#14753 - graph : avoid huge warm-up graphs for MoE models
Author
ggerganov
Parents
eacdeb5b