llama.cpp
d498af3d - graph : avoid huge warm-up graphs for MoE models (#14753)

Commit
56 days ago
graph : avoid huge warm-up graphs for MoE models (#14753)

* graph : avoid huge warm-up graphs for MoE models

ggml-ci

* cont : bump max nodes to 8x model tensors