llama.cpp
graph : avoid huge warm-up graphs for MoE models
#14753
Merged

graph : avoid huge warm-up graphs for MoE models #14753

ggerganov merged 2 commits into master from gg/context-reduce-min-nodes
ggerganov
ggerganov ggerganov force pushed 207 days ago
ggerganov ggerganov requested a review from slaren slaren 207 days ago
ggerganov graph : avoid huge warm-up graphs for MoE models
033b3066
ggerganov ggerganov force pushed to 033b3066 207 days ago
ggerganov
ggerganov commented on 2025-07-18
slaren
ggerganov
ggerganov cont : bump max nodes to 8x model tensors
5883f014
slaren
slaren approved these changes on 2025-07-18
ggerganov ggerganov merged d498af3d into master 207 days ago
ggerganov ggerganov deleted the gg/context-reduce-min-nodes branch 207 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone