llama.cpp
Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE
#22129
Merged

Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE #22129

gaugarg-nv
gaugarg-nv Fix delayed AllReduce on Gemma-4 MoE
4ce8fde4
github-actions github-actions added ggml
am17an
am17an
am17an approved these changes on 2026-04-20
loci-dev
gaugarg-nv
JohannesGaessler
JohannesGaessler commented on 2026-04-20
gaugarg-nv Check for all sources before skipping nodes
07a15854
JohannesGaessler
JohannesGaessler approved these changes on 2026-04-20
gaugarg-nv Address review comments
63c7607d
JohannesGaessler
JohannesGaessler approved these changes on 2026-04-20
JohannesGaessler JohannesGaessler merged fd6ae4ca into master 50 days ago
gaugarg-nv gaugarg-nv deleted the gemma4_perf branch 50 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone