Tensor-parallel: Fix delayed AllReduce on Gemma-4 MoE #22129
Fix delayed AllReduce on Gemma-4 MoE
4ce8fde4
am17an
approved these changes
on 2026-04-20
Check for all sources before skipping nodes
07a15854
Address review comments
63c7607d
gaugarg-nv
deleted the gemma4_perf branch 50 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub