[Kernel] Refactor FlashInfer allreduce for mnnvl backend #34109
squash merge
b2c39595
hjjq
force pushed
from
d95e22af
to
b2c39595
15 days ago
hjjq
marked this pull request as ready for review 15 days ago
Allow both backends to be used at the same time
a118b20f
Merge branch 'main' into hjjq/ar
159bdd6b
Test both trtllm and mnnvl backends in test_fusion_all_reduce.py
fe3458c3
Merge branch 'main' into hjjq/ar
922a4dbf
Add flashinfer AR to benchmark_device_communicators.py
955630f1
wzhao18
force pushed
from
d6d1a835
to
955630f1
2 days ago
ilmarkov
approved these changes
on 2026-02-25
Merge main
8948a590
Special warning for multicast check in AR workspace initialization fa…
3f3dc6f8
Merge branch 'main' into hjjq/ar
746c8fed
mgoin
removed documentation
mgoin
removed speculative-decoding
mgoin
removed multi-modality
mgoin
approved these changes
on 2026-02-26
Assignees
No one assigned
Labels
performance
ready
nvidia
Login to write a write a comment.
Login via GitHub