[Kernel] Refactor FlashInfer allreduce for mnnvl backend #34109
squash merge
b2c39595
hjjq force pushed to b2c39595 105 days ago
hjjq marked this pull request as ready for review 105 days ago
Allow both backends to be used at the same time
a118b20f
Merge branch 'main' into hjjq/ar
159bdd6b
Test both trtllm and mnnvl backends in test_fusion_all_reduce.py
fe3458c3
Merge branch 'main' into hjjq/ar
922a4dbf
Add flashinfer AR to benchmark_device_communicators.py
955630f1
wzhao18 force pushed to 955630f1 92 days ago
ilmarkov approved these changes on 2026-02-25
Merge main
8948a590
Special warning for multicast check in AR workspace initialization fa…
3f3dc6f8
Merge branch 'main' into hjjq/ar
746c8fed
mgoin removed documentation
mgoin removed speculative-decoding
mgoin removed multi-modality
mgoin approved these changes on 2026-02-26
hjjq deleted the hjjq/ar branch 72 days ago
Assignees
No one assigned
Labels
performance
ready
nvidia