vllm
[Kernel] Refactor FlashInfer allreduce for mnnvl backend
#34109

Merged

Commits

squash merge

hjjq committed 58 days ago
Allow both backends to be used at the same time

hjjq committed 57 days ago
Merge branch 'main' into hjjq/ar

wzhao18 committed 52 days ago
Test both trtllm and mnnvl backends in test_fusion_all_reduce.py

wzhao18 committed 52 days ago
Merge branch 'main' into hjjq/ar

wzhao18 committed 45 days ago
Add flashinfer AR to benchmark_device_communicators.py

wzhao18 committed 45 days ago
Merge main

wzhao18 committed 44 days ago
Special warning for multicast check in AR workspace initialization failure

wzhao18 committed 44 days ago
Merge branch 'main' into hjjq/ar

wzhao18 committed 44 days ago