vllm
[Kernel] Refactor FlashInfer allreduce for mnnvl backend
#34109
Merged

Commits
  • squash merge
    hjjq committed 58 days ago
  • Allow both backends to be used at the same time
    hjjq committed 57 days ago
  • Merge branch 'main' into hjjq/ar
    wzhao18 committed 52 days ago
  • Test both trtllm and mnnvl backends in test_fusion_all_reduce.py
    wzhao18 committed 52 days ago
  • Merge branch 'main' into hjjq/ar
    wzhao18 committed 45 days ago
  • Add flashinfer AR to benchmark_device_communicators.py
    wzhao18 committed 45 days ago
  • Merge main
    wzhao18 committed 44 days ago
  • Special warning for multicast check in AR workspace initialization failure
    wzhao18 committed 44 days ago
  • Merge branch 'main' into hjjq/ar
    wzhao18 committed 44 days ago