vllm
[Kernel] Refactor FlashInfer allreduce for mnnvl backend
#34109
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
9
Changes
View On
GitHub
Commits
squash merge
hjjq
committed
58 days ago
Allow both backends to be used at the same time
hjjq
committed
57 days ago
Merge branch 'main' into hjjq/ar
wzhao18
committed
52 days ago
Test both trtllm and mnnvl backends in test_fusion_all_reduce.py
wzhao18
committed
52 days ago
Merge branch 'main' into hjjq/ar
wzhao18
committed
45 days ago
Add flashinfer AR to benchmark_device_communicators.py
wzhao18
committed
45 days ago
Merge main
wzhao18
committed
44 days ago
Special warning for multicast check in AR workspace initialization failure
wzhao18
committed
44 days ago
Merge branch 'main' into hjjq/ar
wzhao18
committed
44 days ago
Loading