vllm
Fix Flashinfer Allreduce+Norm enable disable calculation based on `fi_allreduce_fusion_max_token_num`
#21325
Merged

Fix Flashinfer Allreduce+Norm enable disable calculation based on `fi_allreduce_fusion_max_token_num` #21325

xinli-git
xinli-git xinli-git requested a review from zou3519 zou3519 206 days ago
xinli-git xinli-git requested a review from youkaichao youkaichao 206 days ago
xinli-git xinli-git requested a review from ProExpertProg ProExpertProg 206 days ago
github-actions
gemini-code-assist
gemini-code-assist commented on 2025-07-21
xinli-git xinli-git force pushed 206 days ago
xinli-sw fix flashifner enable disable calculation
3fb686fa
xinli-sw address copilot feedback
bf3fa634
xinli-git xinli-git force pushed to bf3fa634 206 days ago
mgoin
mgoin commented on 2025-07-21
xinli-sw address review feedback when world size is uncommon
8195e6c5
xinli-git xinli-git force pushed to 8195e6c5 206 days ago
ilmarkov
mgoin mgoin added ready
mgoin mgoin added performance
xinli-git
mgoin mgoin changed the title Fix Flashifner Allreduce+Norm enable disable calculation based on `fi_allreduce_fusion_max_token_num` Fix Flashinfer Allreduce+Norm enable disable calculation based on `fi_allreduce_fusion_max_token_num` 205 days ago
mgoin
mgoin approved these changes on 2025-07-22
simon-mo simon-mo merged ae268b63 into main 205 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone