vllm
[Model] Add tuned triton fused_moe configs for Qwen3Moe on B200
#31448
Merged

[Model] Add tuned triton fused_moe configs for Qwen3Moe on B200 #31448

Jzz1943
Jzz1943 [Model] Add tuned triton fused_moe configs for Qwen3Moe on B200
8c9f53ea
Jzz1943 Jzz1943 requested a review from mgoin mgoin 54 days ago
Jzz1943 Jzz1943 requested a review from pavanimajety pavanimajety 54 days ago
mergify mergify added qwen
gemini-code-assist
gemini-code-assist commented on 2025-12-28
mgoin
mgoin approved these changes on 2025-12-28
vllm-bot vllm-bot merged 0b6b7010 into main 54 days ago
Jzz1943 Jzz1943 deleted the b200-fused-moe branch 52 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone