vllm
[Bugfix] Fix fused MoE int32 overflow in stride*offset without perf regression
#34507
Merged


haosdent
haosdent requested a review from mgoin 86 days ago
haosdent requested a review from pavanimajety 86 days ago
mergify added bug
gemini-code-assist
gemini-code-assist commented on 2026-02-13
eugr
mgoin assigned tlrmchlsmth 86 days ago
mgoin requested a review from tlrmchlsmth 86 days ago
mgoin requested a review from robertgshaw2-redhat 86 days ago
mgoin
tlrmchlsmth
haosdent
haosdent requested a review from WoosukKwon 85 days ago
haosdent requested a review from yewentao256 85 days ago
haosdent
haosdent force pushed 85 days ago
haosdent [Bugfix] Fix fused MoE int32 overflow in stride*offset without perf r…
54ba644b
haosdent force pushed to 54ba644b 85 days ago
haosdent changed the title from "[Bugfix] Fix fused MoE perf regression on small GPUs from int64 strides" to "[Bugfix] Fix fused MoE int32 overflow in stride*offset without perf regression" 85 days ago
haosdent
mgoin
mgoin approved these changes on 2026-02-14
mgoin added ready
mgoin Merge branch 'main' into fix/fused-moe-int64-stride-perf-regression
8a0d4c45
tlrmchlsmth
tlrmchlsmth approved these changes on 2026-02-16
vllm-bot merged b68fd899 into main 83 days ago
mgehre-amd
haosdent
mgehre-amd

