vllm
[Bugfix][MoE] Fix 6-8% decode regression: prefer multi-stream shared expert overlap
#38990
Merged

voipmonitor
voipmonitor [MoE] Prefer multi-stream shared expert overlap over external ordering
8b7b9035
voipmonitor requested a review from mgoin 34 days ago
voipmonitor requested a review from pavanimajety 34 days ago
gemini-code-assist
gemini-code-assist commented on 2026-04-04
robertgshaw2-redhat
robertgshaw2-redhat
robertgshaw2-redhat
update logic
52f86572
robertgshaw2-redhat
robertgshaw2-redhat added ready-run-all-tests
robertgshaw2-redhat added ready
robertgshaw2-redhat Merge branch 'main' into fix-shared-experts-overlap
12ab5e09
robertgshaw2-redhat changed the title from "[MoE] Fix 6-8% decode regression: prefer multi-stream shared expert overlap" to "[Bugfix][MoE] Fix 6-8% decode regression: prefer multi-stream shared expert overlap" 34 days ago
robertgshaw2-redhat enabled auto-merge (squash) 34 days ago
mergify added bug
milesial
robertgshaw2-redhat
robertgshaw2-redhat approved these changes on 2026-04-05
robertgshaw2-redhat merged 228023b3 into main 33 days ago
robertgshaw2-redhat
milesial

