vllm
[Bugfix][MoE] Fix 6-8% decode regression: prefer multi-stream shared expert overlap
#38990
Merged
robertgshaw2-redhat merged 3 commits into vllm-project:main from voipmonitor:fix-shared-experts-overlap
[MoE] Prefer multi-stream shared expert overlap over external ordering
8b7b9035
voipmonitor requested a review from mgoin 34 days ago
voipmonitor requested a review from pavanimajety 34 days ago
gemini-code-assist commented on 2026-04-04
update logic
52f86572
robertgshaw2-redhat added the ready-run-all-tests label
robertgshaw2-redhat added the ready label
Merge branch 'main' into fix-shared-experts-overlap
12ab5e09
robertgshaw2-redhat changed the title from "[MoE] Fix 6-8% decode regression: prefer multi-stream shared expert overlap" to "[Bugfix][MoE] Fix 6-8% decode regression: prefer multi-stream shared expert overlap" 34 days ago
robertgshaw2-redhat enabled auto-merge (squash) 34 days ago
mergify added the bug label
robertgshaw2-redhat approved these changes on 2026-04-05
robertgshaw2-redhat merged 228023b3 into main 33 days ago
Reviewers
robertgshaw2-redhat
gemini-code-assist
mgoin
pavanimajety
Assignees
No one assigned
Labels
bug
ready
ready-run-all-tests
Milestone
No milestone