vllm
[Kernels] MoE refactor
#19636
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
72
Changes
View On
GitHub
[Kernels] MoE refactor
#19636
vllm-bot
merged 72 commits into
vllm-project:main
from
neuralmagic:moe-refactor
gemini-code-assist
commented on 2025-06-14
gemini-code-assist
commented on 2025-06-14
bnellnm
force pushed
328 days ago
mergify
added
ci/build
bnellnm
force pushed
to
6b4e4060
326 days ago
bnellnm
changed the title
Moe refactor
[Kernels] MoE refactor
326 days ago
bnellnm
marked this pull request as ready for review
326 days ago
bnellnm
requested a review
from
tlrmchlsmth
326 days ago
bnellnm
requested a review
from
WoosukKwon
326 days ago
bnellnm
requested a review
from
mgoin
326 days ago
bnellnm
requested a review
from
robertgshaw2-redhat
326 days ago
mgoin
commented on 2025-06-19
tlrmchlsmth
commented on 2025-06-24
mergify
added
needs-rebase
tlrmchlsmth
commented on 2025-06-24
tlrmchlsmth
commented on 2025-06-24
bnellnm
force pushed
319 days ago
mergify
removed
needs-rebase
bnellnm
force pushed
318 days ago
mergify
added
needs-rebase
bnellnm
force pushed
318 days ago
mergify
removed
needs-rebase
mergify
added
performance
bnellnm
requested a review
from
mgoin
317 days ago
bnellnm
requested a review
from
tlrmchlsmth
317 days ago
mergify
added
needs-rebase
bnellnm
force pushed
316 days ago
mergify
removed
needs-rebase
ElizaWszola
commented on 2025-06-30
ElizaWszola
commented on 2025-06-30
tlrmchlsmth
commented on 2025-06-30
tlrmchlsmth
commented on 2025-06-30
bnellnm
force pushed
314 days ago
ElizaWszola
commented on 2025-07-01
bnellnm
force pushed
313 days ago
tlrmchlsmth
added
ready
tlrmchlsmth
approved these changes on 2025-07-01
tlrmchlsmth
enabled auto-merge (squash)
313 days ago
disabled auto-merge
313 days ago
Head branch was pushed to by a user without write access
tlrmchlsmth
enabled auto-merge (squash)
313 days ago
turn try_get_optimal_moe_config into an op so it can be torch.compiled
e8ab05a1
lint
e60fc9e8
torch.compile tests
515b60e0
add tests
b8c64a13
add compiler + cudagraph tests
f2916aca
tests
9daa8320
reduce number of compile/cudagraph tests
d269e476
lint
e4a49524
fix lint
debd4654
replace import that lint removed
26816943
fixes
960f8619
lint
7fef8211
opify at a higher level
3c74170a
de-opify deepgemm kernels
43441cd4
remove cruft
813b66c7
MoE refactoring
010d9047
make FusedMoEModularKernel a Leaf
1b0fad3a
make FusedMoEModularKernel a Leaf
584de044
fix format
c42f7429
config stuff + add more tests
8f91f36e
fixes
4f521502
wip test
2c8ec1d7
fix mergea
0d39be3d
disable buggy fp8 tests
17097eac
fixes
f5973ab3
more lint
c8223223
more lint
12b1df4c
merge
c68fe52d
fix merge
af060d4b
fix deep gemm test
763f5906
add supports_expert_map method + cleanup select_gemm_impl methods
b9c027ac
lint
44076185
revert random linter changes
e9a66cb1
fix comments + lint
762394c4
remove some logging
e7973d7b
remove unused method
5fc344c5
try to fix lint
72097bb9
add some asserts to make lint happy
d1b83ba6
try again with the linter
74223575
review comments + fixes
d1928adb
review comments + test fixes
7546a292
fix test_mixtral_moe + bump up some tolerances
2061d683
remove duplicate test setup code. fix some tests, some still failing
96b08fc3
lint
a6e7d47f
more lint
149f7b7e
fix lint
4b4ae50d
more linter fixes
07a2599e
appease yapf/isort gods
a26eab4e
fix test_deepep_moe.py
fd4ffd8a
move deepep_utils -> parallel_utils
455a6ce5
fix test_block_fp8.py test
3caa61f0
more lint nonsense
bb5d8e99
Fix incorrect per_act_token
76842252
fix merge
f188691a
fix lint nonsense
579af67e
fix merge
a76d2eff
fix test_deepep_moe.py
fd4ffd8a
more lint nonsense
bb5d8e99
fix merge
f188691a
fix deepep ht tests
d466524f
some quantization tweaks
d2b66825
fix
d81a46bc
bump up int8 tolerance a tiny bit
e635a37c
fix merge
db33d8fc
disabled auto-merge
313 days ago
Head branch was pushed to by a user without write access
bnellnm
force pushed
to
db33d8fc
313 days ago
tlrmchlsmth
enabled auto-merge (squash)
313 days ago
fix messed up config setup
347a7b75
disabled auto-merge
313 days ago
Head branch was pushed to by a user without write access
one more fix
86224d00
vllm-bot
merged
c1909e7e
into main
312 days ago
luccafong
commented on 2025-07-05
bnellnm
deleted the moe-refactor branch
180 days ago
Login to write a write a comment.
Login via GitHub
Reviewers
tlrmchlsmth
luccafong
mgoin
ElizaWszola
gemini-code-assist
WoosukKwon
robertgshaw2-redhat
Assignees
No one assigned
Labels
performance
ready
ci/build
Milestone
No milestone
Login to write a write a comment.
Login via GitHub