vllm
[Kernels] MoE refactor
#19636
Merged

[Kernels] MoE refactor #19636

bnellnm
github-actions
gemini-code-assist
gemini-code-assist commented on 2025-06-14
gemini-code-assist
gemini-code-assist commented on 2025-06-14
bnellnm bnellnm force pushed 328 days ago
mergify mergify added ci/build
bnellnm bnellnm force pushed to 6b4e4060 326 days ago
bnellnm bnellnm changed the title Moe refactor [Kernels] MoE refactor 326 days ago
bnellnm bnellnm marked this pull request as ready for review 326 days ago
bnellnm bnellnm requested a review from tlrmchlsmth tlrmchlsmth 326 days ago
bnellnm bnellnm requested a review from WoosukKwon WoosukKwon 326 days ago
bnellnm bnellnm requested a review from mgoin mgoin 326 days ago
bnellnm bnellnm requested a review from robertgshaw2-redhat robertgshaw2-redhat 326 days ago
mgoin
mgoin commented on 2025-06-19
tlrmchlsmth
tlrmchlsmth commented on 2025-06-24
mergify
mergify mergify added needs-rebase
tlrmchlsmth
tlrmchlsmth commented on 2025-06-24
tlrmchlsmth
tlrmchlsmth commented on 2025-06-24
bnellnm bnellnm force pushed 319 days ago
mergify mergify removed needs-rebase
bnellnm bnellnm force pushed 318 days ago
mergify
mergify mergify added needs-rebase
bnellnm bnellnm force pushed 318 days ago
mergify mergify removed needs-rebase
mergify mergify added performance
bnellnm bnellnm requested a review from mgoin mgoin 317 days ago
bnellnm bnellnm requested a review from tlrmchlsmth tlrmchlsmth 317 days ago
mergify
mergify mergify added needs-rebase
bnellnm bnellnm force pushed 316 days ago
mergify mergify removed needs-rebase
ElizaWszola
ElizaWszola commented on 2025-06-30
ElizaWszola
ElizaWszola commented on 2025-06-30
tlrmchlsmth
tlrmchlsmth commented on 2025-06-30
tlrmchlsmth
tlrmchlsmth commented on 2025-06-30
bnellnm bnellnm force pushed 314 days ago
ElizaWszola
ElizaWszola commented on 2025-07-01
bnellnm bnellnm force pushed 313 days ago
tlrmchlsmth tlrmchlsmth added ready
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-07-01
tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) 313 days ago
disabled auto-merge 313 days ago
Head branch was pushed to by a user without write access
tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) 313 days ago
bnellnm turn try_get_optimal_moe_config into an op so it can be torch.compiled
e8ab05a1
bnellnm lint
e60fc9e8
bnellnm torch.compile tests
515b60e0
bnellnm add tests
b8c64a13
bnellnm add compiler + cudagraph tests
f2916aca
bnellnm tests
9daa8320
bnellnm reduce number of compile/cudagraph tests
d269e476
bnellnm lint
e4a49524
bnellnm fix lint
debd4654
bnellnm replace import that lint removed
26816943
bnellnm fixes
960f8619
bnellnm lint
7fef8211
bnellnm opify at a higher level
3c74170a
bnellnm de-opify deepgemm kernels
43441cd4
bnellnm remove cruft
813b66c7
bnellnm MoE refactoring
010d9047
bnellnm make FusedMoEModularKernel a Leaf
1b0fad3a
bnellnm make FusedMoEModularKernel a Leaf
584de044
bnellnm fix format
c42f7429
bnellnm config stuff + add more tests
8f91f36e
bnellnm fixes
4f521502
bnellnm wip test
2c8ec1d7
bnellnm fix mergea
0d39be3d
bnellnm disable buggy fp8 tests
17097eac
bnellnm fixes
f5973ab3
bnellnm more lint
c8223223
bnellnm more lint
12b1df4c
bnellnm merge
c68fe52d
bnellnm fix merge
af060d4b
bnellnm fix deep gemm test
763f5906
bnellnm add supports_expert_map method + cleanup select_gemm_impl methods
b9c027ac
bnellnm lint
44076185
bnellnm revert random linter changes
e9a66cb1
bnellnm fix comments + lint
762394c4
bnellnm remove some logging
e7973d7b
bnellnm remove unused method
5fc344c5
bnellnm try to fix lint
72097bb9
bnellnm add some asserts to make lint happy
d1b83ba6
bnellnm try again with the linter
74223575
bnellnm review comments + fixes
d1928adb
bnellnm review comments + test fixes
7546a292
bnellnm fix test_mixtral_moe + bump up some tolerances
2061d683
bnellnm remove duplicate test setup code. fix some tests, some still failing
96b08fc3
bnellnm lint
a6e7d47f
bnellnm more lint
149f7b7e
bnellnm fix lint
4b4ae50d
bnellnm more linter fixes
07a2599e
bnellnm appease yapf/isort gods
a26eab4e
bnellnm fix test_deepep_moe.py
fd4ffd8a
bnellnm move deepep_utils -> parallel_utils
455a6ce5
bnellnm fix test_block_fp8.py test
3caa61f0
bnellnm more lint nonsense
bb5d8e99
ElizaWszola Fix incorrect per_act_token
76842252
bnellnm fix merge
f188691a
bnellnm fix lint nonsense
579af67e
bnellnm fix merge
a76d2eff
bnellnm fix test_deepep_moe.py
fd4ffd8a
bnellnm more lint nonsense
bb5d8e99
bnellnm fix merge
f188691a
bnellnm fix deepep ht tests
d466524f
bnellnm some quantization tweaks
d2b66825
bnellnm fix
d81a46bc
bnellnm bump up int8 tolerance a tiny bit
e635a37c
bnellnm fix merge
db33d8fc
disabled auto-merge 313 days ago
Head branch was pushed to by a user without write access
bnellnm bnellnm force pushed to db33d8fc 313 days ago
tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) 313 days ago
bnellnm fix messed up config setup
347a7b75
disabled auto-merge 313 days ago
Head branch was pushed to by a user without write access
bnellnm one more fix
86224d00
vllm-bot vllm-bot merged c1909e7e into main 312 days ago
huydhn
mgoin
minosfuture
huydhn
luccafong
luccafong commented on 2025-07-05
huydhn
bnellnm bnellnm deleted the moe-refactor branch 180 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone