vllm
permute/unpermute kernel for moe optimization
#14568
Merged

permute/unpermute kernel for moe optimization #14568

simon-mo merged 13 commits into vllm-project:main from CalebDu:caleb_dev
CalebDu
CalebDu CalebDu requested a review from tlrmchlsmth tlrmchlsmth 1 year ago
CalebDu CalebDu requested a review from WoosukKwon WoosukKwon 1 year ago
github-actions
mergify mergify added ci/build
gzy19990617
CalebDu
gzy19990617
CalebDu CalebDu force pushed 1 year ago
CalebDu
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
CalebDu
CalebDu CalebDu force pushed 1 year ago
CalebDu
bnellnm
CalebDu
CalebDu
bnellnm
mergify
mergify mergify added needs-rebase
CalebDu CalebDu force pushed 1 year ago
mergify mergify removed needs-rebase
CalebDu
bnellnm
CalebDu
bnellnm
CalebDu
bnellnm
CalebDu
bnellnm
bnellnm commented on 2025-04-17
mergify
mergify mergify added needs-rebase
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
bnellnm commented on 2025-04-17
bnellnm
CalebDu
CalebDu CalebDu force pushed 1 year ago
mergify mergify removed needs-rebase
CalebDu CalebDu force pushed 1 year ago
CalebDu
bnellnm
CalebDu
CalebDu CalebDu force pushed 1 year ago
bnellnm
bnellnm commented on 2025-04-28
bnellnm
bnellnm approved these changes on 2025-04-28
mergify
mergify mergify added needs-rebase
CalebDu CalebDu force pushed 1 year ago
mergify mergify removed needs-rebase
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
CalebDu CalebDu force pushed 1 year ago
tlrmchlsmth
tlrmchlsmth commented on 2025-04-22
tlrmchlsmth
tlrmchlsmth commented on 2025-05-02
tlrmchlsmth tlrmchlsmth added ready
tlrmchlsmth
tlrmchlsmth commented on 2025-05-02
CalebDu implement moe permute kernel
65e8abe0
CalebDu implement moe unpermute
45884cf0
CalebDu update code and fix pre-commit
b1fd42bc
CalebDu add expert_map support for moe permute
1abc4c23
CalebDu add expert_map support for moe unpermute
4f29dc0f
CalebDu fix mismatch and add more test case
42507df4
CalebDu support align_block_size for contiguous group gemm in deepgemm
6960faaa
CalebDu for each local valid expert,fill padding row with expert_id in `m_ind…
88168126
CalebDu add fill_invalid_expert to workaround deepgemm unsupport -1 in m_indices
d453aa7f
CalebDu update code according bnellnm's comment:
3db8defb
CalebDu remove arch limit in cmake and add return `token_expert_indices` in a…
c7e58308
CalebDu fix pre-commit failed
840cd419
CalebDu CalebDu force pushed 1 year ago
CalebDu 1. fix call FusedMoE.select_experts failed
b29bacff
CalebDu CalebDu force pushed to b29bacff 1 year ago
CalebDu
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-05-02
simon-mo simon-mo merged 3e887d2e into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone