vllm
[MoE Refactor] Split `invoke_fused_moe_kernel`
#31050
Merged

zyongye
zyongye requested a review from mgoin 41 days ago
zyongye requested a review from pavanimajety 41 days ago
ywang96 added ready
chatgpt-codex-connector commented on 2025-12-20
mergify
gemini-code-assist commented on 2025-12-20
mergify
zyongye changed the title [MoE Refactor]Use Modular Kernels for triton bf16 experts [MoE Refactor] Split `invoke_fused_moe_kernel` 41 days ago
mergify
mergify added needs-rebase
zyongye change top level interface to mk
60279d27
zyongye different kernel in different functions
f13eb40d
zyongye pre-commit
fb567f60
zyongye update triton experts
4f69e85b
zyongye force pushed from 0c14373a to 4f69e85b 37 days ago
mergify removed needs-rebase
jinzhen-lin commented on 2025-12-26
zyongye change wna16 kernel name
4a22f0ab
zyongye add notes
f3abab6c
jinzhen-lin
zyongye change dropped sm version
ff4af425
zyongye force pushed from 0f6a153c to ff4af425 29 days ago
zyongye remote debug
d206c471
zyongye force pushed from 986ce164 to d206c471 29 days ago
robertgshaw2-redhat enabled auto-merge (squash) 28 days ago
robertgshaw2-redhat approved these changes on 2026-01-01
robertgshaw2-redhat
robertgshaw2-redhat Merge branch 'main' into bf16_triton_refactor
0e1e56ae
zyongye Merge branch 'main' into bf16_triton_refactor
958dbd53
mgoin approved these changes on 2026-01-02
vllm-bot merged 5a468ff7 into main 27 days ago

