vllm
Add `moe_align_block_size_no_permute` for small batch size with large num_expert
#30280
Open

Add `moe_align_block_size_no_permute` for small batch size with large num_expert #30280

RunkaiTao wants to merge 4 commits into vllm-project:main from RunkaiTao:feat/unpermute-kernel
RunkaiTao
RunkaiTao unpermute kernel cuda
e4fc343c
RunkaiTao pre-commit
bb5ee065
RunkaiTao disable-expert-map
84b1b202
gemini-code-assist
gemini-code-assist commented on 2025-12-08
RunkaiTao pre-commit
976bf1d1
mergify
mergify mergify added needs-rebase

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone