vllm
[Bugfix] Allocate less memory in non-batched CUTLASS MoE
#21121
Merged

ElizaWszola Allocate less memory in CUTLASS MoE
ba131c36
github-actions
gemini-code-assist commented on 2025-07-17
mgoin commented on 2025-07-17
mgoin added bug
mgoin approved these changes on 2025-07-17
mgoin added ready
ElizaWszola Merge branch 'main' into cutlass-moe-less-mem
1712fc49
DarkLight1337 merged 4adc66f6 into main 247 days ago
