vllm
4adc66f6 - [Bugfix] Allocate less memory in non-batched CUTLASS MoE (#21121)

Commit
156 days ago
[Bugfix] Allocate less memory in non-batched CUTLASS MoE (#21121) Signed-off-by: ElizaWszola <ewszola@redhat.com>
Author
Parents
Loading