vllm
fe56180c - [MoE] More balanced expert sharding (#21497)

Commit
141 days ago
[MoE] More balanced expert sharding (#21497) Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
Author
Parents
Loading