vllm
fe56180c
- [MoE] More balanced expert sharding (#21497)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
141 days ago
[MoE] More balanced expert sharding (#21497) Signed-off-by: Woosuk Kwon <woosuk@thinkingmachines.ai>
References
#21497 - [MoE] More balanced expert sharding
Author
WoosukKwon
Parents
07d80d7b
Loading