vllm
5c3bae1a
- [Fix] Remove divisibility requirement between num_kv_heads and tp_size in bailing_moe (#26876)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
197 days ago
[Fix] Remove divisibility requirement between num_kv_heads and tp_size in bailing_moe (#26876) Signed-off-by: vito.yy <vito.yy@antgroup.com>
References
#26876 - [Fix] Remove divisibility requirement between num_kv_heads and tp_size in bailing_moe
Author
ant-yy
Parents
5210dc39
Loading