vllm
5c3bae1a - [Fix] Remove divisibility requirement between num_kv_heads and tp_size in bailing_moe (#26876)

Commit
197 days ago
[Fix] Remove divisibility requirement between num_kv_heads and tp_size in bailing_moe (#26876) Signed-off-by: vito.yy <vito.yy@antgroup.com>
Author
Parents
Loading