vllm
c13434bb
- add fusion of shared expert and fused_moe_gate
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
40 days ago
add fusion of shared expert and fused_moe_gate Signed-off-by: Barbara Suslova <barbara.suslova@axel-t.com>
References
#29497 - Deepseek optimizations [experimental duplicate of https://github.com/vllm-project/vllm/pull/28540]
Author
Red-Caesar
Committer
alexm-redhat
Parents
c7a29d2c
Loading