vllm
2ec401bc
- Load tuned fused_moe_lora shrink and expand kernel configs separately (#27435)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
173 days ago
Load tuned fused_moe_lora shrink and expand kernel configs separately (#27435) Signed-off-by: Yu Gong <yu3.gong@gmail.com> Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
References
#27435 - Load tuned fused_moe_lora shrink and expand kernel configs separately
Author
yugong333
Parents
4022a9d2
Loading