vllm
0e71eaa6
- [Feature] AWQ marlin quantization support for fused moe with lora (#30442)
Commit
6 days ago
[Feature] AWQ marlin quantization support for fused moe with lora (#30442)

Signed-off-by: princepride <wangzhipeng628@gmail.com>
References
#30442 - [Feature] AWQ marlin quantization support for fused moe with lora
Author
princepride
Parents
8781cd6b