vllm
e392d858
- [Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization (#14545)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
362 days ago
[Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization (#14545) Signed-off-by: Isotr0py <2037008807@qq.com>
References
#14545 - [Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization
Author
Isotr0py
Parents
77a318bd
Loading