vllm
e392d858 - [Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization (#14545)

Commit
362 days ago
[Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization (#14545)

Signed-off-by: Isotr0py <2037008807@qq.com>