[Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization #14545
revert mllama and init x-qkv refactor
73ec79dc
fix
288b3a97
it just work for bnb
8ef9fc1a
refactor
4faab844
add doc string
2073544d
Merge branch 'vllm-project:main' into refactor-x-qkv
9e8ae319
make mypy happy
5f8b1cea
fix typo
9772f061
lints
3b75f931
add extra_repr
a4793431
add mllama bnb test
e5feabe6
fix bias attrs
06226853
jeejeelee
approved these changes
on 2025-03-11
vllm-bot
merged
e392d858
into main 1 year ago
Isotr0py
deleted the refactor-x-qkv branch 1 year ago