vllm
[Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization
#14545
Merged

[Core] Refactor `QKVCrossParallelLinear` implementation to support BNB 4-bit quantization #14545

vllm-bot merged 12 commits into vllm-project:main from Isotr0py:refactor-x-qkv
Isotr0py
Isotr0py revert mllama and init x-qkv refactor
73ec79dc
Isotr0py fix
288b3a97
Isotr0py it just work for bnb
8ef9fc1a
Isotr0py refactor
4faab844
Isotr0py add doc string
2073544d
Isotr0py Merge branch 'vllm-project:main' into refactor-x-qkv
9e8ae319
github-actions
Isotr0py Isotr0py requested a review from mgoin mgoin 1 year ago
Isotr0py Isotr0py requested a review from jeejeelee jeejeelee 1 year ago
NickLucche
NickLucche requested changes on 2025-03-10
Isotr0py make mypy happy
5f8b1cea
Isotr0py
Isotr0py fix typo
9772f061
NickLucche
Isotr0py lints
3b75f931
jeejeelee
jeejeelee commented on 2025-03-10
Isotr0py add extra_repr
a4793431
Isotr0py add mllama bnb test
e5feabe6
Isotr0py Isotr0py requested a review from DarkLight1337 DarkLight1337 1 year ago
Isotr0py Isotr0py requested a review from ywang96 ywang96 1 year ago
Isotr0py fix bias attrs
06226853
jeejeelee
jeejeelee approved these changes on 2025-03-11
DarkLight1337 DarkLight1337 enabled auto-merge (squash) 1 year ago
github-actions github-actions added ready
vllm-bot vllm-bot merged e392d858 into main 1 year ago
Isotr0py Isotr0py deleted the refactor-x-qkv branch 1 year ago
gshtras
Isotr0py

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone