vllm
[Misc][Quark] Upstream Quark format to VLLM
#10765
Merged
mgoin merged 20 commits into vllm-project:main from kewang-xlnx:kewang/quark_upstream
DarkLight1337 requested a review from mgoin 1 year ago
mergify added needs-rebase
kewang-xlnx force pushed 1 year ago
mergify removed needs-rebase
kewang-xlnx force pushed 1 year ago
kewang-xlnx force pushed 1 year ago
kewang-xlnx force pushed 1 year ago
kewang-xlnx changed the title from "[feature]:upstream quark format to vllm" to "[Misc][Quark] Upstream Quark format to VLLM" 1 year ago
mgoin commented on 2024-12-16
mergify added needs-rebase
kewang-xlnx force pushed 1 year ago
mergify removed needs-rebase
kewang-xlnx force pushed 1 year ago
mergify added needs-rebase
kewang-xlnx force pushed 1 year ago
mergify removed needs-rebase
kewang-xlnx force pushed 1 year ago
kewang-xlnx force pushed 1 year ago
kewang-xlnx force pushed to 58b9221b 1 year ago
mgoin requested a review from mgoin 364 days ago
mgoin approved these changes on 2024-12-24
mgoin added ready
kewang-xlnx force pushed from 58b9221b 351 days ago
kewang-xlnx requested a review from robertgshaw2-redhat 342 days ago
[AMD] support Quark quantized format (5016a297)
[Model] add param_name remap in experts in dbrx model (90250831)
[Model] support quark kv cache format in dbrx (05cf03d6)
add kv cache remap method for quark format (b6593cc0)
[AMD][Quark] Fix fails in pr checks (68ab9b95)
[AMD][Quark] replace get_compressed_tensors_cache_scale with get_cach… (ea86886c)
[AMD][Quark] remove quark dependency (3488f13e)
[AMD][Quark] fix mypy error (ccd214da)
auto fix yapf (5172a429)
fix mypy error (33dd3738)
fix code format (a86314fa)
Update vllm/model_executor/layers/quantization/compressed_tensors/tri… (044041f0)
Modify based on PR comments (1ad3cf8a)
fix mypy error (975c7c22)
delete comments of get_cache_scale in compressed_tensors.py (48c6ba7a)
fix mypy error (e2d511be)
change get_compressed_tensors_cache_scale to get_cache_scale in gemma… (1ffab057)
add CI test for quark format (011da4f8)
fix mypy error in CI tests (2c614654)
be consistent with origin vllm (f3d9e58c)
kewang-xlnx force pushed to f3d9e58c 339 days ago
mgoin merged de0526f6 into main 339 days ago
Reviewers: mgoin, robertgshaw2-redhat
Assignees: No one assigned
Labels: ready
Milestone: No milestone