vllm
[Misc][Quark] Upstream Quark format to VLLM
#10765
Merged

[Misc][Quark] Upstream Quark format to VLLM #10765

kewang-xlnx
github-actions
DarkLight1337 DarkLight1337 requested a review from mgoin mgoin 1 year ago
mergify
mergify mergify added needs-rebase
kewang-xlnx kewang-xlnx force pushed 1 year ago
mergify mergify removed needs-rebase
kewang-xlnx kewang-xlnx force pushed 1 year ago
kewang-xlnx kewang-xlnx force pushed 1 year ago
kewang-xlnx kewang-xlnx force pushed 1 year ago
kewang-xlnx kewang-xlnx changed the title [feature]:upstream quark format to vllm [Misc][Quark] Upstream Quark format to VLLM 1 year ago
robertgshaw2-redhat
kewang-xlnx
mgoin
mgoin
mgoin commented on 2024-12-16
mergify
mergify mergify added needs-rebase
kewang-xlnx
kewang-xlnx kewang-xlnx force pushed 1 year ago
mergify mergify removed needs-rebase
kewang-xlnx kewang-xlnx force pushed 1 year ago
mergify
mergify mergify added needs-rebase
kewang-xlnx kewang-xlnx force pushed 1 year ago
mergify mergify removed needs-rebase
kewang-xlnx kewang-xlnx force pushed 1 year ago
kewang-xlnx kewang-xlnx force pushed 1 year ago
kewang-xlnx kewang-xlnx force pushed to 58b9221b 1 year ago
mgoin mgoin requested a review from mgoin mgoin 364 days ago
mgoin
mgoin approved these changes on 2024-12-24
mgoin mgoin added ready
kewang-xlnx kewang-xlnx force pushed from 58b9221b 351 days ago
kewang-xlnx kewang-xlnx requested a review from robertgshaw2-redhat robertgshaw2-redhat 342 days ago
kewang-xlnx [AMD] support Quark quantized format
5016a297
kewang-xlnx [Model] add param_name remap in experts in dbrx model
90250831
kewang-xlnx [Model] support quark kv cache format in dbrx
05cf03d6
kewang-xlnx add kv cache remap method for quark format
b6593cc0
kewang-xlnx [AMD][Quark] Fix fails in pr checks
68ab9b95
[AMD][Quark] replace get_compressed_tensors_cache_scale with get_cach…
ea86886c
kewang-xlnx [AMD][Quark] remove quark dependency
3488f13e
kewang-xlnx [AMD][Quark] fix mypy error
ccd214da
kewang-xlnx auto fix yapf
5172a429
kewang-xlnx fix mypy error
33dd3738
kewang-xlnx fix code format
a86314fa
kewang-xlnx Update vllm/model_executor/layers/quantization/compressed_tensors/tri…
044041f0
kewang-xlnx Modify based on PR comments
1ad3cf8a
kewang-xlnx fix mypy error
975c7c22
kewang-xlnx delete comments of get_cache_scale in compressed_tensors.py
48c6ba7a
kewang-xlnx fix mypy error
e2d511be
kewang-xlnx change get_compressed_tensors_cache_scale to get_cache_scale in gemma…
1ffab057
kewang-xlnx add CI test for quark format
011da4f8
kewang-xlnx fix mypy error in CI tests
2c614654
kewang-xlnx be consistent with origin vllm
f3d9e58c
kewang-xlnx kewang-xlnx force pushed to f3d9e58c 339 days ago
mgoin
mgoin mgoin merged de0526f6 into main 339 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone