vllm
[Bugfix][CT] Fix KV cache scale handling
#39418
Merged

yiliu30
yiliu30 [BugFix] CompressedTensors: set _k_scale_float/_v_scale_float for KV …
63941d09
yiliu30 requested a review from mgoin 38 days ago
yiliu30 requested a review from robertgshaw2-redhat 38 days ago
yiliu30 requested a review from tlrmchlsmth 38 days ago
yiliu30 requested a review from yewentao256 38 days ago
yiliu30 requested a review from pavanimajety 38 days ago
mergify
mergify added documentation
mergify added bug
yiliu30 changed the title from "[BugFix] CompressedTensors: set _k_scale_float/_v_scale_float for KV cache quantization" to "[CT] set `_k_scale_float`/`_v_scale_float` for KV cache quantization" 38 days ago
yiliu30 changed the title from "[CT] set `_k_scale_float`/`_v_scale_float` for KV cache quantization" to "[CT] Fix KV cache scale handling" 38 days ago
gemini-code-assist
yiliu30 Merge branch 'main' into fix/compressed-tensors-kv-cache-scale-float
6fcc0c3b
yiliu30 add claude back
746f19e7
yiliu30 changed the title from "[CT] Fix KV cache scale handling" to "[Bugfix][CT] Fix KV cache scale handling" 37 days ago
brian-dellabetta
mgoin
mgoin approved these changes on 2026-04-10
mgoin added ready
mgoin added quantization
mgoin
mgoin commented on 2026-04-10
yiliu30 fix
44f84ed6
yiliu30 fix
372e0be1
yiliu30 fix
ed7f74f0
yiliu30 Merge branch 'main' into fix/compressed-tensors-kv-cache-scale-float
ddd289da
hshen14
hshen14 approved these changes on 2026-04-13
yewentao256
yewentao256 approved these changes on 2026-04-13
yewentao256 merged d8ddb316 into main 34 days ago