yiliu30
changed the title [BugFix] CompressedTensors: set _k_scale_float/_v_scale_float for KV cache quantization [CT] set `_k_scale_float`/`_v_scale_float` for KV cache quantization38 days ago
yiliu30
changed the title [CT] set `_k_scale_float`/`_v_scale_float` for KV cache quantization [CT] Fix KV cache scale handling38 days ago
Merge branch 'main' into fix/compressed-tensors-kv-cache-scale-float
6fcc0c3b
add claude back
746f19e7
yiliu30
changed the title [CT] Fix KV cache scale handling [Bugfix][CT] Fix KV cache scale handling37 days ago
Login to write a write a comment.
Login via GitHub