vllm
[Bugfix] Fix `kv_cache_dtype=fp8` without scales for FP8 checkpoints
#6761
Merged

simon-mo merged 3 commits into main from fix-unloaded-fp8-kv-scales
mgoin
mgoin Fix fp8 kv cache without scales
452bfeb5
github-actions
mgoin changed the title from "[Bugfix] Fix fp8 kv cache without scales" to "[Bugfix] Fix `kv_cache_dtype=fp8` without scales for FP8 checkpoints" 1 year ago
mgoin added the ready label
mgoin Add smoke test
fe9648b0
mgoin Format
eda606ed
robertgshaw2-redhat
robertgshaw2-redhat approved these changes on 2024-07-25
mgoin enabled auto-merge (squash) 1 year ago
comaniac
comaniac approved these changes on 2024-07-25
disabled auto-merge 1 year ago
Manually disabled by user
simon-mo merged 65b1f121 into main 1 year ago
simon-mo deleted the fix-unloaded-fp8-kv-scales branch 1 year ago
