vllm
[Bugfix] Fix `kv_cache_dtype=fp8` without scales for FP8 checkpoints
#6761
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
3
Changes
View On
GitHub
[Bugfix] Fix `kv_cache_dtype=fp8` without scales for FP8 checkpoints
#6761
simon-mo
merged 3 commits into
main
from
fix-unloaded-fp8-kv-scales
Fix fp8 kv cache without scales
452bfeb5
mgoin
changed the title
[Bugfix] Fix fp8 kv cache without scales
[Bugfix] Fix `kv_cache_dtype=fp8` without scales for FP8 checkpoints
1 year ago
mgoin
added
ready
Add smoke test
fe9648b0
Format
eda606ed
robertgshaw2-redhat
approved these changes on 2024-07-25
mgoin
enabled auto-merge (squash)
1 year ago
comaniac
approved these changes on 2024-07-25
disabled auto-merge
1 year ago
Manually disabled by user
simon-mo
merged
65b1f121
into main
1 year ago
simon-mo
deleted the fix-unloaded-fp8-kv-scales branch
1 year ago
Login to write a write a comment.
Login via GitHub
Reviewers
comaniac
robertgshaw2-redhat
Assignees
No one assigned
Labels
ready
Milestone
No milestone
Login to write a write a comment.
Login via GitHub