vllm
a3a73ab0
- [Misc] Load FP8 kv-cache scaling factors from checkpoints (#4893)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
[Misc] Load FP8 kv-cache scaling factors from checkpoints (#4893) The 2nd PR for #4532. This PR supports loading FP8 kv-cache scaling factors from a FP8 checkpoint (with .kv_scale parameter).
References
#4893 - [Misc] Load FP8 kv-cache scaling factors from checkpoints
Author
comaniac
Parents
8674f988
Loading