vllm
e97f802b
- [FP8][Kernel] Dynamic kv cache scaling factors computation (#11906)
Commit
309 days ago
[FP8][Kernel] Dynamic kv cache scaling factors computation (#11906)

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Co-authored-by: Micah Williamson <micah.williamson@amd.com>
References
#11906 - [FP8][Kernel] Dynamic kv cache scaling factors computation
Author
gshtras
Parents
6e650f56