vllm
c42ff4f4
- [BugFix][torch.compile] KV scale calculation issues with FP8 quantization (#25513)
Commit
93 days ago
[BugFix][torch.compile] KV scale calculation issues with FP8 quantization (#25513)

Signed-off-by: adabeyta <aabeyta@redhat.com>
References
#25513 - [BugFix][torch.compile] KV scale calculation issues with FP8 quantization (#21640)
Author
adabeyta
Parents
d5ab2851