vllm
31d5c179
- [Perf][fp8] Use CustomOp abstraction for fp8 quant for better perf (#19830)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
288 days ago
[Perf][fp8] Use CustomOp abstraction for fp8 quant for better perf (#19830) Signed-off-by: Luka Govedic <lgovedic@redhat.com> Co-authored-by: mgoin <mgoin64@gmail.com>
References
#19830 - [Perf][fp8] Use CustomOp abstraction for fp8 quant for better perf
Author
ProExpertProg
Parents
35514b68
Loading