vllm
b10f41c8
- [SM100] Enable fp8 compute for prefill MLA (#30746)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
122 days ago
[SM100] Enable fp8 compute for prefill MLA (#30746) Signed-off-by: Pavani Majety <pmajety@nvidia.com>
References
#30746 - [SM100] Enable fp8 compute for prefill MLA
Author
pavanimajety
Parents
7b926e89
Loading