vllm
9fb2d220
- [Performance] Performance improvements in non-blockwise fp8 CUTLASS MoE (#20762)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
276 days ago
[Performance] Performance improvements in non-blockwise fp8 CUTLASS MoE (#20762) Signed-off-by: ElizaWszola <ewszola@redhat.com>
References
#20762 - [Performance] Performance improvements in non-blockwise fp8 CUTLASS MoE
Author
ElizaWszola
Parents
2d6a3820
Loading