vllm
97e3dda8
- [Perf] SM100 - add swap AB optimization to CUTLASS FP8 GEMM (#27284)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
66 days ago
[Perf] SM100 - add swap AB optimization to CUTLASS FP8 GEMM (#27284) Signed-off-by: Faqin Zhong <faqin.zhong@gmail.com> Co-authored-by: Faqin Zhong <zhofaqin@amazon.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>
References
#27284 - [Perf] SM100 - add swap AB optimization to CUTLASS FP8 GEMM
Author
LyrisZhong
Parents
5a0a6dfd
Loading