vllm
97e3dda8 - [Perf] SM100 - add swap AB optimization to CUTLASS FP8 GEMM (#27284)

Commit
66 days ago
[Perf] SM100 - add swap AB optimization to CUTLASS FP8 GEMM (#27284) Signed-off-by: Faqin Zhong <faqin.zhong@gmail.com> Co-authored-by: Faqin Zhong <zhofaqin@amazon.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>
Author
Parents
Loading