vllm
5780121c
- [Perf] Add swap_ab to SM90 FP8 non-block CUTLASS moe grouped gemm (#20911)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
220 days ago
[Perf] Add swap_ab to SM90 FP8 non-block CUTLASS moe grouped gemm (#20911) Signed-off-by: Shixian Cui <shixian@amazon.com> Co-authored-by: Shixian Cui <shixian@amazon.com>
References
#20911 - [Perf] Add swap_ab to SM90 FP8 non-block CUTLASS moe grouped gemm
Author
shixianc
Parents
c7d8724e
Loading