vllm
1bfd0971
- optimize cutlass fp8
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
5 days ago
optimize cutlass fp8 Signed-off-by: yewentao256 <zhyanwentao@126.com>
References
#43706 - [Perf] Optimize cutlass fp8 scaled mm bypassing padding, 20% kernel performance improvement
Author
yewentao256
Parents
193ce881
Loading