DeepSpeed
4f950672
- Add fp8-fused gemm kernel (#5764)
Commit
1 year ago
Add fp8-fused gemm kernel (#5764)

This PR adds the new fused kernel for the Dense GeMM using fp8-quantized weight.

---------

Co-authored-by: Jeff Rasley <jeffra45@gmail.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
References
#5764 - Add fp8-fused gemm kernel
Author
sfc-gh-reyazda
Parents
f8039434