DeepSpeed
2e3769a1
- Enable fused_lamb_cuda_kernel on ROCm (#2148)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
2 years ago
Enable fused_lamb_cuda_kernel on ROCm (#2148) Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
References
#2148 - Enable fused_lamb_cuda_kernel on ROCm
Author
rraminen
Parents
e419f7cb
Files
1
csrc/lamb
fused_lamb_cuda_kernel.cu
Loading