DeepSpeed
0ad7c7d3
- Fix grad norm scaling
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 years ago
Fix grad norm scaling
References
#1801 - bf16+pipeline parallelism
Author
tjruwase
Parents
f4977024
Loading