DeepSpeed
429dfa6c
- Handle Norm allreduce when no mp (#1021)
Commit
4 years ago
Handle Norm allreduce when no mp (#1021)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
References
#1021 - ZeRO3: Gradient norm allreduce for DP
Author
tjruwase
Parents
dad26428