DeepSpeed 0178e6cc - Fix unbalanced gradients bug in ZeRO-2 gradient accumulation (#545)
Commit
5 years ago
Fix unbalanced gradients bug in ZeRO-2 gradient accumulation (#545)

* Use zero-tensors for missing gradients to avoid size mismatch
* Unit test for unbalanced gradients in ZeRO
* Formatting fixes
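
The "zero-tensors for missing gradients" idea from the commit message can be illustrated with a minimal sketch. This is not the DeepSpeed implementation: it uses plain Python lists instead of tensors, and the helper name `flatten_grads` and the example sizes are hypothetical. It assumes the size mismatch arises when a parameter has no gradient, so that a flattened gradient buffer comes out shorter than expected.

```python
from typing import List, Optional, Sequence


def flatten_grads(param_sizes: Sequence[int],
                  grads: Sequence[Optional[List[float]]]) -> List[float]:
    """Concatenate per-parameter gradients into one flat buffer.

    Hypothetical helper: a parameter whose gradient is missing (None)
    contributes zeros of its own size, so the buffer length always
    equals sum(param_sizes).
    """
    flat: List[float] = []
    for size, grad in zip(param_sizes, grads):
        if grad is None:
            # Missing gradient: pad with zeros instead of skipping the slot.
            flat.extend([0.0] * size)
        else:
            if len(grad) != size:
                raise ValueError("gradient length does not match parameter size")
            flat.extend(grad)
    return flat


# Three parameters with 2, 3, and 1 elements (illustrative sizes).
sizes = [2, 3, 1]
# In this example the first buffer has no gradient for the second parameter.
buffer_a = flatten_grads(sizes, [[0.1, 0.2], None, [0.5]])
buffer_b = flatten_grads(sizes, [[0.3, 0.4], [1.0, 1.0, 1.0], [0.7]])

# Both buffers have the same length, so they can be combined element-wise.
summed = [a + b for a, b in zip(buffer_a, buffer_b)]
```

Skipping the missing gradient instead would give a buffer of length 3 rather than 6, which is the kind of mismatch the zero padding is meant to avoid.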
References
#545 - Fix unbalanced gradients bug in ZeRO-2 gradient accumulation
Author
tjruwase
Parents
6b28bc5d