Fix unbalanced gradients bug in ZeRO-2 gradient accumulation #545
Use zero-tensors for missing gradients to avoid size mismatch
0bb0cc80
Unit test for unbalanced gradients in ZeRO
e1ff8bab
Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
6a7829a1
jeffra
approved these changes
on 2020-11-20
Formatting fixes
ce5d9074
tjruwase
merged
0178e6cc
into master 5 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub