DeepSpeed
Fix issue with empty grads for non-fused optimizers
#83
Merged

Fix issue with empty grads for non-fused optimizers #83

jeffra merged 4 commits into master from jeffra/empty_grad_fix
jeffra
jeffra guard against empty gradients
1c759d7f
jeffra bug fixes for adamw/lamb and corresponding tests
391900b9
jeffra fix formatting
79ccfa7e
jeffra jeffra requested a review from ShadenSmith ShadenSmith 5 years ago
jeffra jeffra requested a review from samyam samyam 5 years ago
jeffra Merge branch 'master' into jeffra/empty_grad_fix
dce2d331
ShadenSmith
ShadenSmith approved these changes on 2020-02-15
samyam
samyam approved these changes on 2020-02-15
jeffra jeffra merged 807480a0 into master 5 years ago
jeffra jeffra deleted the jeffra/empty_grad_fix branch 5 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone