DeepSpeed
Fix gradient accumulation for Z2+offload
#6550
Merged

tohtana
fix gradient accumulation for z2+offload
18489835
tohtana requested a review from tjruwase 1 year ago
tjruwase
tjruwase approved these changes on 2024-09-18
tjruwase Merge branch 'master' into tohtana/fix_grad_acc_z2_offload
3a37b030
loadams Merge branch 'master' into tohtana/fix_grad_acc_z2_offload
3ee184cb
tjruwase merged c85c8703 into master 1 year ago
zwhe99
