DeepSpeed
Fix ZeRO3 save_checkpoint
#857
Merged

Fix ZeRO3 save_checkpoint #857

jeffra merged 12 commits into master from olruwase/zero3_save_checkpoint
tjruwase
tjruwase Fix ZeRO3 save_checkpoint
3aea9650
tjruwase tjruwase requested a review from jeffra jeffra 5 years ago
tjruwase tjruwase requested a review from arashashari arashashari 5 years ago
tjruwase tjruwase requested a review from awan-10 awan-10 5 years ago
tjruwase tjruwase requested a review from cli99 cli99 5 years ago
tjruwase tjruwase requested a review from conglongli conglongli 5 years ago
tjruwase tjruwase requested a review from eltonzheng eltonzheng 5 years ago
tjruwase tjruwase requested a review from minjiaz minjiaz 5 years ago
tjruwase tjruwase requested a review from niumanar niumanar 5 years ago
tjruwase tjruwase requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 5 years ago
tjruwase tjruwase requested a review from samyam samyam 5 years ago
tjruwase tjruwase requested a review from ShadenSmith ShadenSmith 5 years ago
jeffra Merge branch 'master' into olruwase/zero3_save_checkpoint
c256197f
stas00
tjruwase Merge branch 'master' into olruwase/zero3_save_checkpoint
69b9aaa6
tjruwase Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
9e4b306e
jeffra Merge branch 'master' into olruwase/zero3_save_checkpoint
74ce362f
jeffra turn checkpoint test back on
bf47165f
tjruwase Merge branch 'master' into olruwase/zero3_save_checkpoint
1809b9a9
jeffra formatting
0371abb6
tjruwase debug prints
9016e20a
tjruwase Remove debug prints; formatting
45f446d0
tjruwase Merge branch 'master' into olruwase/zero3_save_checkpoint
2770cb9a
jeffra
jeffra approved these changes on 2021-03-16
jeffra Merge branch 'master' into olruwase/zero3_save_checkpoint
9d7e2ae3
jeffra jeffra merged fa87a73a into master 5 years ago
mrwyattii mrwyattii deleted the olruwase/zero3_save_checkpoint branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone