DeepSpeed
Support loading and saving ZeRO checkpoints with changing DP degree
#240
Merged

tjruwase merged 20 commits into master from olruwase/zero_checkpoints
tjruwase Support saving and loading ZeRO checkpoints on different data
5ef11084
tjruwase Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
133ece66
tjruwase requested a review from jeffra 5 years ago
tjruwase requested a review from samyam 5 years ago
tjruwase Fix formatting
1c683042
tjruwase Merge branch 'master' into olruwase/zero_checkpoints
d97c178b
jeffra Merge branch 'master' into olruwase/zero_checkpoints
fd3505e0
chunyang-wen commented on 2020-05-29
jeffra Merge branch 'master' into olruwase/zero_checkpoints
51dc5e20
tjruwase Support checkpoint with varying GPU count in ZeRO stage 1
e6fda661
tjruwase Merge branch 'olruwase/zero_checkpoints' of github.com:microsoft/Deep…
b19e5b4c
tjruwase Fix formatting
97277364
tjruwase Formatting fixes
44576a8f
tjruwase Update model tests
181e42b1
tjruwase Merge with master
ed5443c1
tjruwase Remove pprint
6e8d90ac
tjruwase Minor fix
f9e06a9e
tjruwase Merge branch 'master' into olruwase/zero_checkpoints
fc31c813
samyam commented on 2020-07-13
samyam commented on 2020-07-13
samyam approved these changes on 2020-07-13
tjruwase Merge with master
939324d0
tjruwase Merge branch 'olruwase/zero_checkpoints' of github.com:microsoft/Deep…
4f5b9848
tjruwase Fix formatting
1d6dd2ec
tjruwase Update model tests
ca410361
tjruwase Merge branch 'master' into olruwase/zero_checkpoints
47832f9c
tjruwase merged 7ccc9daf into master 5 years ago
jeffra deleted the olruwase/zero_checkpoints branch 4 years ago
