DeepSpeed
Refactor dist tests: Checkpointing
#2202
Merged

mrwyattii merged 24 commits into master from olruwase/refactor_dist_ci
tjruwase Refactor dist tests: Checkpointing
e53a8c98
tjruwase requested a review from jeffra 3 years ago
tjruwase requested a review from mrwyattii 3 years ago
tjruwase requested a review from samyam 3 years ago
tjruwase requested a review from ShadenSmith 3 years ago
tjruwase requested a review from conglongli 3 years ago
tjruwase requested a review from awan-10 3 years ago
tjruwase requested a review from cli99 3 years ago
tjruwase requested a review from eltonzheng 3 years ago
tjruwase requested a review from minjiaz 3 years ago
tjruwase requested a review from RezaYazdaniAminabadi 3 years ago
tjruwase requested a review from duli2012 3 years ago
tjruwase requested a review from yaozhewei 3 years ago
tjruwase requested a review from arashb 3 years ago
tjruwase requested a review from xiaoxiawu-microsoft 3 years ago
tjruwase requested a review from samadejacobs 3 years ago
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
30177d1c
mrwyattii requested changes on 2022-08-09
tjruwase Remove local functions
477a7b81
tjruwase ds.init() with config_dict
66cb4c51
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
274fbb68
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
ce389623
tjruwase Hardcode to simplify
eb2bee7e
tjruwase Merge branch 'olruwase/refactor_dist_ci' of github.com:microsoft/Deep…
a3b0be7c
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
8a64bd98
mrwyattii Merge branch 'master' into olruwase/refactor_dist_ci
2947970a
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
4af09feb
tjruwase Format fixes
4aab5d1b
tjruwase Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
45aafd45
tjruwase Try avoiding race
8dba9dbf
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
d2da10e8
tjruwase Barrier for checkpoint saves
3be389b8
tjruwase Merge branch 'olruwase/refactor_dist_ci' of github.com:microsoft/Deep…
94a8d414
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
968bd755
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
ecb8abb5
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
b139325c
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
b80cacf3
mrwyattii approved these changes on 2022-08-18
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
7876e5a5
tjruwase Merge branch 'master' into olruwase/refactor_dist_ci
4a00f620
mrwyattii Merge branch 'master' into olruwase/refactor_dist_ci
808d4284
mrwyattii merged 217338be into master 3 years ago
mrwyattii deleted the olruwase/refactor_dist_ci branch 2 years ago
