DeepSpeed
Add container load checkpoint error reporting + refactor
#2792
Merged

Add container load checkpoint error reporting + refactor #2792

lekurile merged 4 commits into master from lekurile/meta_tensor_msg
lekurile
lekurile Add container load checkpoint error reporting + refactor
f22b7f0e
lekurile Update assertion message
1f4c95b9
lekurile lekurile marked this pull request as ready for review 3 years ago
lekurile lekurile requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 3 years ago
lekurile lekurile requested a review from jeffra jeffra 3 years ago
lekurile lekurile requested a review from mrwyattii mrwyattii 3 years ago
lekurile lekurile requested a review from awan-10 awan-10 3 years ago
lekurile lekurile requested a review from cmikeh2 cmikeh2 3 years ago
lekurile lekurile requested a review from arashb arashb 3 years ago
awan-10
awan-10 approved these changes on 2023-02-07
lekurile Merge branch 'master' into lekurile/meta_tensor_msg
a9a66dbc
lekurile lekurile enabled auto-merge (squash) 3 years ago
tjruwase Merge branch 'master' into lekurile/meta_tensor_msg
45e4cbbb
lekurile lekurile merged 10f3c301 into master 3 years ago
mrwyattii mrwyattii deleted the lekurile/meta_tensor_msg branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone