DeepSpeed
Graceful exit on failures for multi-node runs
#2008
Merged

Graceful exit on failures for multi-node runs #2008

jeffra merged 3 commits into deepspeedai:master from graceful_exit
jerrymannil
jerrymannil jerrymannil requested a review from jeffra jeffra 3 years ago
jerrymannil jerrymannil requested a review from samyam samyam 3 years ago
jerrymannil jerrymannil requested a review from tjruwase tjruwase 3 years ago
jerrymannil jerrymannil requested a review from ShadenSmith ShadenSmith 3 years ago
jerrymannil jerrymannil requested a review from conglongli conglongli 3 years ago
jerrymannil jerrymannil requested a review from awan-10 awan-10 3 years ago
jerrymannil jerrymannil requested a review from cli99 cli99 3 years ago
jerrymannil jerrymannil requested a review from eltonzheng eltonzheng 3 years ago
jerrymannil jerrymannil requested a review from minjiaz minjiaz 3 years ago
jerrymannil jerrymannil requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 3 years ago
jerrymannil jerrymannil requested a review from duli2012 duli2012 3 years ago
jerrymannil jerrymannil requested a review from mrwyattii mrwyattii 3 years ago
jerrymannil jerrymannil requested a review from yaozhewei yaozhewei 3 years ago
jerrymannil jerrymannil requested a review from arashb arashb 3 years ago
jerrymannil jerrymannil requested a review from xiaoxiawu-microsoft xiaoxiawu-microsoft 3 years ago
jerrymannil
jerrymannil jerrymannil requested a review from samadejacobs samadejacobs 3 years ago
jerrymannil
Graceful exit on failures for multi-node runs
e50be0be
jerrymannil
tjruwase Merge branch 'master' into graceful_exit
16a94dcb
tjruwase
tjruwase
tjruwase approved these changes on 2022-07-28
tjruwase Merge branch 'master' into graceful_exit
12122a01
jeffra
jeffra approved these changes on 2022-07-28
jeffra jeffra merged 66d29b0a into master 3 years ago

Login to write a write a comment.

Login via GitHub