DeepSpeed
Reduce CPU memory overhead during ZeRO checkpoint loading
#1524
Merged

Reduce CPU memory overhead during ZeRO checkpoint loading #1524

jeffra
jeffra formatting and comments
ab2a013e
jeffra add zero_elastic_checkpoint=true path
2789cbe8
jeffra jeffra requested a review from awan-10 awan-10 4 years ago
jeffra jeffra requested a review from cli99 cli99 4 years ago
jeffra jeffra requested a review from conglongli conglongli 4 years ago
jeffra jeffra requested a review from eltonzheng eltonzheng 4 years ago
jeffra jeffra requested a review from minjiaz minjiaz 4 years ago
jeffra jeffra requested a review from niumanar niumanar 4 years ago
jeffra jeffra requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 4 years ago
jeffra jeffra requested a review from samyam samyam 4 years ago
jeffra jeffra requested a review from ShadenSmith ShadenSmith 4 years ago
jeffra jeffra requested a review from tjruwase tjruwase 4 years ago
jeffra jeffra changed the title Reduce cpu memory requirement on zero checkpoint loading Reduce CPU memory overhead during ZeRO checkpoint loading 4 years ago
jeffra jeffra merged 165739a5 into jeffra/engine-xthru-v2 4 years ago
jeffra jeffra deleted the jeffra/z2-ckpt-issue branch 4 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone