DeepSpeed
Fix ZeRO-1 checkpointing bug
#529
Merged

Fix ZeRO-1 checkpointing bug #529

tjruwase
tjruwase Make elastic checkpointing optional
8f042f82
tjruwase Merge with staging branch
9f34d1eb
tjruwase Fix incorrect communication interval schema in checkpointing logic
6a4db4bf
tjruwase Merge with saksham-zero1-fixes branch
b0cba776
tjruwase tjruwase requested a review from jeffra jeffra 5 years ago
jeffra
jeffra approved these changes on 2020-11-16
tjruwase tjruwase merged 4f5f83e4 into saksham-zero1-fixes 5 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone