DeepSpeed
Enable universal checkpoint for zero stage 1
#4516
Merged

tjruwase merged 14 commits into master from olruwase/ds_2921
tjruwase
tjruwase Enable uni_ckpt for z1
2a60f793
tjruwase requested a review from mrwyattii 1 year ago
tjruwase requested a review from jeffra 1 year ago
tjruwase requested a review from samyam 1 year ago
tjruwase requested a review from ShadenSmith 1 year ago
tjruwase requested a review from duli2012 1 year ago
tjruwase requested a review from RezaYazdaniAminabadi 1 year ago
tjruwase requested a review from awan-10 1 year ago
tjruwase requested a review from cmikeh2 1 year ago
tjruwase requested a review from arashb 1 year ago
tjruwase Merge branch 'master' into olruwase/ds_2921
3dc989ec
tjruwase removed review request from arashb 1 year ago
tjruwase removed review request from ShadenSmith 1 year ago
tjruwase removed review request from cmikeh2 1 year ago
tjruwase removed review request from duli2012 1 year ago
tjruwase removed review request from samyam 1 year ago
tjruwase removed review request from awan-10 1 year ago
tjruwase removed review request from RezaYazdaniAminabadi 1 year ago
tjruwase
tjruwase changed the title from "Enable uni_ckpt for z1" to "Enable universal checkpoint for zero stage 1" 1 year ago
stas00
tjruwase Remove logging fix to separate PR. Relocate conversion script to avoi…
b13006bc
tjruwase Formatting fix
64d8c0d8
tjruwase Merge branch 'master' into olruwase/ds_2921
f21a5de5
mosheisland commented on 2023-10-17
mosheisland commented on 2023-10-17
mosheisland commented on 2023-10-17
mosheisland commented on 2023-10-17
tjruwase PR feedback
f5c6b2dd
tjruwase Merge branch 'olruwase/ds_2921' of github.com:microsoft/DeepSpeed int…
51b3af87
mrwyattii approved these changes on 2023-10-17
tjruwase Handle replicated params
d737cbc7
tjruwase Merge branch 'master' into olruwase/ds_2921
3b9a3845
tjruwase Detect bf16_optimizer
d1cefd61
tjruwase Merge branch 'master' into olruwase/ds_2921
507fee84
tjruwase Docs
f25ff5b1
tjruwase Merge branch 'master' of github.com:microsoft/DeepSpeed into olruwase…
f638d922
tjruwase Fix docs
a1c41e02
tjruwase enabled auto-merge 1 year ago
tjruwase merged 8fdd9b35 into master 1 year ago
rgtjf commented on 2024-03-14
