DeepSpeed
Fix random token-generation issue + MP-checkpoint loading/saving
#2132
Merged

jeffra merged 29 commits into master from ds-inference/bloom-fix
RezaYazdaniAminabadi
Fix random token-generation issue + MP-checkpoint loading/saving
cc0a7db2
RezaYazdaniAminabadi requested a review from jeffra 3 years ago
RezaYazdaniAminabadi requested a review from samyam 3 years ago
RezaYazdaniAminabadi requested a review from tjruwase 3 years ago
RezaYazdaniAminabadi requested a review from ShadenSmith 3 years ago
RezaYazdaniAminabadi requested a review from conglongli 3 years ago
RezaYazdaniAminabadi requested a review from awan-10 3 years ago
RezaYazdaniAminabadi requested a review from cli99 3 years ago
RezaYazdaniAminabadi requested a review from eltonzheng 3 years ago
RezaYazdaniAminabadi requested a review from minjiaz 3 years ago
RezaYazdaniAminabadi requested a review from duli2012 3 years ago
RezaYazdaniAminabadi requested a review from mrwyattii 3 years ago
RezaYazdaniAminabadi requested a review from yaozhewei 3 years ago
RezaYazdaniAminabadi requested a review from arashb 3 years ago
RezaYazdaniAminabadi requested a review from xiaoxiawu-microsoft 3 years ago
RezaYazdaniAminabadi requested a review from samadejacobs 3 years ago
RezaYazdaniAminabadi Merge branch 'master' into ds-inference/bloom-fix
79ba8b9a
small fix
b7085ea9
jeffra Merge branch 'master' into ds-inference/bloom-fix
dc7fa6ea
get the path for saving mp-checkpoints
fa6b6ae8
Merge branch 'ds-inference/bloom-fix' of github.com:microsoft/DeepSpe…
f39c78f9
jeffra Merge branch 'master' into ds-inference/bloom-fix
13b1aa4a
jeffra bug fix + formatting
1ae78968
fix save_checkpoint path
070f0227
Merge branch 'ds-inference/bloom-fix' of github.com:microsoft/DeepSpe…
c70b5293
RezaYazdaniAminabadi Merge branch 'master' into ds-inference/bloom-fix
51c2e7d2
jeffra Merge branch 'master' into ds-inference/bloom-fix
3dcdbe4c
jeffra Merge branch 'master' into ds-inference/bloom-fix
75b33c61
tjruwase Merge branch 'master' into ds-inference/bloom-fix
e8ef9561
jeffra Merge branch 'master' into ds-inference/bloom-fix
4d3c6527
zcrypt0 commented on 2022-07-27
Modify checkpoint saving to include the config json used during loading
d794e6c8
git pushMerge branch 'ds-inference/bloom-fix' of github.com:microsoft…
833f260c
set ckpt_mp_size to world_size by default
557521d5
add missing None
b51c4473
small fix: change None -> 0
01003bf9
RezaYazdaniAminabadi Merge branch 'master' into ds-inference/bloom-fix
6ea98e86
fix indentation
043cd98e
several fixes
bee0b8fc
zcrypt0 commented on 2022-07-28
jeffra support checkpoint as dict or json file
4d5a4ace
jeffra fix for non-bloom models
b8237c79
jeffra add default if parallelization doesn't exist
007551ea
fix the path to save non-tp checkpoint
851a6817
Merge branch 'ds-inference/bloom-fix' of github.com:microsoft/DeepSpe…
a53fe083
jeffra Merge branch 'master' into ds-inference/bloom-fix
b249f276
jeffra approved these changes on 2022-07-29
jeffra merged 556f0051 into master 3 years ago
jeffra deleted the ds-inference/bloom-fix branch 3 years ago