DeepSpeed
Trajepl/nebula ckpt engine
#2085
Merged

Trajepl/nebula ckpt engine #2085

trajepl
trajepl Merge pull request #1 from microsoft/master
25e04b38
trajepl enable checkpoint engine
d88e591d
trajepl seprated nebula config
07e59d66
trajepl trajepl requested a review from jeffra jeffra 3 years ago
trajepl trajepl requested a review from samyam samyam 3 years ago
trajepl trajepl requested a review from tjruwase tjruwase 3 years ago
trajepl trajepl requested a review from ShadenSmith ShadenSmith 3 years ago
trajepl trajepl requested a review from conglongli conglongli 3 years ago
trajepl trajepl requested a review from awan-10 awan-10 3 years ago
trajepl trajepl requested a review from cli99 cli99 3 years ago
trajepl trajepl requested a review from eltonzheng eltonzheng 3 years ago
trajepl trajepl requested a review from minjiaz minjiaz 3 years ago
trajepl trajepl requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 3 years ago
trajepl trajepl requested a review from duli2012 duli2012 3 years ago
trajepl trajepl requested a review from mrwyattii mrwyattii 3 years ago
trajepl trajepl requested a review from yaozhewei yaozhewei 3 years ago
trajepl trajepl requested a review from arashb arashb 3 years ago
trajepl trajepl requested a review from xiaoxiawu-microsoft xiaoxiawu-microsoft 3 years ago
trajepl trajepl requested a review from samadejacobs samadejacobs 3 years ago
trajepl
trajepl add __init__.py for nebula importing
4cbdfe6f
trajepl linter fix
1f2f40c3
trajepl fix: ds_config is None
d9001459
trajepl fix: ds config
b44832b0
trajepl fix: get sd loader fix
e4a57bdf
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
85e52f5b
trajepl align the API with torch raw code
d70bcd16
trajepl Merge branch 'trajepl/nebula_ckpt_engine' of github.com:trajepl/DeepS…
4c503081
trajepl linter fix
5d987a08
trajepl remove duplicate tag params
a04a81a8
mrwyattii Merge branch 'master' into trajepl/nebula_ckpt_engine
21b70bd1
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase
tjruwase commented on 2022-07-19
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
81ccd07a
tjruwase
trajepl
tjruwase
trajepl
trajepl make checkpoint_engine as required args
4b42bc23
trajepl Merge branch 'trajepl/nebula_ckpt_engine' of github.com:trajepl/DeepS…
22f8c2ab
trajepl Merge pull request #2 from microsoft/master
bbd2bde2
trajepl Merge branch 'master' of github.com:trajepl/DeepSpeed into trajepl/ne…
d9298cf5
trajepl fix args
19063988
trajepl extract parameters out to config
432e7c67
trajepl fix: load state dict
7dbb6d8a
trajepl separate load engine
e912e31b
trajepl linter fix
7fc279bf
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
5ebacc64
trajepl extract checkpoint engine to abstract calss
c70c8187
trajepl linter fix
e6dd7943
trajepl Merge branch 'trajepl/nebula_ckpt_engine' of github.com:trajepl/DeepS…
3788ada5
trajepl construct function args fix
1efd2ce0
trajepl add docs for dev/customers
dce0fb51
trajepl linter fix
bb5bb7c6
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
0c21dc2c
trajepl remove load engine
3e8c238f
trajepl print->log_dist
a5c88974
trajepl linter fix
44d687b9
trajepl add tag flag to distinguish the loading order
82ad297a
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
cf12a8d8
tjruwase
tjruwase approved these changes on 2022-07-27
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
422221bf
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
340de115
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
7f3f14ce
jeffra Merge branch 'master' into trajepl/nebula_ckpt_engine
5071091b
jeffra
jeffra approved these changes on 2022-07-27
tjruwase Merge branch 'master' into trajepl/nebula_ckpt_engine
1b43df5c
tjruwase tjruwase merged e669aaf5 into master 3 years ago

Login to write a write a comment.

Login via GitHub