DeepSpeed
d10b8ca0 - add sharded checkpoint loading for AutoTP path to reduce the peak mem… (#3102)

Commit
2 years ago
add sharded checkpoint loading for AutoTP path to reduce the peak mem… (#3102) * add sharded checkpoint loading for AutoTP path to reduce the peak memory in initialization stage Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * fix gptj sharded checkpoint loading problem Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> --------- Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Author
Parents
Loading