DeepSpeed
b31b46c0 - fix regression in shard checkpoint loading in AutoTP Path caused by qkv_copy() is deleted and add UT case for shard checkpoint loading in AutoTP (#3457)

Commit
2 years ago
fix regression in shard checkpoint loading in AutoTP Path caused by qkv_copy() is deleted and add UT case for shard checkpoint loading in AutoTP (#3457) * add UT case for shard checkpoint loading in AutoTP Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> * autoTP path also support shard loading Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> --------- Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Author
Parents
Loading