DeepSpeed
Set tp world size to 1 in ckpt load, if MPU is not provided
#5243
Merged

Loading