DeepSpeed
Spread layers more uniformly when using partition_uniform
#4053
Merged

Loading