DeepSpeed
Automatic tensor parallelism v2
#2670
Merged
