DeepSpeed
Fix the MLP output tensor's shape
#2380
Merged
