DeepSpeed
Add MLP/lm_head tp grain size setting.
#6828
Merged
