DeepSpeed
Fix AutoTP gathering replaced layer params when bias is not None
#7257
Merged

Loading