DeepSpeed
Fix missing scale attributes for GPTJ
#3256
Merged

Loading