DeepSpeed
[fp16] lower `initial_scale_power` to `16`
#2663
Merged
