DeepSpeed
DeepSpeedZeroOptimizer: refactor bit16 flattening to support more accelerators
#4833
Merged

Loading