DeepSpeed
Fix autotuning so that it records Floating Point Operations per second, not microsecond
#2711
Merged
