Megatron-DeepSpeed
fix `add_scalar` for pt<1.9
#240
Merged

Loading