DeepSpeed
Add engine.coalesce_grad_reduction() for ZeRO 1/2/3 multi-backward
#7992
Open

Loading