DeepSpeed
95aee34f - fix for complete_grad_norm_calc in stage3

Commit
1 year ago
fix for complete_grad_norm_calc in stage3 place err tensor on the same device as inf_or_nan
Author
Nadav Elyahu
Parents
Loading