DeepSpeed
abe2204d - Support fp32 grad clipping and fix max_grad_norm confusion (#232)

Commit
5 years ago
Support fp32 grad clipping and fix max_grad_norm confusion (#232)

* updates to support fp32 grad clipping and disable max_grad_norm