DeepSpeed
cce85b89 - API for obtaining global gradient norm (#1292)

Commit
4 years ago
API for obtaining global gradient norm (#1292) * FP16 fused and unfused grad norm query. * API for obtaining global unclipped gradient norm across parameter groups * Use global norm not group norms Co-authored-by: Shaden Smith <shaden.smith@microsoft.com>
Author
Parents
Loading