transformers
4f09d0fd - storing & logging gradient norm in trainer (#27326)

Commit
1 year ago
storing & logging gradient norm in trainer (#27326) * report grad_norm during training * support getting grad_norm from deepspeed
Author
Parents
Loading