DeepSpeed
90793aab
- re-introduce: stage3: efficient compute of scaled_global_grad_norm (#5493)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
re-introduce: stage3: efficient compute of scaled_global_grad_norm (#5493) reverting previous revert of this feature: https://github.com/nelyahu/DeepSpeed/commit/bc48371c5e1fb8fd70fc79285e66201dbb65679b in addition, bug fix for offload mode.
References
#5493 - re-introduce: stage3: efficient compute of scaled_global_grad_norm
Author
nelyahu
Parents
f32ad3e1
Loading