DeepSpeed
18489835
- fix gradient accumulation for z2+offload
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
fix gradient accumulation for z2+offload
References
#6550 - Fix gradient accumulation for Z2+offload
#6554 - Improve consistency of zero_grad
Author
Masahiro Tanaka
Parents
170b46e8
Loading