DeepSpeed
00320a9b - Update adam.py (#1278)
Commit
4 years ago
Update adam.py (#1278)
Make the add operation in-place. Without it, the momentum decays to zero and training has no effect on the corresponding parameters.
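The failure mode described in the commit message can be sketched in plain Python. This is an illustration only, not the code from adam.py: `Vec` is a hypothetical stand-in for a tensor, and the method names `mul_`, `add`, and `add_` are chosen to mirror the usual convention that a trailing underscore means "modify in place". The constant gradient and the `beta1` value are assumptions made for the demonstration.

```python
class Vec:
    """Hypothetical stand-in for a tensor; not DeepSpeed or PyTorch code."""

    def __init__(self, values):
        self.values = list(values)

    def mul_(self, scalar):
        # In-place scale; returns self so calls can be chained.
        for i in range(len(self.values)):
            self.values[i] *= scalar
        return self

    def add(self, other, alpha=1.0):
        # Out-of-place: returns a NEW Vec and leaves self untouched.
        return Vec(a + alpha * b for a, b in zip(self.values, other.values))

    def add_(self, other, alpha=1.0):
        # In-place: mutates self and returns it.
        for i, b in enumerate(other.values):
            self.values[i] += alpha * b
        return self


def run(steps, in_place, beta1=0.9):
    """Apply a momentum-style update `steps` times and return the final value."""
    exp_avg = Vec([1.0])  # momentum buffer with a non-zero starting value
    grad = Vec([1.0])     # constant gradient, assumed for illustration
    for _ in range(steps):
        if in_place:
            # Decay, then accumulate the gradient into the same buffer.
            exp_avg.mul_(beta1).add_(grad, alpha=1 - beta1)
        else:
            # The result of add() is discarded, so only the decay is kept.
            exp_avg.mul_(beta1).add(grad, alpha=1 - beta1)
    return exp_avg.values[0]


buggy = run(200, in_place=False)  # shrinks by beta1 every step
fixed = run(200, in_place=True)   # keeps tracking the gradient
print(buggy, fixed)
```

With the out-of-place call, the buffer is only ever multiplied by `beta1`, so it shrinks geometrically toward zero and the gradient never reaches it. With the in-place call, the buffer stays close to the gradient value.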
References
#1278 - Fix masked parameters training in 1-bit Adam
Author
DT6A
Parents
adc21a4d