DeepSpeed
00320a9b - Update adam.py (#1278)

Commit
4 years ago
Update adam.py (#1278) Make add operation inplace. Without it momentum decays to zero and training has no effect on corresponding parameters
Author
Parents
Loading