DeepSpeed af512117 - Samyamr/zero offload correctness (#359)
Commit
5 years ago
Samyamr/zero offload correctness (#359)

* fixing gradient accumulation for zero offload
* Bug fixes. ZeRO Stage 1,2 and Offload all produce the same loss with gradient accumulation step of 2
References
#359 - Samyamr/zero offload correctness
Author
samyam
Parents
504a643b