DeepSpeed
636e842e
- cpu_offload enables overlap_comm and contiguous_gradients
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
5 years ago
cpu_offload enables overlap_comm and contiguous_gradients Remove non-portable tensor.mul_()
References
#366 - ZeRO-Offload passing model functionality tests
Author
tjruwase
Parents
9ba232aa
Loading