DeepSpeed 746c0ba3 - Enable contiguous gradients with Z1+MoE
Commit
3 years ago
Enable contiguous gradients with Z1+MoE

MoE training with ZeRO stage 1 only works with `contiguous_gradients=True`.
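For illustration, a minimal sketch of a DeepSpeed config that satisfies the constraint described in this commit message. The helper name `make_zero1_moe_config` and the batch size are hypothetical and not part of the commit; `zero_optimization`, `stage`, and `contiguous_gradients` are the standard DeepSpeed config keys. This is an assumed usage pattern, not code taken from the change itself.

```python
import json


def make_zero1_moe_config(train_batch_size=8):
    """Build a config dict for ZeRO stage 1 with contiguous gradients on.

    Hypothetical helper for illustration only; the batch size is an
    arbitrary placeholder value.
    """
    return {
        "train_batch_size": train_batch_size,
        "zero_optimization": {
            "stage": 1,
            # Per the commit message, MoE training with ZeRO stage 1
            # only works when this flag is enabled.
            "contiguous_gradients": True,
        },
    }


config = make_zero1_moe_config()
print(json.dumps(config, indent=2))
```

The resulting dict could be written to a JSON file or passed as the config when initializing DeepSpeed; the MoE model setup itself is outside the scope of this sketch.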
References
#2250 - Enable contiguous gradients with Z1+MoE
Author
siddharth9820
Parents
86164c48