DeepSpeed
d7f95869 - grad reduce for extra large param

Commit
2 years ago