DeepSpeed
066644d7
- fix the sequence-parallelism for the dense models
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
fix the sequence-parallelism for the dense models
References
#4530 - Fix the sequence-parallelism for the dense model architecture
Author
Reza Yazdani
Parents
12aedac6
Loading