DeepSpeed
5fe9d610
- Tensor parallelism for Mixture of Experts (#2074)
Commit
3 years ago
Tensor parallelism for Mixture of Experts (#2074)

* tensor parallelism for mixture of experts

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
References
#2074 - Tensor parallelism for Mixture of Experts
Author
siddharth9820
Parents
2210ebe7