DeepSpeed
b7e1010b
- Missing strided copy for gated MLP (#3788)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
Missing strided copy for gated MLP (#3788) Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com> Co-authored-by: Jeff Rasley <jerasley@microsoft.com> Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
References
#3788 - Llama Tensor Parallel Accuracy Fix
Author
cmikeh2
Parents
b4a2c0af
Loading