DeepSpeed
145c3a75 - Fix missing scale attributes for GPTJ (#3256)

Commit
2 years ago
Fix missing scale attributes for GPTJ (#3256) Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Author
Parents
Loading