DeepSpeed
efa8aded - Fix the residual add mp scaling for GPTNeoX (#2310)

Commit
3 years ago
Fix the residual add mp scaling for GPTNeoX (#2310)
Author
Parents
Loading