transformers
4ec425ff
- Fix GA loss for Deepspeed (#35808)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
358 days ago
Fix GA loss for Deepspeed (#35808) * Fix GA loss for Deepspeed * Turn off loss scaling in DeepSpeed engine by scale_wrt_gas * Add comment linking to PR
References
#35808 - Fix GA loss for Deepspeed
Author
timjeffrey10
Parents
f3f6c865
Loading