transformers
Fix GA loss for Deepspeed
#35808
Merged

Fix GA loss for Deepspeed #35808

timjeffrey10
timjeffrey10 Fix GA loss for Deepspeed
73b688fe
timjeffrey10 timjeffrey10 force pushed to 73b688fe 1 year ago
Rocketknight1
muellerzr
muellerzr commented on 2025-01-21
timjeffrey10 Turn off loss scaling in DeepSpeed engine by scale_wrt_gas
30824ced
timjeffrey10
timjeffrey10
muellerzr
muellerzr approved these changes on 2025-01-22
muellerzr
timjeffrey10 Add comment linking to PR
e655d879
muellerzr muellerzr requested a review from ArthurZucker ArthurZucker 1 year ago
muellerzr muellerzr requested a review from SunMarc SunMarc 1 year ago
ArthurZucker
ArthurZucker approved these changes on 2025-01-23
ArthurZucker ArthurZucker merged 4ec425ff into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone