transformers
fix: fix gradient accumulate step for learning rate
#27667
Merged

fix: fix gradient accumulate step for learning rate #27667

ArthurZucker merged 1 commit into huggingface:main from fix_warmup
pphuc25
pphuc25 fix: fix gradient accumulate step for learning rate
47135629
sanchit-gandhi
sanchit-gandhi approved these changes on 2023-12-06
sanchit-gandhi sanchit-gandhi requested a review from ArthurZucker ArthurZucker 2 years ago
ArthurZucker
ArthurZucker approved these changes on 2023-12-07
ArthurZucker ArthurZucker merged 0410a29a into main 2 years ago
pphuc25 pphuc25 deleted the fix_warmup branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone