transformers
fix(trainer): Correct loss scaling for incomplete gradient accumulation steps
#39659
Merged
SunMarc merged 5 commits into huggingface:main from fix-issue-38837
Fix issue[#38837]: wrong loss scaled in last step of epoch
701110c9
chore: trigger CI
ae0f42a3
qgallouedec
approved these changes on 2025-07-25
Update src/transformers/trainer.py
b6b4b59d
Update src/transformers/modeling_flash_attention_utils.py
d16dc5b1
Merge branch 'main' into fix-issue-38837
da250646
qgallouedec
approved these changes on 2025-07-25
SunMarc
approved these changes on 2025-07-29
SunMarc merged 075dbbce into main 254 days ago
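For readers unfamiliar with the problem named in the title, here is a minimal sketch of how loss scaling can go wrong when the last gradient accumulation window of an epoch is incomplete. It is an illustration under assumptions, not the code from this PR: the function `accumulate` and its parameters are hypothetical, and it assumes the faulty behavior is dividing every micro-batch loss by the configured number of accumulation steps even when the final window holds fewer micro-batches.

```python
def accumulate(micro_batch_losses, grad_accum_steps, use_actual_window_size):
    """Return the accumulated (scaled) loss for each optimizer step.

    Hypothetical helper for illustration only; it is not part of transformers.
    """
    step_losses = []
    for start in range(0, len(micro_batch_losses), grad_accum_steps):
        # The last window is shorter when the epoch length is not a
        # multiple of grad_accum_steps.
        window = micro_batch_losses[start:start + grad_accum_steps]
        # Assumed fix: divide by the number of micro-batches actually
        # present in the window rather than by the configured step count.
        divisor = len(window) if use_actual_window_size else grad_accum_steps
        step_losses.append(sum(loss / divisor for loss in window))
    return step_losses


# 10 micro-batches per epoch, each with mean loss 2.0, accumulated in groups of 4.
losses = [2.0] * 10
naive = accumulate(losses, 4, use_actual_window_size=False)
fixed = accumulate(losses, 4, use_actual_window_size=True)
print(naive)  # the final, incomplete window is under-scaled
print(fixed)  # every window averages to the same loss
```

With 10 micro-batches and 4 accumulation steps, the last window holds only 2 micro-batches, so dividing by 4 halves its contribution; dividing by the actual window size keeps all optimizer steps on the same scale.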
Reviewers
SunMarc
qgallouedec
Assignees
No one assigned
Labels
None yet
Milestone
No milestone