transformers
fix(trainer): Correct loss scaling for incomplete gradient accumulation steps
#39659
Merged

fix(trainer): Correct loss scaling for incomplete gradient accumulation steps #39659

SunMarc merged 5 commits into huggingface:main from fix-issue-38837
hutaiHang
Fix issue[#38837]: wrong loss scaled in last step of epoch
701110c9
chore: trigger CI
ae0f42a3
hutaiHang
qgallouedec
qgallouedec approved these changes on 2025-07-25
hutaiHang Update src/transformers/trainer.py
b6b4b59d
hutaiHang Update src/transformers/modeling_flash_attention_utils.py
d16dc5b1
hutaiHang Merge branch 'main' into fix-issue-38837
da250646
hutaiHang
qgallouedec
qgallouedec approved these changes on 2025-07-25
qgallouedec
HuggingFaceDocBuilderDev
hutaiHang
qgallouedec
SunMarc
SunMarc approved these changes on 2025-07-29
SunMarc SunMarc merged 075dbbce into main 254 days ago
kaln27
hutaiHang

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone