transformers
fix: astronomical loss with ModernBERT when using gradient checkpointing (#38982)
#38983
Merged

fix: astronomical loss with ModernBERT when using gradient checkpointing (#38982) #38983

ArthurZucker merged 3 commits into huggingface:main from umarbutler:patch-3
umarbutler
umarbutler fix: astronomical loss with ModernBERT when using gradient checkpointing
50e8ff13
SunMarc
SunMarc approved these changes on 2025-06-23
SunMarc
SunMarc
ArthurZucker
ArthurZucker approved these changes on 2025-06-24
ArthurZucker
ArthurZucker
ArthurZucker commented on 2025-06-25
umarbutler
ArthurZucker
ArthurZucker update the modling fix
0a583d55
ArthurZucker Merge branch 'main' of github.com:huggingface/transformers into patch-3
6a62df7c
ArthurZucker ArthurZucker merged 860b898d into main 188 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone