transformers
feat(granitemoe*): Remove logits upcast when computing loss
#42753
Merged

feat(granitemoe*): Remove logits upcast when computing loss #42753

gabe-l-hart
gabe-l-hart feat: Remove logits upcast when computing loss
1f8d4288
gabe-l-hart chore: make fix-copies
3dbe486d
Ssukriti
Ssukriti approved these changes on 2025-12-10
gabe-l-hart Merge branch 'main' into GraniteOptionalUpcast-42709
47f810f4
github-actions
ArthurZucker
ArthurZucker approved these changes on 2025-12-11
ArthurZucker ArthurZucker merged 0af2381f into main 9 days ago
HuggingFaceDocBuilderDev
gabe-l-hart gabe-l-hart deleted the GraniteOptionalUpcast-42709 branch 9 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone