feat(granitemoe*): Remove logits upcast when computing loss #42753
feat: Remove logits upcast when computing loss
1f8d4288
chore: make fix-copies
3dbe486d
Ssukriti
approved these changes
on 2025-12-10
Merge branch 'main' into GraniteOptionalUpcast-42709
47f810f4
gabe-l-hart
deleted the GraniteOptionalUpcast-42709 branch 9 days ago