transformers
Enable users to use their own loss functions + deal with prefetching for grad accum
#34198
Merged

Enable users to use their own loss functions + deal with prefetching for grad accum #34198

muellerzr merged 20 commits into main from muellerzr-fix-loss-calc
muellerzr
muellerzr bookmark
57c698fe
muellerzr Bookmark
3c579479
muellerzr Bookmark
1d57bd8f
muellerzr Actually implement
15e61f12
muellerzr Pass in kwarg explicitly
928e9271
muellerzr Adjust for if we do or don't have labels
b59c8f16
muellerzr Bookmark fix for od
79f9479e
muellerzr bookmark
13f33692
muellerzr Fin
8080f28a
muellerzr muellerzr marked this pull request as ready for review 1 year ago
muellerzr muellerzr requested a review from LysandreJik LysandreJik 1 year ago
muellerzr muellerzr requested a review from ArthurZucker ArthurZucker 1 year ago
muellerzr closer
13160e08
ArthurZucker
ArthurZucker commented on 2024-10-16
muellerzr Negate accelerate grad accum div
6fa155a8
muellerzr Fixup not training long enough
c2a705fd
muellerzr
BenjaminBossan
BenjaminBossan commented on 2024-10-17
muellerzr Add in compute_loss to take full model output
ac04e610
winglian
winglian commented on 2024-10-17
muellerzr Document
af8411b8
muellerzr compute_loss -> compute_loss_fn
a5fac5a0
ArthurZucker
ArthurZucker commented on 2024-10-17
muellerzr Add a test
39d8f28c
muellerzr muellerzr changed the title [DRAFT] Enable users to use their own loss functions + deal with prefetching for grad accum Enable users to use their own loss functions + deal with prefetching for grad accum 1 year ago
muellerzr Refactor
42849302
muellerzr Refactor
932a4910
muellerzr Uncomment tests
2a6b0383
muellerzr muellerzr requested a review from ArthurZucker ArthurZucker 1 year ago
danielhanchen
danielhanchen requested changes on 2024-10-17
ArthurZucker
ArthurZucker approved these changes on 2024-10-17
muellerzr Update tests/trainer/test_trainer.py
54d10ded
muellerzr muellerzr merged 6ba31a8a into main 1 year ago
muellerzr muellerzr deleted the muellerzr-fix-loss-calc branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone