transformers
6ba31a8a - Enable users to use their own loss functions + deal with prefetching for grad accum (#34198)

Commit

1 year ago

Enable users to use their own loss functions + deal with prefetching for grad accum (#34198)

* bookmark
* Bookmark
* Bookmark
* Actually implement
* Pass in kwarg explicitly
* Adjust for if we do or don't have labels
* Bookmark fix for od
* bookmark
* Fin
* closer
* Negate accelerate grad accum div
* Fixup not training long enough
* Add in compute_loss to take full model output
* Document
* compute_loss -> compute_loss_fn
* Add a test
* Refactor
* Refactor
* Uncomment tests
* Update tests/trainer/test_trainer.py

Co-authored-by: Daniel Han <danielhanchen@gmail.com>

---------

Co-authored-by: Daniel Han <danielhanchen@gmail.com>
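The commit message does not spell out the underlying problem, so the following is only a sketch of the general issue that loss normalization under gradient accumulation tends to raise. It assumes token-level losses and micro-batches with unequal numbers of label tokens. All names here (`micro_batches`, `user_loss_fn`, `num_items_in_batch`) are hypothetical illustrations, not the signatures introduced by this commit. The point is that averaging each micro-batch separately differs from averaging over all tokens in the accumulated batch, and that a custom loss can only reproduce the latter if the total count is known before the first micro-batch is processed, which is a plausible reason for prefetching.

```python
# Hypothetical per-token losses for two micro-batches of unequal length.
micro_batches = [
    [2.0, 2.0, 2.0, 2.0],  # 4 label tokens
    [8.0],                 # 1 label token
]


def mean_of_means(batches):
    """Average each micro-batch first, then average those averages."""
    per_batch = [sum(b) / len(b) for b in batches]
    return sum(per_batch) / len(per_batch)


def token_weighted_mean(batches):
    """Sum every token loss, then divide once by the total token count."""
    total_tokens = sum(len(b) for b in batches)
    return sum(sum(b) for b in batches) / total_tokens


def user_loss_fn(token_losses, num_items_in_batch):
    """Hypothetical custom loss: sum-reduce, then normalise by a count
    that covers the whole accumulated batch, not just this micro-batch."""
    return sum(token_losses) / num_items_in_batch


# Naive accumulation: every micro-batch gets equal weight.
naive = mean_of_means(micro_batches)

# Reference value: what one large batch containing all tokens would give.
full = token_weighted_mean(micro_batches)

# Accumulating the custom loss with the total count known up front
# (which requires looking at all micro-batches before the first step).
total_items = sum(len(b) for b in micro_batches)
accumulated = sum(user_loss_fn(b, total_items) for b in micro_batches)

print(naive, full, accumulated)
```

In this toy case the naive value over-weights the single-token micro-batch, while the accumulated custom loss matches the full-batch reference.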

References

#29969 - [SigLIP] Add fast tokenizer

#34198 - Enable users to use their own loss functions + deal with prefetching for grad accum

#39821 - Support MetaCLIP 2

#58 - Add EoMT DINOv3 model

#59 - Fix attention mask handling in EoMT-DINOv3 converter

#41212 - Add EoMT with DINOv3 backbone

#62 - Add initial DEIMv2 model implementation

Author

muellerzr

Parents

7a06d07e