transformers
a761d6e9 - Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP integration (#27652)

Commit

2 years ago

Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP integration (#27652)

* add code changes
  1. Refactor FSDP
  2. Add `--save_only_model` option: When checkpointing, whether to only save the model, or also the optimizer, scheduler & rng state.
  3. Bump up the minimum `accelerate` version to `0.21.0`
* quality
* fix quality?
* Revert "fix quality?"
  This reverts commit 149330a6abc078827be274db84c8a2d26a76eba1.
* fix fsdp doc strings
* fix quality
* Update src/transformers/training_args.py
  Co-authored-by: Zach Mueller <muellerzr@gmail.com>
* please fix the quality issue 😅
* Apply suggestions from code review
  Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
* address comment
* simplify conditional check as per the comment
* update documentation

---------

Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
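The effect of the `save_only_model` option described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the Trainer's actual implementation: `CheckpointConfig`, `ALL_COMPONENTS`, and `components_to_save` are hypothetical names invented for this example, and the component labels simply mirror the wording of the commit message (model, optimizer, scheduler, rng state).

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical labels for illustration only; these are not Trainer internals.
ALL_COMPONENTS: Tuple[str, ...] = ("model", "optimizer", "scheduler", "rng_state")


@dataclass
class CheckpointConfig:
    # Mirrors the intent of the `save_only_model` flag from the commit message.
    save_only_model: bool = False


def components_to_save(config: CheckpointConfig) -> Tuple[str, ...]:
    """Return which pieces of training state a checkpoint would contain."""
    if config.save_only_model:
        # Only the model is written; optimizer, scheduler and RNG state are skipped.
        return ("model",)
    return ALL_COMPONENTS


if __name__ == "__main__":
    print(components_to_save(CheckpointConfig(save_only_model=True)))
    print(components_to_save(CheckpointConfig()))
```

In `transformers` itself the flag is exposed as the `save_only_model` field of `TrainingArguments` (or `--save_only_model` on the command line). A checkpoint written this way is smaller, but because it lacks the optimizer, scheduler and RNG state it presumably cannot be used to resume training from the exact point where it stopped.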

References

#27652 - Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP integration

#27720 - Add common processor tests

#29969 - [SigLIP] Add fast tokenizer

#32831 - [Docs] Update resources

#33111 - [Backbone] Remove out_features everywhere

#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs

#59 - Fix attention mask handling in EoMT-DINOv3 converter

#62 - Add initial DEIMv2 model implementation

#65 - Fix RTDetrV2 sine position embedding ordering

#44375 - Add RF-DETR

#71 - Use Mask2Former ignore_value in mask matching and losses