Add Flash Attention 2 to M2M100 model (#30256)
* Added Flash Attention 2.
* Fixes.
* Fixed inheritance.
* Fixed init.
* Removed stuff.
* Added documentation.
* Added FA2 to M2M100 documentation.
* Added test.
* Fixed documentation.
* Update src/transformers/models/m2m_100/modeling_m2m_100.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* Update docs/source/en/model_doc/nllb.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Fixed variable name.
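A minimal sketch of how a user would opt into the Flash Attention 2 path this PR adds, assuming the `flash-attn` package is installed and a CUDA device is available; the checkpoint name is illustrative, and the helper function is not part of the PR:

```python
# Hedged sketch: enabling the FA2 attention backend on an M2M100 checkpoint.
import torch
from transformers import AutoModelForSeq2SeqLM


def load_m2m100_with_fa2(checkpoint: str = "facebook/m2m100_418M"):
    # FA2 kernels only run in half precision, so fp16 (or bf16) is required;
    # `attn_implementation` selects the attention backend.
    return AutoModelForSeq2SeqLM.from_pretrained(
        checkpoint,
        torch_dtype=torch.float16,
        attn_implementation="flash_attention_2",
    ).to("cuda")


if __name__ == "__main__":
    model = load_m2m100_with_fa2()
    print(model.config._attn_implementation)
```

Without `flash-attn` installed (or on CPU), `from_pretrained` raises an error for this backend, so the default `attn_implementation` should be left in place in that case.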
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>