transformers
83f9314d - fix: cast input pixels to appropriate dtype for image_to_text pipelines (#24947)

Commit

2 years ago

fix: cast input pixels to appropriate dtype for image_to_text pipelines (#24947)

* fix: cast input pixels to appropriate dtype for image_to_text tasks
* fix: add casting to pixel inputs of additional models after running copy checks

References

#24947 - fix: cast input pixels to appropriate dtype for image_to_text pipelines

#27720 - Add common processor tests

#29969 - [SigLIP] Add fast tokenizer

#32831 - [Docs] Update resources

#33111 - [Backbone] Remove out_features everywhere

#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs

#59 - Fix attention mask handling in EoMT-DINOv3 converter

#62 - Add initial DEIMv2 model implementation

#65 - Fix RTDetrV2 sine position embedding ordering

#44375 - Add RF-DETR

#71 - Use Mask2Former ignore_value in mask matching and losses

#44385 - Fix make check-repo

#45082 - [VidEoMT] Update conversion script

#45110 - Add SAM 3.1

Author

JimAllanson

Parents

1c7e5e23
