Add segmentation + object detection image processors (#20160)
* Add transforms for object detection
* DETR models + YOLOS
* Scrappy additions
* MaskFormer image processor
* Fix up; MaskFormer tests
* Update OwlViT processor
* Add to docs
* OwlViT tests
* Update pad logic
* Remove changes to transforms
* Import fn directly
* Update to include pad transformation
* Remove unintended changes
* Add new OwlViT post-processing function
* Tidy up
* Fix copies
* Fix some copies
* Include device fix
* Fix scipy imports
* Update _pad_image
* Update padding functionality
* Fix bug
* Properly handle ignore index
* Fix up
* Remove defaults to None in docstrings
* Fix docstrings & docs
* Fix sizes bug
* Resolve conflicts in init
* Cast to float after resizing
* Tidy & add size if missing
* Allow kwargs when processing for OwlViT
* Update test values