transformers
Add image text to text pipeline
#34170
Merged

Add image text to text pipeline #34170

yonigozlan
yonigozlan yonigozlan marked this pull request as ready for review 1 year ago
yonigozlan yonigozlan requested a review from Rocketknight1 Rocketknight1 1 year ago
yonigozlan yonigozlan requested a review from molbap molbap 1 year ago
yonigozlan yonigozlan requested a review from qubvel qubvel 1 year ago
HuggingFaceDocBuilderDev
zucchini-nlp
zucchini-nlp commented on 2024-10-15
yonigozlan yonigozlan force pushed 1 year ago
knkski
yonigozlan
Rocketknight1
yonigozlan
yonigozlan yonigozlan force pushed 1 year ago
knkski
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan yonigozlan requested a review from ArthurZucker ArthurZucker 1 year ago
Rocketknight1
Rocketknight1 commented on 2024-10-22
Wauplin
Rocketknight1
Rocketknight1 commented on 2024-10-23
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan
yonigozlan
yonigozlan commented on 2024-10-25
Rocketknight1
yonigozlan
Rocketknight1
ArthurZucker
ArthurZucker commented on 2024-10-25
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan
yonigozlan yonigozlan requested a review from ArthurZucker ArthurZucker 1 year ago
ArthurZucker
ArthurZucker ArthurZucker removed review request from ArthurZucker ArthurZucker 1 year ago
yonigozlan
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan
yonigozlan yonigozlan requested a review from ArthurZucker ArthurZucker 1 year ago
ArthurZucker
ArthurZucker
ArthurZucker commented on 2024-10-28
ydshieh
ydshieh
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan
yonigozlan yonigozlan requested a review from ArthurZucker ArthurZucker 1 year ago
qubvel
qubvel commented on 2024-10-29
yonigozlan yonigozlan force pushed 1 year ago
yonigozlan
ArthurZucker
ArthurZucker approved these changes on 2024-10-31
yonigozlan Standardize image-text-to-text-models-output
de37f38e
yonigozlan nit var name post_process_image_text_to_text udop
a7e56fae
yonigozlan nit fix deprecation warnings
9161a52a
yonigozlan Add image-text-to-text pipeline
04d4f77a
yonigozlan add support for image url in chat template for pipeline
0aabcb20
yonigozlan Reformat to be fully compatible with chat templates
5189f169
yonigozlan Add tests chat template
a0c90753
yonigozlan Fix imports and tests
726933fb
yonigozlan Add pipeline tag
b1d7a34f
yonigozlan change logic handling of single prompt ans multiple images
f92628c7
yonigozlan add pipeline mapping to models
fa254114
yonigozlan fix batched inference
9fe26c72
yonigozlan fix tests
316cf7d4
yonigozlan Add manual batching for preprocessing
c633b4b3
yonigozlan Fix outputs with nested images
d6598da4
yonigozlan Add support for all common processing kwargs
c8e58026
yonigozlan Add default padding when multiple text inputs (batch size>1)
20cdd5a1
yonigozlan nit change version deprecation warning
5bc43be8
yonigozlan Add support for text only inference
6cccf5fe
yonigozlan add chat_template warnings
ba8f85f2
yonigozlan Add pipeline tests and add copied from post process function
00174e82
yonigozlan Fix batched pipeline tests
8a65ea46
yonigozlan nit
da05987b
yonigozlan Fix pipeline tests blip2
1f2dafb3
yonigozlan remove unnecessary max_new_tokens
d66e5232
yonigozlan revert processing kosmos2 and remove unnecessary max_new_tokens
5056aa53
yonigozlan fix pipeline tests idefics
fe7e75d5
yonigozlan Force try loading processor if pipeline supports it
b866c279
yonigozlan revert load_processor change
3118dac4
yonigozlan hardcode loading only processor
065542aa
yonigozlan remove unnecessary try except
7f583dfb
yonigozlan skip imagetexttotext tests for kosmos2 as tiny model causes problems
aad9ad42
yonigozlan Make code clearer
e227b832
yonigozlan Address review comments
8f370f44
yonigozlan remove preprocessing logic from pipeline
c82fe29f
yonigozlan fix fuyu
7e1fb070
yonigozlan add BC resize fuyu
f581eaac
yonigozlan Move post_process_image_text_to_text to ProcessorMixin
4eda9631
yonigozlan add guard in post_process
02632213
yonigozlan fix zero shot object detection pipeline
45c17061
yonigozlan add support for generator input in pipeline
2e69b976
yonigozlan nit
58a6fb86
yonigozlan change default image-text-to-text model to llava onevision
66c017cb
yonigozlan fix owlv2 size dict
5772312f
yonigozlan Change legacy deprecation warning to only show when True
61cc5767
yonigozlan yonigozlan force pushed to 61cc5767 1 year ago
yonigozlan
yonigozlan yonigozlan merged 203e2705 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone