transformers
Add GOT-OCR 2.0 to Transformers
#34721
Merged

yonigozlan
yonigozlan force pushed 1 year ago
yonigozlan added New model
yonigozlan added Multimodal
Ucas-HaoranWei
Ucas-HaoranWei approved these changes on 2024-11-15
Ucas-HaoranWei
Ucas-HaoranWei approved these changes on 2024-11-15
Ucas-HaoranWei
Ucas-HaoranWei approved these changes on 2024-11-15
Ucas-HaoranWei
Ucas-HaoranWei approved these changes on 2024-11-18
yonigozlan force pushed 1 year ago
HuggingFaceDocBuilderDev
yonigozlan requested a review from qubvel 1 year ago
yonigozlan requested a review from molbap 1 year ago
molbap
molbap approved these changes on 2024-11-28
yonigozlan added run-slow
yonigozlan
yonigozlan requested a review from ArthurZucker 1 year ago
yonigozlan requested a review from molbap 1 year ago
yonigozlan force pushed 1 year ago
piercelamb
GXKIM
qubvel
qubvel commented on 2025-01-06
yonigozlan
test3211234
yonigozlan
yonigozlan init modular got_ocr2
cede6406
yonigozlan Get correct got_ocr architecture
72a003c3
yonigozlan add processing
7bda0529
yonigozlan run modular with processing
fdba2c56
yonigozlan add working inference
5251b05d
yonigozlan apply modular
fc64947c
yonigozlan Refactor and fix style
c8352b41
yonigozlan Refactor, cleanup, fix style
69fca747
yonigozlan fix init order
21effc48
yonigozlan Fix docs
fb9f006d
yonigozlan add base modeling tests
ca53c398
yonigozlan fix style and consistency
73da47e1
yonigozlan rename doc file
c7696353
yonigozlan fix repo consistency
fc49a527
yonigozlan fix inference with box
d546ac87
yonigozlan add image processing and support for crop_to_multi_page
66951798
yonigozlan Fix batch inference
fdbf0d20
yonigozlan add tests
764d456a
yonigozlan fixup
782e38cd
yonigozlan fix slow test
fea6929c
yonigozlan fix docstrings
b094771c
yonigozlan Add model doc
9ab7d9e2
yonigozlan update to new init
a843c4e6
yonigozlan fix input autocast pixel_values dtype
ee2b99ab
yonigozlan update doc
8fe8edeb
yonigozlan move doc to multimodal
4785527e
yonigozlan Reformat crop_image_to_patches and add docstrings
5d81df1f
yonigozlan Fix example in forward docstring
8784bcf5
yonigozlan Address Pablo review
a312ca2e
yonigozlan [run slow] got_ocr2
5500203c
yonigozlan remove defaults defined twice
f7596084
yonigozlan apply modular
7cea6a24
yonigozlan add torch_device to integration tests
3ef43eef
yonigozlan update modular
088672f1
yonigozlan follow-up Pavel review
7e6bab96
yonigozlan add device variable in doc
1f5f054c
yonigozlan fix doc multi-page
0ff44e8e
yonigozlan Force eager attention for vision encoder to avoid attn implementation…
3ae43ec5
yonigozlan force pushed to 3ae43ec5 1 year ago
ArthurZucker
ArthurZucker commented on 2025-01-23
ArthurZucker
ArthurZucker commented on 2025-01-23
yonigozlan Merge remote-tracking branch 'upstream/main' into add-got-ocr2
8289e69c
yonigozlan revert qwen2vl doc changes
9cbd8049
yonigozlan use Qwen2ForCausalLM instead of Qwen2Model
c87fa624
yonigozlan make fixup
34c716ed
yonigozlan refactor gotocr2 to llava style
0178c681
yonigozlan uniformize function names and reduce checks
31204126
yonigozlan Merge branch 'main' into add-got-ocr2
6b5169b4
yonigozlan requested a review from ArthurZucker 1 year ago
yonigozlan removed review request from molbap 1 year ago
yonigozlan Merge branch 'main' into add-got-ocr2
3b626beb
ArthurZucker
ArthurZucker approved these changes on 2025-01-30
yonigozlan final nits
b5b56d86
yonigozlan Merge branch 'add-got-ocr2' of https://github.com/yonigozlan/transfor…
991b3f37
yonigozlan Merge branch 'main' into add-got-ocr2
9046bf54
yonigozlan fix pixel_values dtype error
cc861d3e
yonigozlan change checkpoint names
506d5665
yonigozlan Merge branch 'main' into add-got-ocr2
2dbbf24e
yonigozlan fix modular
9fb4e909
yonigozlan merged 2b469431 into main 1 year ago
