unstructured
Chore (refactor): support table extraction with pre-computed ocr data
#1801
Merged

Chore (refactor): support table extraction with pre-computed ocr data #1801

yuming-long merged 51 commits into main from yuming/table_ocr_factor
yuming-long
yuming-long moving infer_table_structure to ocr
95972cf1
yuming-long lint
c54b07e0
yuming-long merge two ocr env, doc nit
531c9ac0
yuming-long Merge branch 'main' into yuming/table_ocr_factor
3918a878
yuming-long update test
11b8efaf
yuming-long make tidy lint
ce91fa48
yuming-long prepare structure needed for table ocr
3ae44fdb
yuming-long small nit on import
d94ca5e8
yuming-long bump inference to 0.7.9 to use optional ocr_tokens
5de02ae4
yuming-long logic for getting table tokens
b902747d
yuming-long image scaling for tesseract ocr
47b9e3d4
yuming-long fix some broken tests
c72f90de
yuming-long fix bug can't set attribute
091c720c
yuming-long Merge branch 'main' into yuming/table_ocr_factor
1105370b
yuming-long forgot to pass in zoom parm
5362b42c
yuming-long pass np array to aviod Corrupt JPEG data error
24cd20f7
yuming-long tesseract ocr change title index
53223431
yuming-long more test update due to index change
32d76809
yuming-long idx change in output dut to scaling
4fee9338
yuming-long update commented tests
7e5e8781
yuming-long more index update
09f02e3c
yuming-long two more todo to go :)
f6e632ad
yuming-long table test for ocr mode
61872001
yuming-long enhance test; table only return strcture
b490e9e4
yuming-long stage for debugging
c67dbf76
yuming-long fixed borken text where table token is mismatched
3bbe150d
yuming-long note for table token
d1fb1cd3
yuming-long add test for korean table
0a2b7a12
yuming-long for coverage
33c438fb
yuming-long Merge branch 'main' into yuming/table_ocr_factor
4554e94b
yuming-long changelog and versioin
b21db4c0
yuming-long yuming-long marked this pull request as ready for review 2 years ago
yuming-long update korean table test and note
ee1a931a
yuming-long yuming-long requested a review from badGarnet badGarnet 2 years ago
yuming-long yuming-long requested a review from christinestraub christinestraub 2 years ago
yuming-long yuming-long requested a review from qued qued 2 years ago
yuming-long update default image crop pad
b33c264f
yuming-long note nit
7cd339e9
christinestraub
christinestraub requested changes on 2023-10-19
yuming-long Merge branch 'main' into yuming/table_ocr_factor
16add638
yuming-long replace func with Rec.is_in and add unit test
37e2e397
yuming-long move table_agnet outside loop, add is none check to init
311c52b2
yuming-long nit on table import
a0f7b89e
christinestraub
christinestraub approved these changes on 2023-10-20
yuming-long getting ingest update locally
9b6641ee
yuming-long
yuming-long commented on 2023-10-20
yuming-long should be on ec2 tho
8053e606
badGarnet fix: model_name being None raises attribution error
52d212d5
badGarnet Merge remote-tracking branch 'origin/main' into fix/none-model-name-b…
28cb79c3
yuming-long ec2 docker ingest update
f06dfea1
yuming-long Merge branch 'main' into yuming/table_ocr_factor
f6f16e4d
yuming-long Merge branch 'main' into yuming/table_ocr_factor
03da2558
yuming-long Merge remote-tracking branch 'origin/fix/none-model-name-breaks-api' …
65c13bf6
yuming-long yuming-long enabled auto-merge 2 years ago
yuming-long Revert "Merge remote-tracking branch 'origin/fix/none-model-name-brea…
71b6f617
disabled auto-merge 2 years ago
Manually disabled by user
yuming-long Merge branch 'main' into yuming/table_ocr_factor
99487e00
yuming-long yuming-long enabled auto-merge 2 years ago
yuming-long Merge branch 'main' into yuming/table_ocr_factor
b898eb18
yuming-long Merge branch 'main' into yuming/table_ocr_factor
db62ac88
yuming-long yuming-long enabled auto-merge 2 years ago
yuming-long version and changlog
ff9102ce
yuming-long yuming-long merged ce40cdc5 into main 2 years ago
yuming-long yuming-long deleted the yuming/table_ocr_factor branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone