unstructured
Chore (refactor): support table extraction with pre-computed ocr data
#1801
Merged
yuming-long merged 51 commits into main from yuming/table_ocr_factor
moving infer_table_structure to ocr
95972cf1
lint
c54b07e0
merge two ocr env, doc nit
531c9ac0
Merge branch 'main' into yuming/table_ocr_factor
3918a878
update test
11b8efaf
make tidy lint
ce91fa48
prepare structure needed for table ocr
3ae44fdb
small nit on import
d94ca5e8
bump inference to 0.7.9 to use optional ocr_tokens
5de02ae4
logic for getting table tokens
b902747d
image scaling for tesseract ocr
47b9e3d4
fix some broken tests
c72f90de
fix bug can't set attribute
091c720c
Merge branch 'main' into yuming/table_ocr_factor
1105370b
forgot to pass in zoom parm
5362b42c
pass np array to aviod Corrupt JPEG data error
24cd20f7
tesseract ocr change title index
53223431
more test update due to index change
32d76809
idx change in output dut to scaling
4fee9338
update commented tests
7e5e8781
more index update
09f02e3c
two more todo to go :)
f6e632ad
table test for ocr mode
61872001
enhance test; table only return strcture
b490e9e4
stage for debugging
c67dbf76
fixed borken text where table token is mismatched
3bbe150d
note for table token
d1fb1cd3
add test for korean table
0a2b7a12
for coverage
33c438fb
Merge branch 'main' into yuming/table_ocr_factor
4554e94b
changelog and versioin
b21db4c0
yuming-long marked this pull request as ready for review 2 years ago
update korean table test and note
ee1a931a
yuming-long requested a review from badGarnet 2 years ago
yuming-long requested a review from christinestraub 2 years ago
yuming-long requested a review from qued 2 years ago
update default image crop pad
b33c264f
note nit
7cd339e9
christinestraub requested changes on 2023-10-19
Merge branch 'main' into yuming/table_ocr_factor
16add638
replace func with Rec.is_in and add unit test
37e2e397
move table_agnet outside loop, add is none check to init
311c52b2
nit on table import
a0f7b89e
christinestraub approved these changes on 2023-10-20
getting ingest update locally
9b6641ee
yuming-long
commented on 2023-10-20
should be on ec2 tho
8053e606
fix: model_name being None raises attribution error
52d212d5
Merge remote-tracking branch 'origin/main' into fix/none-model-name-b…
28cb79c3
ec2 docker ingest update
f06dfea1
Merge branch 'main' into yuming/table_ocr_factor
f6f16e4d
Merge branch 'main' into yuming/table_ocr_factor
03da2558
Merge remote-tracking branch 'origin/fix/none-model-name-breaks-api' …
65c13bf6
yuming-long enabled auto-merge 2 years ago
Revert "Merge remote-tracking branch 'origin/fix/none-model-name-brea…
71b6f617
disabled auto-merge 2 years ago (manually disabled by user)
Merge branch 'main' into yuming/table_ocr_factor
99487e00
yuming-long enabled auto-merge 2 years ago
Merge branch 'main' into yuming/table_ocr_factor
b898eb18
Merge branch 'main' into yuming/table_ocr_factor
db62ac88
yuming-long enabled auto-merge 2 years ago
version and changlog
ff9102ce
yuming-long merged ce40cdc5 into main 2 years ago
yuming-long deleted the yuming/table_ocr_factor branch 2 years ago
Reviewers
christinestraub
badGarnet
qued