Feat/1136 elements ordering for pdf #1161
chore: add example docs
dfaf0f81
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
87eb1f8f
feat: add base scripts to evaluate `xy-cut` sorting result - evaluation
99445a1f
feat: scale coordinates to fit actual image size - evaluation
75cfaa89
feat: pass `PIL.Image` objects instead of image paths - evaluation
c349af98
feat: separate annotated images by pdf `strategy` - evaluation
ed40bfce
feat: handle an exception if `PageBreak` is `None` for the last page …
d7e559c3
refactor: rename the evaluation script - evaluation
56cfc16e
refactor: organization - evaluation
af451596
feat: ensure that the result of the `xy-cut` ordering is not affected…
c2b45008
feat: add functionality to switch sorting modes
eba56937
feat: add functionality to switch sorting modes for `hi_res`
3531aeda
feat: update the evaluation script - evaluation
88d813f9
feat: add jupyter notebook to provide evaluation for `xy-cut` sorting
3b540ea5
test: fix lint errors
8080c83b
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
dc9ca8f8
chore: update changelog & version
e90421be
chore: include a link to the original repo in the docstring of `unstr…
d1f13dba
test: fix lint errors
fccf6839
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
314a868a
refactor: move `document_to_element_list` from `file_utils/filetype.p…
114ae5b6
feat: add `sortable` param to `document_to_element_list` to avoid sor…
baf86cd5
test: fix lint errors
c9310ebd
feat: add functionality to skip sorting on empty elements
5b7ec64d
test: update test cases
18fb1979
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
8740b0fa
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
e273ebf1
Feat/1136 elements ordering for pdf <- Ingest test fixtures update (#…
614be6fd
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
8a5f19bf
feat: optionally import `sort_page_elements()` in `common.py`
eed362dc
chore: remove `evaluate_xy_cut_sorting.ipynb` & create a Google Colab…
9b27abca
feat: import `sort_page_elements()` only if `cv2` and `numpy` exist
01303b1c
chore: update README
61128cd3
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
2c770672
feat: apply basic sorting by default for fast `strategy` to avoid non…
9da3efd5
test: fix lint errors
509429f7
Feat/1136 elements ordering for pdf <- Ingest test fixtures update (#…
9d2af02d
Merge branch 'main' into feat/1136-elements-ordering-for-pdf
755885bf
cragwolfe
approved these changes
on 2023-08-25
cragwolfe
merged
483b09b3
into main 2 years ago
cragwolfe
deleted the feat/1136-elements-ordering-for-pdf branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub