enhancement: remove duplicate embedded images #2897
feat: add `clean_pdfminer_duplicate_image_elements()`
cc91ca5e
feat: add env_config `EMBEDDED_IMAGE_SAME_REGION_THRESHOLD`
7bb358a2
Merge branch 'main' into feat/remove-duplicate-embedded-images
200e912a
refactor: reorganize `clean_pdfminer_inner_elements()`
52e74192
chore: update changelog & version
89b65e4f
refactor
af6e082d
test: add unit test
15e66d04
test: fix lint error
d6d995b9
Merge branch 'main' into feat/remove-duplicate-embedded-images
7024d2ce
chore: bump version
f2833b9e
scanny commented on 2024-04-18
cragwolfe approved these changes on 2024-04-18
refactor: remove unused `defaultdict`
79ca250a
christinestraub deleted the feat/remove-duplicate-embedded-images branch 2 years ago
Assignees
No one assigned