unstructured
Feat: return base64 encoded images for PDF's
#2310
Merged

Feat: return base64 encoded images for PDF's #2310

christinestraub
christinestraub feat: add functionality to get a Base64 encoded string from a PIL image
3dad2eb2
christinestraub feat: save base64 encoded data and `MIME` type to new metadata fields
9a48aa15
christinestraub feat: pass `extract_to_payload` param through `pdf.partition_pdf()`
163de473
christinestraub feat: pass `pdf_extract_to_payload` param through `auto.partition()`
11fa1d97
christinestraub feat: pass `pdf_extract_element_types` param through `auto.partition()`
1d05f38c
christinestraub test: update test cases for `pdf_image_utils.save_elements()`
fc2a564e
christinestraub Merge branch 'main' into feat/2302-pdf-return-base64-encoded-images
1695b997
christinestraub test: add a test function to test partition pdf with element extraction
08ff4dad
christinestraub feat: keep all `Image` elements if image saving is required
29926835
christinestraub chore: update docstring for `auto.partition()` and `pdf.partition_pdf()`
21fd9244
christinestraub chore: update changelog & version
aaceefbc
christinestraub test: fix unit test errors
02f7a227
christinestraub test: fix lint error
2630a4ee
christinestraub feat: fix logic to keep all `Image` elements if image saving is required
afdecdac
christinestraub christinestraub requested a review from cragwolfe cragwolfe 2 years ago
christinestraub christinestraub requested a review from qued qued 2 years ago
christinestraub christinestraub requested a review from awalker4 awalker4 2 years ago
christinestraub christinestraub requested a review from Coniferish Coniferish 2 years ago
christinestraub chore: add an example pdf doc with both tables and images
9f1efcf6
christinestraub Merge branch 'main' into feat/2302-pdf-return-base64-encoded-images
e333b8aa
awalker4
awalker4 approved these changes on 2023-12-22
christinestraub
christinestraub chore: update docstring for `save_elements()`
99fa181c
Coniferish
Coniferish commented on 2023-12-24
Coniferish
Coniferish
Coniferish
Coniferish
christinestraub test: add a test function to inspect `image_base64` and `image_mime_t…
b7f5e294
christinestraub feat: update `save_elements()` utility function to support image file…
04266552
christinestraub feat: add functionality to save elements from image files
cb391a5e
christinestraub test: add a test function to inspect `image_base64` and `image_mime_t…
12039c53
christinestraub Merge branch 'main' into feat/2302-pdf-return-base64-encoded-images
a2ea076e
christinestraub chore: update changelog & version
d08ea15e
christinestraub chore: docstring
39bf26c3
christinestraub Merge branch 'main' into feat/2302-pdf-return-base64-encoded-images
20701a87
christinestraub christinestraub enabled auto-merge 2 years ago
christinestraub christinestraub merged dd144456 into main 2 years ago
christinestraub christinestraub deleted the feat/2302-pdf-return-base64-encoded-images branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone