unstructured
87bfe7a1 - build(deps): PDF images, unstructured-inference==0.5.23 (#1341)

Commit
2 years ago
Bumps unstructured-inference to 0.5.23 to pull in @christinestraub's fix (https://github.com/Unstructured-IO/unstructured-inference/pull/198), so images embedded in PDFs are now included in partition results (with the "hi_res" strategy). For consumers that want elements with clean text, this is not a big win, since many of the images yield OCR garbage as their text. However, it is important to preserve image elements for other downstream use cases, so overall this is a step forward.
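As a rough illustration of the trade-off described above, a downstream consumer could split image elements from the rest, so that text-focused code skips the noisy OCR output while image-focused code still receives the elements. This is only a sketch: `Element` and `split_images` are hypothetical stand-ins, not the library's classes, and it assumes each partitioned element exposes a `category` string equal to "Image" for embedded images. In real use the list would come from something like `partition_pdf(filename=..., strategy="hi_res")`.

```python
from dataclasses import dataclass


@dataclass
class Element:
    """Hypothetical stand-in for a partitioned element (not the library's class)."""
    category: str
    text: str


def split_images(elements):
    """Return (images, others) so text consumers can ignore OCR noise from images."""
    images = [el for el in elements if el.category == "Image"]
    others = [el for el in elements if el.category != "Image"]
    return images, others


# Made-up sample data standing in for a partition result.
sample = [
    Element("Title", "Quarterly report"),
    Element("Image", "l1 |~ x"),  # OCR garbage extracted from an embedded image
    Element("NarrativeText", "Revenue grew in Q2."),
]

images, others = split_images(sample)
print(len(images), len(others))
```

Keeping the image elements in the result and filtering them at the point of use means the choice stays with the consumer rather than being made during partitioning.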