unstructured
e359afaf - fix: coordinates bug on pdf parsing (#1462)

Commit
2 years ago
fix: coordinates bug on pdf parsing (#1462)

Addresses: https://github.com/Unstructured-IO/unstructured/issues/1460

We were raising an error with invalid coordinates, which prevented us from continuing to return the element and continue parsing the pdf. Now instead of raising the error we'll return early.

to test:
```
from unstructured.partition.auto import partition

elements = partition(url='https://www.apple.com/environment/pdf/Apple_Environmental_Progress_Report_2022.pdf', strategy="fast")
```

---------

Co-authored-by: cragwolfe <crag@unstructured.io>
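
The change described above (return early instead of raising on invalid coordinates) can be illustrated with a minimal sketch. This is not the code from the commit: `Element`, `is_valid_box`, and `add_coordinates` are hypothetical names, and the definition of "invalid" used here (missing, non-finite, or inverted bounds) is an assumption made for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


@dataclass
class Element:
    """Hypothetical stand-in for a parsed PDF element."""

    text: str
    coordinates: Optional[Tuple[Point, ...]] = None


def is_valid_box(box: Box) -> bool:
    """Assumed validity rule: four finite numbers with x1 <= x2 and y1 <= y2."""
    x1, y1, x2, y2 = box
    if not all(math.isfinite(v) for v in box):
        return False
    return x1 <= x2 and y1 <= y2


def add_coordinates(element: Element, box: Optional[Box]) -> Element:
    """Attach corner points to the element when the box is usable.

    Raising here would abort parsing of the whole document, so on a bad
    box the element is returned unchanged (without coordinates) instead.
    """
    if box is None or not is_valid_box(box):
        return element  # early return: keep the element, skip coordinates
    x1, y1, x2, y2 = box
    element.coordinates = ((x1, y1), (x1, y2), (x2, y2), (x2, y1))
    return element


good = add_coordinates(Element("Title"), (10.0, 20.0, 110.0, 40.0))
bad = add_coordinates(Element("Footer"), (110.0, 20.0, 10.0, 40.0))
```

With this shape, an element whose box fails validation still appears in the output, only without coordinates, so the remaining pages continue to be parsed.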