unstructured
4a96d549 - chore: move logger error to debug when pdfminer extract fails (#3028)

Commit
1 year ago
chore: move logger error to debug when pdfminer extract fails (#3028) ### Summary We are seeing logger error `Invalid dictionary construct` for hosted APIs, move this logger error to debug level - we still continue partition when pdfminer text extraction fails as before (just don't throw the log error anymore) ### Test I was able to reproduce the logger error with an internal only file (please DM me if needed) and the error trace look like ``` File "/Users/yumingl/develops/unstructured/unstructured/partition/pdf.py", line 709, in _process_pdfminer_pages annotation_list = get_uris(page.annots, height, coordinate_system, page_number) File "/Users/yumingl/develops/unstructured/unstructured/partition/pdf.py", line 1049, in get_uris resolved_annots = annots.resolve() ... ``` we also won't be able to repair pdf structure on `get_uris` (not a page level) so move this exception to debug level.
Author
Parents
Loading