unstructured
5b7ae298 - fix: 521 pdf2image memory error (#924)

Commit
2 years ago
Closes issue #521. Implements the same logic as unstructured-inference/PR #136 for the ocr_only strategy.

* Add functionality to convert a PDF in small chunks of pages at a time
* Add functionality to write images to computer storage temporarily instead of keeping them in memory
* Set the file's current position to the beginning after reading the file in convert_to_bytes
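The three changes listed in the commit message can be illustrated with a minimal sketch. This is not the code from the commit: the helper names `page_chunks`, `read_bytes_and_rewind`, and `iter_page_image_paths`, the default chunk size of 10 pages, and the default DPI are all assumptions made for illustration. Only `pdf2image.pdfinfo_from_path` and `pdf2image.convert_from_path` (with `first_page`, `last_page`, `output_folder`, and `paths_only`) are real library calls, and they require poppler to be installed.

```python
import tempfile
from typing import BinaryIO, Iterator, Tuple


def page_chunks(total_pages: int, chunk_size: int = 10) -> Iterator[Tuple[int, int]]:
    """Yield 1-based, inclusive (first_page, last_page) ranges covering the PDF.

    Hypothetical helper; the chunk size of 10 is an assumed default.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    for first in range(1, total_pages + 1, chunk_size):
        yield first, min(first + chunk_size - 1, total_pages)


def read_bytes_and_rewind(file: BinaryIO) -> bytes:
    """Read the whole file, then move the position back to the start.

    Without the seek(0), a later reader of the same handle would see
    an empty stream. Hypothetical stand-in for the convert_to_bytes fix.
    """
    data = file.read()
    file.seek(0)
    return data


def iter_page_image_paths(pdf_path: str, chunk_size: int = 10, dpi: int = 200) -> Iterator[str]:
    """Convert a PDF a few pages at a time, yielding paths to page images on disk.

    The images live in a temporary directory that is deleted once the
    generator is exhausted or closed, so each path must be consumed
    (for example, passed to OCR) before asking for the next one.
    """
    import pdf2image  # third-party; imported lazily because it needs poppler

    total_pages = pdf2image.pdfinfo_from_path(pdf_path)["Pages"]
    with tempfile.TemporaryDirectory() as tmpdir:
        for first, last in page_chunks(total_pages, chunk_size):
            # paths_only=True returns file paths instead of loading
            # every PIL image into memory at once.
            paths = pdf2image.convert_from_path(
                pdf_path,
                dpi=dpi,
                first_page=first,
                last_page=last,
                output_folder=tmpdir,
                paths_only=True,
            )
            yield from paths
```

With this shape, peak memory is bounded by one chunk of pages rather than the whole document, at the cost of extra disk I/O for the temporary image files.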