unstructured
be88eef0 - perf: optimize pdfminer image cleanup process for improved performance (#3630)

Commit
1 year ago
perf: optimize pdfminer image cleanup process for improved performance (#3630) This PR enhances `pdfminer` image cleanup process by repositioning the duplicate image removal step. It optimizes the removal of duplicated pdfminer images by performing the cleanup before merging elements, rather than after. This improvement reduces execution time and enhances the overall processing speed of PDF documents. --------- Co-authored-by: Yao You <theyaoyou@gmail.com>
Parents
Loading