unstructured
enhancement: render pdfs with pdfium
#4185
Merged

enhancement: render pdfs with pdfium #4185

qued merged 34 commits into main from enhancement/render-pdfs-with-pdfium
qued
qued factor rendering and use pypdfium instead
64971e7a
qued replace render with wrapper func
84b1e9ae
qued Merge branch 'main' into enhancement/render-pdfs-with-pdfium
278c6cdc
qued update changelog and version
214f7405
qued format
a57d2096
qued format
4faa341d
ryannikolaidis enhancement: render pdfs with pdfium <- Ingest test fixtures update (…
d8d7d268
qued get desired behavior re. output_folder and path_only
513424e7
qued higher dpi
ba02b994
qued Merge branch 'main' into enhancement/render-pdfs-with-pdfium
01862222
qued update tests and fix issues causing failures
7679f5e6
qued retype check slightly
9387e777
qued Update md fixtures part 1
3733b2c2
qued markdown fixtures part 2
532e0410
qued qued marked this pull request as ready for review 89 days ago
cursor
cursor commented on 2026-01-13
qued Add DPI to env config
5e4d6864
cursor
cursor commented on 2026-01-14
cragwolfe
cragwolfe approved these changes on 2026-01-14
cragwolfe
cragwolfe commented on 2026-01-14
cragwolfe
cragwolfe approved these changes on 2026-01-14
misrasaurabh1
misrasaurabh1 commented on 2026-01-14
qued Update unstructured/partition/pdf_image/pdf_image_utils.py
5b84b5d4
qued typing tweaks
65724c0e
qued Env var name change
710ef4f9
cursor
cursor commented on 2026-01-14
qued use exactly_one function for filtering
413fc636
qued formatting
0c025c64
qued pip-compile to (hopefully) deal with CVEs
e9ddc314
socket-security
qued Add jaraco.context fixed version to resolve CVE
d787765c
qued Ditch setuptools in dockerfile
0c0c5038
cursor
cursor commented on 2026-01-14
qued no cache apparently
fa15f216
qued uninstall user as well
fffa4ebf
qued no --user option
3ab5a16d
qued Revert dockerfile changes -- no joy yet, will do in separate PR
c1ed48f0
qued cast to int for typing
306610d4
qued Change dpi default in another place
95ea2a4b
cursor
cursor commented on 2026-01-15
qued arg, forgot this is scale not dpi, was blindly listening to linter
84b70700
cursor
cursor commented on 2026-01-15
qued error early
9aa9a480
qued deal with typing and remove check that's handled downstream
14426533
qued brought back bug if both filename and file are populated, fix
43ee99df
qued fix test
870ee207
qued qued merged 138661a7 into main 88 days ago
qued qued deleted the enhancement/render-pdfs-with-pdfium branch 88 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone