unstructured
Switch from pdfminer to paves to improve robustness and use multiple CPUs
#4067
Open

Commits
  • feat: switch from pdfminer to paves
    dhdaines committed 231 days ago
  • fix: manually hack deps since who knows how they get generated
    dhdaines committed 231 days ago
  • chore: black and ruff
    dhdaines committed 231 days ago
  • fix(tests): repair no longer necessary
    dhdaines committed 230 days ago
  • fix: avoid importing pypdf just to count pages!
    David Huggins-Daines committed 229 days ago
  • fix: playa needs "" as default password not None
    David Huggins-Daines committed 229 days ago
  • fix: require playa-pdf 0.6.2 for colormap issue
    David Huggins-Daines committed 229 days ago
  • fix: isort
    David Huggins-Daines committed 229 days ago
  • fix(tests): playa/paves do not output (cid:N) droppings
    David Huggins-Daines committed 229 days ago
  • fix(tests): update indices since (cid:N) no longer occurs
    David Huggins-Daines committed 229 days ago
  • fix(tests): update markdown and html fixtures
    David Huggins-Daines committed 229 days ago
  • fix(tests): fix missing or not missing newline for silly diff
    David Huggins-Daines committed 229 days ago
Loading