unstructured
fix: enable `partition_pdf` to recursively grab text with fast strategy
#796
Merged

fix: enable `partition_pdf` to recursively grab text with fast strategy #796

MthwRobinson merged 9 commits into main from fix/fast-skips-pages
MthwRobinson
MthwRobinson initial pass on text in figures
91ade0d6
MthwRobinson refactor text extraction
e3afea36
MthwRobinson update tests
f6f32f84
MthwRobinson fix title test
c6f1d0dc
MthwRobinson add test for docs that require recursive text grab
0d140994
MthwRobinson version and changelog
31b90f32
MthwRobinson fix merge conflicts
adb554ff
MthwRobinson MthwRobinson requested a review from qued qued 2 years ago
qued
qued approved these changes on 2023-06-22
MthwRobinson ingest-test-fixtures-update
484985b9
MthwRobinson there are 8 pdf files now
36d94ab6
MthwRobinson MthwRobinson merged 8683e269 into main 2 years ago
MthwRobinson MthwRobinson deleted the fix/fast-skips-pages branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone