unstructured
Chore(ingest) : add tests on PDFs with fast strategy
#614
Merged

cragwolfe merged 47 commits into main from yuming/fast_tests_ingest
yuming-long
yuming-long add --preserve-downloads to all ingest tests
cfe09171
yuming-long restruct output dir to end with ingest-output
b930daf9
yuming-long Revert "restruct output dir to end with ingest-output"
bd3ba37f
yuming-long add reprocess test
be8d8fb6
yuming-long add test to test set
695ed8f1
yuming-long specify download dir
fdbaeee7
yuming-long local recursive
e7e2d1e6
yuming-long skip s3 pdfs for now
89fce93c
yuming-long no local-recursive to aviod cp -r
d1ce7783
yuming-long expected output
681c4a80
yuming-long remove duplicate
a771b31d
yuming-long changelog and version
00821849
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
7eb4584b
yuming-long marked this pull request as draft 2 years ago
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
8f262d11
yuming-long s3 is backk
780816fc
yuming-long Revert "no local-recursive to aviod cp -r"
b056d783
yuming-long s3 recur
afdd2025
yuming-long use cp -a instead
1e7910f2
yuming-long expected output azure and s3
7c6d7e11
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
c81d65bc
yuming-long put back comment
e875aacc
yuming-long slack discord up
6828fcb6
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
b281ed90
yuming-long no nned to preserve download in local test
43bb6fb4
yuming-long local api
84ef99f5
yuming-long put back
144eb189
rerun on ec2
f52e2e8d
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
261b630e
yuming-long run is again
2df8c781
yuming-long echo diff
cf5483df
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
6641bc75
yuming-long changelog
47ffd154
yuming-long python3.8 output
469071b7
christinestraub Merge branch 'main' into yuming/fast_tests_ingest
425a23eb
christinestraub feat: add functionality to sort elements in `partition_pdf` for `fast…
e6e79bad
christinestraub test: update `metadata-exclude` in `test-ingest-pdf-fast-reprocess.sh`
e74c319d
christinestraub test: update ingest test fixtures
89d62515
christinestraub Merge branch 'main' into yuming/fast_tests_ingest
cdcc6c81
christinestraub chore: update changelog & version
b81217de
christinestraub test: handle the case where `el.coordinates` is `None`
1650ea37
christinestraub test: update unit test function for `pdf`
1b328a7c
cragwolfe update discord fixtures
0d0cf52a
cragwolfe update slack fixture
9939b2f8
cragwolfe Merge branch 'main' into yuming/fast_tests_ingest
f8a6a538
cragwolfe Update __version__.py
526a3f9d
cragwolfe fix the bump
27a76274
cragwolfe approved these changes on 2023-06-10
cragwolfe commented on 2023-06-10
cragwolfe requested a review from qued 2 years ago
cragwolfe marked this pull request as ready for review 2 years ago
qued approved these changes on 2023-06-12
yuming-long Merge branch 'main' into yuming/fast_tests_ingest
21a40829
cragwolfe enabled auto-merge (squash) 2 years ago
cragwolfe merged 2fbb1ccd into main 2 years ago
cragwolfe deleted the yuming/fast_tests_ingest branch 2 years ago