Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
Unstructured-IO/unstructured
Pull Requests
Commits
Open
Closed
Switch from pdfminer to paves to improve robustness and use multiple CPUs
#4067 opened 2025-07-19 04:10 by
dhdaines
perf: add early page count check to prevent expensive PDFMiner proces…
#4048 opened 2025-07-08 20:09 by
CyMule
Config for VoyageAI's v3.5 embedding models
#4004 opened 2025-05-21 11:33 by
fzowl
Feature/remove unnessary re for table ele in pdf
#3984 opened 2025-04-09 11:24 by
JIAQIA
bugfix/fix missing extensions in file detection
#3926 opened 2025-02-18 17:24 by
rbiseck3
Improve readability of the text by adding new line to the end of row
#3913 opened 2025-02-07 14:56 by
Sheripov
chore: bump deps
#3901 opened 2025-02-04 03:58 by
cragwolfe
fix: preserve text after line breaks in PowerPoint table cells
#3877 opened 2025-01-18 04:07 by
yamazombie
Add password
#3876 opened 2025-01-18 00:26 by
Coniferish
add post chunking strategy
#3869 opened 2025-01-16 17:45 by
tbs17
feat: Allow deactivating OCR entirely with hi_res strategy
#3839 opened 2024-12-17 19:58 by
dhdaines
fix: Fix issue #3815
#3835 opened 2024-12-17 09:30 by
PhorstenkampFuzzy
fix: when convert doc to docx, UnicodeDecodeError may be raised
#3830 opened 2024-12-14 09:10 by
YooshiJay
Add a note to README.md about CHANGELOG.md
#3824 opened 2024-12-12 12:46 by
dhdaines
chore: Switch to v4 of upload artifact
#3820 opened 2024-12-10 19:11 by
fxdgear
Prefer using provided filename over detection from file.name
#3786 opened 2024-11-19 11:16 by
framp
update scarf_analytics() GET request with timeouts
#3780 opened 2024-11-13 09:51 by
garyfanhku
fixed pdf path error.
#3777 opened 2024-11-09 02:04 by
mzdz
build(deps): bump ruff from 0.4.10 to 0.7.2 in /requirements
dependencies
python
#3771 opened 2024-11-01 16:32 by
dependabot[bot]
build(deps): bump tqdm from 4.66.5 to 4.66.6 in /requirements
dependencies
python
#3770 opened 2024-11-01 16:27 by
dependabot[bot]
build(deps): bump anchore/scan-action from 3 to 5
dependencies
github_actions
#3769 opened 2024-11-01 16:24 by
dependabot[bot]
build(deps): update botocore requirement from <1.34.132 to <1.35.54 in /requirements
dependencies
python
#3768 opened 2024-11-01 16:22 by
dependabot[bot]
build(deps): bump paddlepaddle from 3.0.0b1 to 3.0.0b2 in /requirements
dependencies
python
#3767 opened 2024-11-01 16:18 by
dependabot[bot]
feat: LanceDB integration
#3739 opened 2024-10-19 12:16 by
PrashantDixit0
Fix typing issue in inference_utils.py
#3716 opened 2024-10-12 01:00 by
cckolon
#3713 fix the wrong file path in README.md
documentation
#3714 opened 2024-10-10 06:50 by
shaofengshi
build: Fix build reproducibility.
#3712 opened 2024-10-09 22:07 by
jsirois
build(deps): bump peter-evans/create-pull-request from 5 to 7
dependencies
github_actions
#3682 opened 2024-10-01 16:32 by
dependabot[bot]
build(deps): bump ammaraskar/sphinx-action from e781e9af3e80bfe0ea539e4ea46858d51e027214 to c61ac11d9ee097caf8983c10c8b5af5861b32b54
dependencies
github_actions
#3681 opened 2024-10-01 16:32 by
dependabot[bot]
Fix bug causing partition_xlsx to raise error
#3663 opened 2024-09-25 02:13 by
bawgz
Older