Unstructured-IO/unstructured
Pull Requests
Commits
Open
Closed
Remove pdfminer.six version constraint, bump dependencies to address high severity CVEs
#4156 opened 2026-01-04 06:40 by
lawrence-u10d
fix(deps): Update opensearchproject/opensearch Docker tag to v3
dependencies
security
#4137 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update docker.elastic.co/elasticsearch/elasticsearch Docker tag to v9
dependencies
security
#4136 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update semitechnologies/weaviate Docker tag to v1.35.2
dependencies
security
#4135 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update opensearchproject/opensearch Docker tag to v2.19.4
dependencies
security
#4134 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update docker.elastic.co/elasticsearch/elasticsearch Docker tag to v8.19.9
dependencies
security
#4133 opened 2025-12-24 18:26 by
utic-renovate[bot]
update README.md
#4121 opened 2025-11-12 10:57 by
vhsakpal
new file: .idx/mcp.json
#4111 opened 2025-11-05 02:21 by
romethefixer
Bug 4105
#4107 opened 2025-10-13 20:35 by
carminoplata
fix: None text attribute when normalizing Picture to Image element
#4083 opened 2025-08-22 15:25 by
ishahroz
Switch from pdfminer to paves to improve robustness and use multiple CPUs
#4067 opened 2025-07-19 04:10 by
dhdaines
perf: add early page count check to prevent expensive PDFMiner proces…
#4048 opened 2025-07-08 20:09 by
CyMule
Feature/remove unnessary re for table ele in pdf
#3984 opened 2025-04-09 11:24 by
JIAQIA
bugfix/fix missing extensions in file detection
#3926 opened 2025-02-18 17:24 by
rbiseck3
Improve readability of the text by adding new line to the end of row
#3913 opened 2025-02-07 14:56 by
Sheripov
fix: preserve text after line breaks in PowerPoint table cells
#3877 opened 2025-01-18 04:07 by
yamazombie
Add password
#3876 opened 2025-01-18 00:26 by
Coniferish
add post chunking strategy
#3869 opened 2025-01-16 17:45 by
tbs17
feat: Allow deactivating OCR entirely with hi_res strategy
#3839 opened 2024-12-17 19:58 by
dhdaines
fix: Fix issue #3815
#3835 opened 2024-12-17 09:30 by
PhorstenkampFuzzy
fix: when convert doc to docx, UnicodeDecodeError may be raised
#3830 opened 2024-12-14 09:10 by
YooshiJay
Add a note to README.md about CHANGELOG.md
#3824 opened 2024-12-12 12:46 by
dhdaines
Prefer using provided filename over detection from file.name
#3786 opened 2024-11-19 11:16 by
framp
update scarf_analytics() GET request with timeouts
#3780 opened 2024-11-13 09:51 by
garyfanhku
fixed pdf path error.
#3777 opened 2024-11-09 02:04 by
mzdz
feat: LanceDB integration
#3739 opened 2024-10-19 12:16 by
PrashantDixit0
#3713 fix the wrong file path in README.md
documentation
#3714 opened 2024-10-10 06:50 by
shaofengshi
build: Fix build reproducibility.
#3712 opened 2024-10-09 22:07 by
jsirois
Fix bug causing partition_xlsx to raise error
#3663 opened 2024-09-25 02:13 by
bawgz
chore: Fix spelling
#3622 opened 2024-09-12 16:51 by
jsoref