Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
Unstructured-IO/unstructured
Pull Requests
Commits
Open
Closed
feat: chunking by character and title now isolates tables
#4197 opened 2026-01-15 19:26 by
badGarnet
fix: Preserve Line Breaks in Code Blocks During Chunking
#4196 opened 2026-01-15 16:39 by
eureka928
fix: NameError: LayoutElements not defined in paddle_ocr.py
#4195 opened 2026-01-15 16:18 by
mohansinghi
fix: remove sandbox=True from pypandoc to fix ODT conversion
#4193 opened 2026-01-15 11:16 by
MkDev11
Luke/update dockerfile
#4192 opened 2026-01-14 21:29 by
luke-kucing
feat: add clean_newline helper function for hyphenated line breaks
#4188 opened 2026-01-12 21:08 by
davidfertube
fix(deps): Update security vulnerability in pyasn1 to v0.6.2 [SECURITY]
security
#4178 opened 2026-01-08 09:12 by
utic-renovate[bot]
Eliminate cleaners/core import time bottleneck
#4167 opened 2026-01-07 03:44 by
aseembits93
fix(deps): Update opensearchproject/opensearch Docker tag to v3
dependencies
security
#4137 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update docker.elastic.co/elasticsearch/elasticsearch Docker tag to v9
dependencies
security
#4136 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update semitechnologies/weaviate Docker tag to v1.35.3
dependencies
security
#4135 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update opensearchproject/opensearch Docker tag to v2.19.4
dependencies
security
#4134 opened 2025-12-24 18:26 by
utic-renovate[bot]
fix(deps): Update docker.elastic.co/elasticsearch/elasticsearch Docker tag to v8.19.10
dependencies
security
#4133 opened 2025-12-24 18:26 by
utic-renovate[bot]
update README.md
#4121 opened 2025-11-12 10:57 by
vhsakpal
new file: .idx/mcp.json
#4111 opened 2025-11-05 02:21 by
romethefixer
Bug 4105
#4107 opened 2025-10-13 20:35 by
carminoplata
fix: None text attribute when normalizing Picture to Image element
#4083 opened 2025-08-22 15:25 by
ishahroz
Switch from pdfminer to paves to improve robustness and use multiple CPUs
#4067 opened 2025-07-19 04:10 by
dhdaines
perf: add early page count check to prevent expensive PDFMiner proces…
#4048 opened 2025-07-08 20:09 by
CyMule
Feature/remove unnessary re for table ele in pdf
#3984 opened 2025-04-09 11:24 by
JIAQIA
bugfix/fix missing extensions in file detection
#3926 opened 2025-02-18 17:24 by
rbiseck3
Improve readability of the text by adding new line to the end of row
#3913 opened 2025-02-07 14:56 by
Sheripov
fix: preserve text after line breaks in PowerPoint table cells
#3877 opened 2025-01-18 04:07 by
yamazombie
Add password
#3876 opened 2025-01-18 00:26 by
Coniferish
add post chunking strategy
#3869 opened 2025-01-16 17:45 by
tbs17
feat: Allow deactivating OCR entirely with hi_res strategy
#3839 opened 2024-12-17 19:58 by
dhdaines
fix: Fix issue #3815
#3835 opened 2024-12-17 09:30 by
PhorstenkampFuzzy
fix: when convert doc to docx, UnicodeDecodeError may be raised
#3830 opened 2024-12-14 09:10 by
YooshiJay
Add a note to README.md about CHANGELOG.md
#3824 opened 2024-12-12 12:46 by
dhdaines
Prefer using provided filename over detection from file.name
#3786 opened 2024-11-19 11:16 by
framp
Older