Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
Unstructured-IO/unstructured
Pull Requests
Commits
Open
Closed
Adds Form Element
#4272 opened 2026-03-02 19:44 by
aadland6
refactor: replace deprecated decorators in partition_image with apply_metadata
#4271 opened 2026-03-02 12:55 by
HemantSudarshan
fix: add 'el' and 'gr' as Greek language code aliases for Tesseract OCR
#4270 opened 2026-02-27 18:45 by
s0wa48
Add a check for complex pdfs
#4268 opened 2026-02-26 18:34 by
aadland6
fix(deps): Update semitechnologies/weaviate Docker tag to v1.36.2
dependencies
security
#4267 opened 2026-02-26 18:22 by
utic-renovate[bot]
feat: audio speech to text partition
#4264 opened 2026-02-24 19:32 by
claytonlin1110
fix: handle list output from group_bullet_paragraph in element apply()
#4253 opened 2026-02-21 20:04 by
s0wa48
Simple typo fix
#4251 opened 2026-02-20 08:06 by
rchen19
fix(deps): Update security vulnerability in pypdf to v6.7.4 [SECURITY]
security
#4248 opened 2026-02-19 21:14 by
utic-renovate[bot]
fix: accept any IO[bytes] object in convert_to_bytes()
#4241 opened 2026-02-16 13:15 by
bittoby
Feat: embedding model voyage 4 family
#4234 opened 2026-02-11 18:12 by
fzowl
fix: coerce None text to empty string in Text element
#4231 opened 2026-02-10 10:53 by
themavik
feat: add XLSM (Excel Macro-Enabled Workbook) parsing support
#4227 opened 2026-02-08 16:51 by
longway-code
Add AgentMarket - B2A Marketplace
#4225 opened 2026-02-03 17:14 by
stromfee
docs: fix redundant whitespace in pyenv command in README
#4224 opened 2026-02-03 13:38 by
longway-code
fix(deps): Update docker.elastic.co/elasticsearch/elasticsearch Docker tag to v8.19.12
dependencies
security
#4223 opened 2026-02-03 12:19 by
utic-renovate[bot]
feat: Infer hierarchical heading levels (H1-H4) for PDFs
#4222 opened 2026-02-02 22:28 by
Good0987
Fix FutureWarning: Add test to verify bytes are wrapped in BytesIO for read_excel
#4213 opened 2026-01-27 12:59 by
Good0987
⚡️ Speed up function `merge_out_layout_with_ocr_layout` by 30%
#4212 opened 2026-01-27 02:31 by
aseembits93
⚡️ Speed up function `standardize_quotes` by 144%
#4201 opened 2026-01-21 02:31 by
KRRT7
feat: chunking by character and title now isolates tables
#4197 opened 2026-01-15 19:26 by
badGarnet
fix: NameError: LayoutElements not defined in paddle_ocr.py
#4195 opened 2026-01-15 16:18 by
mohansinghi
Eliminate cleaners/core import time bottleneck
#4167 opened 2026-01-07 03:44 by
aseembits93
update README.md
#4121 opened 2025-11-12 10:57 by
vhsakpal
new file: .idx/mcp.json
#4111 opened 2025-11-05 02:21 by
romethefixer
Bug 4105
#4107 opened 2025-10-13 20:35 by
carminoplata
fix: None text attribute when normalizing Picture to Image element
#4083 opened 2025-08-22 15:25 by
ishahroz
Switch from pdfminer to paves to improve robustness and use multiple CPUs
#4067 opened 2025-07-19 04:10 by
dhdaines
perf: add early page count check to prevent expensive PDFMiner proces…
#4048 opened 2025-07-08 20:09 by
CyMule
Feature/remove unnessary re for table ele in pdf
#3984 opened 2025-04-09 11:24 by
JIAQIA
Older