Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
Unstructured-IO/unstructured
Pull Requests
Commits
Open
Closed
add post chunking strategy
#3869 opened 2025-01-16 17:45 by
tbs17
feat: Allow deactivating OCR entirely with hi_res strategy
#3839 opened 2024-12-17 19:58 by
dhdaines
fix: Fix issue #3815
#3835 opened 2024-12-17 09:30 by
PhorstenkampFuzzy
fix: when convert doc to docx, UnicodeDecodeError may be raised
#3830 opened 2024-12-14 09:10 by
YooshiJay
Add a note to README.md about CHANGELOG.md
#3824 opened 2024-12-12 12:46 by
dhdaines
Prefer using provided filename over detection from file.name
#3786 opened 2024-11-19 11:16 by
framp
update scarf_analytics() GET request with timeouts
#3780 opened 2024-11-13 09:51 by
garyfanhku
fixed pdf path error.
#3777 opened 2024-11-09 02:04 by
mzdz
feat: LanceDB integration
#3739 opened 2024-10-19 12:16 by
PrashantDixit0
#3713 fix the wrong file path in README.md
documentation
#3714 opened 2024-10-10 06:50 by
shaofengshi
build: Fix build reproducibility.
#3712 opened 2024-10-09 22:07 by
jsirois
Fix bug causing partition_xlsx to raise error
#3663 opened 2024-09-25 02:13 by
bawgz
chore: Fix spelling
#3622 opened 2024-09-12 16:51 by
jsoref
fix: remove mesa-gl workaround
#3615 opened 2024-09-10 18:14 by
MthwRobinson
fix: process attchments in partitioning nested emails (#3604)
#3605 opened 2024-09-07 10:30 by
S1M0N38
added note about nltk issue in readme
#3573 opened 2024-08-28 09:32 by
codeAshu
Add files via upload
#3560 opened 2024-08-23 17:34 by
sms7234
Compatibility Issue with Chinese Text in Document Parsing
#3530 opened 2024-08-16 18:50 by
Coniferish
Added 'inline' to content disposition check
#3489 opened 2024-08-07 04:46 by
taylorn-ai
Remove clean_bullets from partition_docx
#3464 opened 2024-08-01 15:23 by
jgen1
feat: Merges words which are split across two lines in text partition
#3394 opened 2024-07-14 09:10 by
sksharma0
bugfix: prevent unintended shared metadata updates in function `assign_and_map_hash_ids`
#3385 opened 2024-07-11 05:26 by
non-nil
fix: csv/tsv encoding
#3369 opened 2024-07-09 10:24 by
jaluma
Compatibility Issue with Chinese Text in Document Parsing
#3267 opened 2024-06-21 13:27 by
JIAQIA
Feat: Add-rc-locator-to-partition-excel
#3258 opened 2024-06-20 11:35 by
marctorsoc
FIX: The <div> text element with one <br> will not be regarded as a text element by `_is_text_tag`
#3209 opened 2024-06-14 09:38 by
heya5
feat: skip ocr for certain element types (Issue #3163)
#3182 opened 2024-06-11 05:59 by
beez2022
CORE-5030 gpt-4o ocr adam
#3098 opened 2024-05-24 15:46 by
amaciaszek-dsai
add bug fix for table metric
#3025 opened 2024-05-15 17:07 by
tbs17
Newer