CORE-5030 gpt-4o ocr adam #3098
improve: add Python 3.12 support (#3033) (#3047)
d7608014
chore: add py.typed (#3043)
8802535e
fix: Add `pip` as explicit dep in `environment.yml` to prevent warnin…
84cec1f4
Updated Weaviate Docker image url (auto PR by bot) (#2659)
60f10fe6
fix: update container link in README.md (#2889)
6066a264
fix: set `skip_infer_tables` explicitly in `test_partition_via_api_wi…
acda4d07
docs: redirect to docs.unstructured.io on github pages (#3054)
73739b38
feat: refactor ingest (#3009)
3eaf65a8
build: apk add libreoffice24 (#3065)
059fc64b
feat: `partiton_pdf()` set inferred elements text (#3061)
b0d8a779
feat: add attribution for pinecone (#3067)
7832dfc7
rfctr(docx): organize docx tests (#3070)
30e5a0cd
chore: bump unstructured-inference 0.7.33 (#3074)
18428f24
rfctr: flatten test_unstructured/partition (#3073)
b4ee0191
fix: revert back to old requirements file for sphinx docs (#3077)
c9976760
feat/Move the category field to Element (#3056)
b8d894f9
fix: added the missing function argument (#3085)
9b83330b
fix: set `resolve_entities=False` in `partition_xml` (#3088)
171b5df0
feat(docx): add pluggable picture sub-partitioner (#3081)
47d28612
Fix: Chroma Upsert instead of Add (#3086)
31a53c8a
fix: decide table extraction (#3090)
35ec21ec
fix: add missing params to ElementMetadata (#3092)
26d403d7
chore: reduce excessive logging (#3095)
809c7e51
amaciaszek-dsai
changed the title Core 5030/gpt4o ocr adam v2 CORE-5030 gpt-4o ocr adam 1 year ago
fix: disable table_as_cells output by default (#3093)
32df4ee1
Adding gpt4o as ocr for ocr_only mode
9c937d6a
logs & basic error handling for openai
2aa5f9d8
openai in requirements
ab3ac239
change assert
8af7b98c
modify prompt & log img size
00246db4
resize too large imgs
13b474d7
increase max_tokens
5ef0dfe5
max_tokens for gpt-4o is 4k
06fdf939
pdf_text_extractable always as False
cedbd0d7
base.txt update after rebase
be9ad3d7
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub