unstructured
feat: add pinecone destination connector
#1774
Merged
ahmetmeleq
merged 118 commits into
main
from
ahmet/pinecone-connector
ahmetmeleq
added
ingest
ahmetmeleq
assigned
ahmetmeleq
2 years ago
ahmetmeleq
marked this pull request as ready for review
2 years ago
ahmetmeleq
requested a review
from
ryannikolaidis
2 years ago
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
ahmetmeleq
changed the base branch from
main
to
CORE-1503-dont-use-partition
2 years ago
ahmetmeleq
changed the base branch from
CORE-1503-dont-use-partition
to
main
2 years ago
ahmetmeleq
force pushed
from
003f89ff
to
a3735890
2 years ago
add index creation script
0389b527
rebase off main for the changes in ingest cli
f5fe2a2f
trials on bugfix
655ffb6d
fix dependency name
33f5054f
apply roman's updates to pinecone
473f73c6
trials on pinecone example
0ee9e6bd
serially batched upsert with embeddings issue workaround
c6b1dc59
parallelized upsert with session handles
4bb0b1b5
skip chunking to avoid missing embeddings, remove zipping (another wo…
22394949
fix for logging error
d781698e
alphabetic order setup.py
6a96193b
add docs
0289dcc0
docs
4bec171e
docs
10fb5e76
rearrange imports
5a3975c9
add dependencies
397328f3
update example
1f6aacb5
add tests
ab73a497
add pinecone ingest test
d040010f
obfuscate embedding api keys
03b32bc8
update pinecone cli based on the new cli rebase
9895077a
shellcheck
9b3096e3
changelog and version
099fc4fb
linting
d1b1045b
linting
4fb14b01
linting
5c33688c
fix chunking node logs
b1069d4f
remove redundant secret from test fixtures update pr job
67ccfaf6
remove redundant helper script
9dbad768
remove redundant comments in test
2470dc7f
update example
2e4dda21
fix log in pipeline embedding node
ae8598ec
change pinecone batching size
15d2459f
add debugging tip
0c28c179
update ingest test with chunking
5f39a648
update example with chunking
f307a0a7
organize requirements
daeecf98
update expected uploads based on the updates in main
134a8bf9
ahmetmeleq
force pushed
from
3c05cecc
to
134a8bf9
2 years ago
session handle fix
9c12d3bc
ahmetmeleq
enabled auto-merge
2 years ago
rbiseck3
commented on 2023-10-27
rbiseck3
commented on 2023-10-27
rbiseck3
commented on 2023-10-27
rbiseck3
commented on 2023-10-27
rbiseck3
commented on 2023-10-27
rbiseck3
commented on 2023-10-27
doc, comment and logging updates
6d84efc0
test and session creation updates
84e65e53
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
Merge branch 'main' into ahmet/pinecone-connector
8d7612e3
update for cli changes
5a438b8c
rbiseck3
commented on 2023-11-02
rbiseck3
commented on 2023-11-02
do not exclude metadata
cf773158
multiple attempts for testing
17c724bc
fix path typos on setup.py
02577697
ryannikolaidis
commented on 2023-11-03
ryannikolaidis
commented on 2023-11-03
ryannikolaidis
commented on 2023-11-03
ryannikolaidis
commented on 2023-11-03
Merge branch 'main' into ahmet/pinecone-connector
fec22635
ahmetmeleq
changed the title from
feat: pinecone destination (downstream) connector
to
feat: add pinecone destination connector
2 years ago
reorder test, update path in test
830e3878
potter-potter
commented on 2023-11-05
potter-potter
commented on 2023-11-05
potter-potter
commented on 2023-11-06
Merge branch 'main' into ahmet/pinecone-connector
5ef236f2
setup py changes from main
ca949471
ingest test uses huggingface embedder
791bf03d
remove comment
f434c3d7
add secret to test_ingest_dest job
b7345c76
disabled auto-merge
2 years ago
Manually disabled by user
make batch size a parameter
766d4853
bugfix on chunking params and implementing related test
69e1949a
Merge branch 'main' into ahmet/pinecone-connector
3fd8c628
pass metadata fields individually
f2786ce1
Merge branch 'ahmet/pinecone-connector' of https://github.com/Unstruc…
d1b1cd28
Merge branch 'main' into ahmet/pinecone-connector
c945851d
implement check_connection
4b49fbd2
ahmetmeleq
requested a review
from
ryannikolaidis
2 years ago
ahmetmeleq
requested a review
from
potter-potter
2 years ago
potter-potter
commented on 2023-11-09
ryannikolaidis
commented on 2023-11-09
ryannikolaidis
commented on 2023-11-09
ryannikolaidis
commented on 2023-11-09
ryannikolaidis
commented on 2023-11-09
expose writer num_processes, apply parallelization in ingest test
1ff1fd66
fix session handles
007ad367
Merge branch 'main' into ahmet/pinecone-connector
b4b858ab
logging updates
3d81cfdb
changelog and version
fac751e6
random index names to avoid test run collisions
35be64a3
re-add --chunk-new-after-n-chars
0440eb25
add support for new_after_n_chars
1e8f34e9
check existence of num_processes (dest) when logging
e700a75e
update docs
80fed3bd
update example and docs
00b123e1
ahmetmeleq
requested a review
from
ryannikolaidis
2 years ago
ahmetmeleq
requested a review
from
potter-potter
2 years ago
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
Merge branch 'main' into ahmet/pinecone-connector
1ead5e02
changelog
bac73f02
fix typo in example
1e6ff4c3
index creation retry logic for when another index is being deleted in…
c44ab12a
index creation retry logic for when another index is being deleted in…
94e66b3e
Merge branch 'ahmet/pinecone-connector' of https://github.com/Unstruc…
be87dd42
Merge branch 'ahmet/pinecone-connector' of https://github.com/Unstruc…
567ed4ea
Merge branch 'ahmet/pinecone-connector' of https://github.com/Unstruc…
074c1cac
update project variables, update sleep amounts
9adefb7e
update docs
65fce1c9
update docs
3a16a087
Merge branch 'main' into ahmet/pinecone-connector
362eb811
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
rbiseck3
commented on 2023-11-16
update docs
c3266f0b
Merge branch 'ahmet/pinecone-connector' of https://github.com/Unstruc…
8a6a0cbc
remove download_dir, remove index creation loop
387a7ad9
update example
f884123c
pythonic approach in docs
cbd734f4
update log
fa083ff7
move upsert method
29758f88
Merge branch 'main' into ahmet/pinecone-connector
14d4e511
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
shellcheck
5723467c
Merge branch 'main' into ahmet/pinecone-connector
992b60d2
rbiseck3
commented on 2023-11-20
Update docs/source/ingest/destination_connectors/pinecone.rst
49391134
rbiseck3
commented on 2023-11-20
rbiseck3
commented on 2023-11-20
rbiseck3
commented on 2023-11-20
Merge branch 'main' into ahmet/pinecone-connector
9812d932
version
b6e97733
s3 docs pythonic approach and local connector
4f65b496
add comment on why we use random rather than uuidgen
738d75c8
check if test variables are defined before setting
fe818e47
shellcheck double quotes
937bdfa9
update parent classes for cliconfig
a2b2fc37
different number of processes for processor and writer in test
940f72d5
Merge branch 'main' into ahmet/pinecone-connector
e51b88f2
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
add comment, add field selection from element, add list items separat…
35701dcd
potter-potter
commented on 2023-11-25
rbiseck3
commented on 2023-11-27
rbiseck3
commented on 2023-11-27
rbiseck3
commented on 2023-11-27
rbiseck3
commented on 2023-11-27
rbiseck3
commented on 2023-11-27
walrus syntax := instead of if [-z $...] for default parameters
12783796
better type checking for session handles
0ec7cae1
potter-potter
commented on 2023-11-28
implement check_connection
7b9e02b4
potter-potter
commented on 2023-11-28
move log for number of (upload) processes from pipeline to connector
1cf12903
update embedding docs to have embedding prepend for cli args
ca0785e9
add potter's flatten lists to flatten dicts
83518b0f
make all element fields indexable, add element_serialized
e1a6365f
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
ahmetmeleq
requested a review
from
potter-potter
2 years ago
Merge branch 'main' into ahmet/pinecone-connector
f8688e5a
rbiseck3
commented on 2023-11-29
unique ids for pinecone entries rather than using element ids
e353c6b2
Merge branch 'ahmet/pinecone-connector' of https://github.com/Unstruc…
7e0c7e7a
rbiseck3
commented on 2023-11-29
an additional error wrapper for check connection
b54e5ce3
ahmetmeleq
requested a review
from
rbiseck3
2 years ago
rbiseck3
approved these changes on 2023-11-29
Merge branch 'main' into ahmet/pinecone-connector
07dfbd85
changelog and version
cebfcb67
ahmetmeleq
enabled auto-merge
2 years ago
Merge branch 'main' into ahmet/pinecone-connector
5cfef3c1
ahmetmeleq
enabled auto-merge
2 years ago
ahmetmeleq
merged
ed08773d
into main
2 years ago
ahmetmeleq
deleted the ahmet/pinecone-connector branch
2 years ago
Reviewers
potter-potter
ryannikolaidis
rbiseck3
Assignees
ahmetmeleq
Labels
None yet
Milestone
No milestone