Describe the bug
When ingest is run with chunking parameters, every parameter except --chunk-elements fails to reach the Chunking node, so the node behaves in unexpected ways.
The same number of vectors is upserted each time the test is run, regardless of the value of the --chunk-max-characters parameter.
When the CliChunkingConfig is logged, its values are always CliChunkingConfig(chunk_elements=True, multipage_sections=True, combine_text_under_n_chars=500, max_characters=1500), no matter what chunking parameters were provided.
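The symptom above (only one flag reaching the config, the rest staying at their defaults) can be illustrated with a minimal sketch. Everything here is hypothetical: `ChunkingConfigSketch` and `config_from_cli_options` are illustrative names, not the actual unstructured code, and the assumption that CLI flags arrive as `chunk_`-prefixed keys is a guess at the shape of the problem rather than its confirmed cause. The point is only that a check with non-default values would catch this kind of regression.

```python
from dataclasses import dataclass, fields
from typing import Any, Dict


@dataclass
class ChunkingConfigSketch:
    # Hypothetical stand-in for CliChunkingConfig. The last three defaults
    # are copied from the logged values in the report; chunk_elements=False
    # as a default is an assumption.
    chunk_elements: bool = False
    multipage_sections: bool = True
    combine_text_under_n_chars: int = 500
    max_characters: int = 1500


def config_from_cli_options(options: Dict[str, Any]) -> ChunkingConfigSketch:
    """Build the config from parsed CLI options (hypothetical helper).

    Assumes a flag such as --chunk-max-characters arrives under the key
    "chunk_max_characters", so each field is looked up both under its own
    name and under a "chunk_"-prefixed name.
    """
    kwargs: Dict[str, Any] = {}
    for f in fields(ChunkingConfigSketch):
        for key in (f.name, f"chunk_{f.name}"):
            if options.get(key) is not None:
                kwargs[f.name] = options[key]
                break
    return ChunkingConfigSketch(**kwargs)


# Regression-style check: non-default values must survive the mapping.
cfg = config_from_cli_options(
    {
        "chunk_elements": True,
        "chunk_max_characters": 300,
        "chunk_combine_text_under_n_chars": 50,
    }
)
assert cfg.chunk_elements is True
assert cfg.max_characters == 300
assert cfg.combine_text_under_n_chars == 50
assert cfg.multipage_sections is True  # not provided, so the default is kept
```

If the lookup used only the bare field names, `chunk_elements` would still be picked up while the other options silently fell back to their defaults, which matches the logged output in this report.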
Environment Info
make install + pip install pinecone-client
Closes #1414
Closes #2039
This PR:
- Uses the Pinecone Python client to implement a destination connector for
Pinecone and provides the ingest README requirements
[(here)](https://github.com/Unstructured-IO/unstructured/tree/main/unstructured/ingest#the-checklist)
for the connector
- Updates documentation for the s3 destination connector
- Alphabetically sorts setup.py contents
- Updates logs for the chunking node in ingest pipeline
- Adds a baseline session handle implementation for destination
connectors, to be able to parallelize their operations
- For the [bug](#1892) related to persisting element data to ingest
embedding nodes, tests the [solution](#1893) with this PR's ingest test
- Fixes a bug in the ingest chunking parameters and adds a related test
([bugfix on chunking params and implementing related test](69e1949))
---------
Co-authored-by: Roman Isecke <[email protected]>
To Reproduce
Clone https://github.com/Unstructured-IO/unstructured/tree/ahmet/pinecone-connector and run /test_unstructured_ingest/dest/pinecone.sh with the chunking parameters modified.