You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Implement workflow set up interface to facilitate the creation of an ingest pipeline given the pipeline name, list of processors and their associated inputs.
This first implementation for the semantic search use case should construct an ingestion pipeline using the text_embedding pre-processor which handles transforming IngestDocuments into vector embeddings prior to being indexed into a KNN index and stores the pipeline ID within the GlobalContext index. This pipeline ID will then be used to set as the default_pipeline of the KNN index.
The implementation should parse the use case template to retrieve the following inputs for the text_embedding pre-processor:
input_field_name
output_field_name
The implementation should leverage the provided client to read the model_id from the global context.
Using this information, create the PutPipelineRequest payload:
@joshpalis this issue shouldn't just talk about neural pipeline but any ingest pipeline which we can create. Our building blocks shouldn't be tightly coupled. There should be a type associated while creating the pipeline. If it's neural and the processor is text_embedding the above pipeline would be created then.
Sure ill make this generic for any ingest pipeline. I do not agree with a type associated with the pipeline, pipelines should just have a pipelineID and the list of processors that were chosen, along with the necessary inputs. Having a type associated with these pipelines would require unnecessary mapping.
joshpalis
changed the title
[FEATURE] Create an ingest pipeline for neural search
[FEATURE] Create an ingest pipeline workflow step
Sep 13, 2023
Is your feature request related to a problem?
Implement workflow set up interface to facilitate the creation of an ingest pipeline given the pipeline name, list of processors and their associated inputs.
This first implementation for the semantic search use case should construct an ingestion pipeline using the
text_embedding
pre-processor which handles transformingIngestDocument
s into vector embeddings prior to being indexed into a KNN index and stores the pipeline ID within theGlobalContext
index. This pipeline ID will then be used to set as thedefault_pipeline
of the KNN index.The implementation should parse the use case template to retrieve the following inputs for the
text_embedding
pre-processor:The implementation should leverage the provided client to read the
model_id
from the global context.Using this information, create the
PutPipelineRequest
payload:Do you have any additional context?
Creation of the ingest pipeline should be done via a
PutPipelineRequest
using the transport client. Reference : https://github.com/opensearch-project/geospatial/blob/main/src/main/java/org/opensearch/geospatial/action/upload/geojson/PipelineManager.java#L92The text was updated successfully, but these errors were encountered: