-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Vector Search #2526
feat: Vector Search #2526
Conversation
cd3d10f
to
694e2ab
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
several todos
- Cohere embed-multilingual-light-v3.0: 384 | ||
""" | ||
|
||
embed: Embeddings |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should probably accept a raw function.
We also don't (rn) support multimodal well?
value: The stored value. | ||
created_at: When the item was first created. | ||
updated_at: When the item was last updated. | ||
response_metadata: Optional metadata about the response/result. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Rn contains the similarity score.
694e2ab
to
585a194
Compare
6cc4662
to
9c25bd1
Compare
Transfered to #2535 |
Right now you enable it by:
Each document has 1 or more vectors associated with it for each json path in the embedding config.
Would welcome critique and requests!
Especially around handling migrations, whether we want to support multimedia (multimodal embeddings is old school already so kinda bad langchain embeddings classes don't support well), querying (do we need mmr or other strategies...?) and anything else that feels off.
Also happy to shelve the use of json path for saving embeddings if we think that's too complicated.
Also the default
__root__
embedding of the whole JSON object retains the keys. I'm guessing there's a bit of an impact here but probably better to have it remain self descriptive than to drop the keys before embedding the object.