Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Support byte sized vector in Lucene engine #812

Closed
vamshin opened this issue Mar 20, 2023 · 1 comment
Closed

[FEATURE] Support byte sized vector in Lucene engine #812

vamshin opened this issue Mar 20, 2023 · 1 comment

Comments

@vamshin
Copy link
Member

vamshin commented Mar 20, 2023

Is your feature request related to a problem?
Currently vectors are represented as floats(4 Bytes) for each dimension. This could lead to large storage and memory requirements to construct/load/save/search graphs and at scale becomes costly and also hard to manage the cluster requirements. In some cases, customers prefer to reduce the storage/memory foot prints at the cost of loosing tolerable recall.

What solution would you like?
We could provide Byte sized vectors that can use one Byte per dimension instead of current 4 Bytes using the ByteVector feature of Lucene. By providing quantization techniques we could let the floats convert to Int[-128 to 127] at loss of some precision and hence provide reduction to storage and memory requirements

@vamshin vamshin changed the title [FEATURE] Support byte sized vector [FEATURE] Support byte sized vector in Lucene engine Mar 20, 2023
@vamshin vamshin added backlog and removed untriaged labels Mar 20, 2023
@vamshin vamshin self-assigned this Mar 20, 2023
@vamshin vamshin added the v2.9.0 label Mar 20, 2023
@navneet1v
Copy link
Collaborator

If you are interested in this feature, please do a +1. If possible please leave a comment about your usecase.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants