[META] Binary type support with Faiss engine #1764

Closed
12 of 19 tasks
heemin32 opened this issue Jun 14, 2024 · 0 comments
Labels
Features Introduces a new unit of functionality that satisfies a requirement v2.16.0

Comments

Collaborator

heemin32 commented Jun 14, 2024

Customer interest in binary format support has grown recently, with a good deal of evidence showing good recall from binary values generated by LLMs. Because k-NN on large data sets is expensive due to the cost of memory, customers want to use the binary format instead of the float format, which reduces the required memory by a factor of 32 (1 bit per dimension instead of a 32-bit float).
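The 32x figure can be checked with simple arithmetic. The sketch below is illustrative only: the dimension and vector count are hypothetical, the helper names are not part of any plugin API, and it counts raw vector storage only, ignoring any index overhead such as graph links. It also assumes the dimension is a multiple of 8 so that bits pack into whole bytes.

```python
def float_vector_bytes(dimension: int) -> int:
    """Bytes for one vector stored as 32-bit floats (4 bytes per dimension)."""
    return dimension * 4


def binary_vector_bytes(dimension: int) -> int:
    """Bytes for one vector stored as packed bits (8 dimensions per byte)."""
    if dimension % 8 != 0:
        # Assumption for this sketch: bits must fill whole bytes.
        raise ValueError("dimension must be a multiple of 8")
    return dimension // 8


# Hypothetical workload, chosen only for illustration.
dimension = 768
num_vectors = 1_000_000

float_total = float_vector_bytes(dimension) * num_vectors
binary_total = binary_vector_bytes(dimension) * num_vectors

print(f"float32: {float_total / 2**20:.1f} MiB")
print(f"binary:  {binary_total / 2**20:.1f} MiB")
print(f"ratio:   {float_total // binary_total}x")
```

The ratio is independent of the dimension and vector count, since each dimension shrinks from 32 bits to 1 bit.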

Tasks

High Level Tasks:

  • Create RFC for the Feature Request. [RFC] Binary vector support #1767
  • HNSW support
    • JNI interface
    • POC heemin32@efa0199
    • Faiss patch to support SearchParameter for binary index
    • Index mapping, Ingestion, Query
    • Benchmark
  • Script scoring support on binary vector with hamming distance
    • Block non hamming distance scoring on binary vector
    • Implement script scoring with hamming distance on binary vector
  • IVF support
    • JNI interface
    • Faiss patch
    • Training, Ingestion, Query
    • Benchmark
  • Radial search
    • Radial search with HNSW
    • Radial search with IVF
    • Benchmark
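To make the Hamming distance scoring and radial search tasks above concrete, here is a minimal brute-force sketch in plain Python. It is not the plugin's or Faiss's implementation: `pack_bits`, `hamming_distance`, and `radial_search` are hypothetical helper names, the most-significant-bit-first packing order is an assumption, and a real engine would use HNSW or IVF rather than a linear scan.

```python
def pack_bits(bits):
    """Pack a list of 0/1 values into bytes, most significant bit first.

    Assumption for this sketch: len(bits) is a multiple of 8.
    """
    if len(bits) % 8 != 0:
        raise ValueError("number of bits must be a multiple of 8")
    out = bytearray()
    for i in range(0, len(bits), 8):
        byte = 0
        for b in bits[i:i + 8]:
            byte = (byte << 1) | (b & 1)
        out.append(byte)
    return bytes(out)


def hamming_distance(a: bytes, b: bytes) -> int:
    """Count the bit positions at which two equal-length packed vectors differ."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    # XOR leaves a 1 wherever the bits differ; count those 1s per byte.
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


def radial_search(query: bytes, vectors, radius: int):
    """Return (index, distance) for every vector within `radius`, nearest first."""
    hits = []
    for i, v in enumerate(vectors):
        d = hamming_distance(query, v)
        if d <= radius:
            hits.append((i, d))
    return sorted(hits, key=lambda hit: hit[1])


# Tiny 8-dimensional example vectors.
query = pack_bits([0, 0, 0, 0, 0, 0, 0, 0])
corpus = [
    pack_bits([0, 0, 0, 0, 0, 0, 0, 1]),
    pack_bits([1, 1, 1, 1, 1, 1, 1, 1]),
    pack_bits([0, 0, 0, 0, 0, 0, 1, 1]),
]
print(radial_search(query, corpus, radius=2))
```

Unlike top-k search, radial search returns a variable number of results: everything within the distance threshold, which may be empty.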

Community Requests

  1. [FEATURE] Hamming distance / binary vector support #81
Projects
Status: Done