Add support for Lucene inbuilt Scalar Quantizer #1848

naveentatikonda · 2024-07-17T23:56:39Z

Description

This PR adds support for inbuilt Scalar Quantizer with Lucene engine which takes fp32 vectors as input and dynamically quantizes them based on the number of bits(default 7) and other parameters like confidence_interval provided in the index mapping. For now, we are not supporting 8 bits due to a recall issue and only supporting 7 bits.

UX changes

To keep it consistent with Faiss, we are adding sq encoder under parameters field. The sq encoder in Lucene supports 3 optional parameters:

bits - Which decides the number of buckets for the input vector data to be quantized into; Defaults to 7.
confidence_interval - Which is used to determine the minQuantile and maxQuantile params used in the Quantization process. Acceptable values are 0, >=0.9 && <=1.0. If the confidence_interval is not provided we will set default value as null which is computed later in Lucene from the dimension of the vector as 1 - 1/(1 + dimension).

Sample Index mapping using sq encoder is shown below:

    "mappings": {
        "properties": {
            "my_vector1": {
                "type": "knn_vector",
                "dimension": 2,
                "method": {
                    "name": "hnsw",
                    "space_type": "l2",
                    "engine": "lucene",
                    "parameters": {
                        "encoder": {
                            "name": "sq",
                            "parameters": {
                                "bits": 7,
                                "confidence_interval": 1.0
                            }
                        }
                        "ef_construction": 128,
                        "m": 24,
                    }
                }
            }
        }
    }

Related Issues

#1277

Check List

New functionality includes testing.
New functionality has been documented.
API changes companion pull request created.
Commits are signed per the DCO using --signoff.
Public documentation issue/PR created.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

naveentatikonda · 2024-07-18T00:04:45Z

Note - I'm still running some benchmarking tests for 4 bits, will complete by weekend and include 4 bits if there are no performance issues. That will be a very small change.

src/main/java/org/opensearch/knn/index/Parameter.java

src/main/java/org/opensearch/knn/index/codec/Function5Arity.java

src/main/java/org/opensearch/knn/index/codec/BasePerFieldKnnVectorsFormat.java

src/main/java/org/opensearch/knn/index/codec/KNN920Codec/KNN920PerFieldKnnVectorsFormat.java

navneet1v · 2024-07-18T06:16:54Z

bits - Which decides the number of buckets for the input vector data to be quantized into; Defaults to 7.

confidence_interval - Which is used to determine the minQuantile and maxQuantile params used in the Quantization process. Acceptable values are 0, >=0.9 && <=1.0. If the confidence_interval is not provided we will set default value as null which is computed later in Lucene from the dimension of the vector as 1 - 1/(1 + dimension).

compress - Default value is false, which is useful only with 4 bits to compress and provide more memory reduction.

Seeing these different parameters and how they work. Lets ensure documentation is very clear otherwise there will be a lot of confusions.

jmazanec15

Whats the validation on the bits parameter?

src/main/java/org/opensearch/knn/common/KNNConstants.java

src/main/java/org/opensearch/knn/index/codec/BasePerFieldKnnVectorsFormat.java

jmazanec15 · 2024-07-18T17:38:36Z

Is 7-bit quantization implemented significantly different than 8-bit?

src/main/java/org/opensearch/knn/index/codec/KNNVectorsFormatParams.java

naveentatikonda · 2024-07-18T22:19:39Z

Whats the validation on the bits parameter?

In Lucene.java as part of parameter validation, created a list of integers which has the bits that we are supporting and validating against it.

naveentatikonda · 2024-07-18T22:22:30Z

Is 7-bit quantization implemented significantly different than 8-bit?

no, the difference is the number of buckets we are quantizing the given vectors into, which is 2^(bits) - 1 buckets

src/main/java/org/opensearch/knn/index/codec/KNNVectorsFormatParams.java

src/main/java/org/opensearch/knn/index/codec/KNNScalarQuantizedVectorsFormatParams.java

src/main/java/org/opensearch/knn/index/codec/KNNVectorsFormatParams.java

src/main/java/org/opensearch/knn/index/codec/KNNScalarQuantizedVectorsFormatParams.java

src/main/java/org/opensearch/knn/index/codec/KNN920Codec/KNN920PerFieldKnnVectorsFormat.java

src/main/java/org/opensearch/knn/index/codec/KNN990Codec/KNN990PerFieldKnnVectorsFormat.java

src/main/java/org/opensearch/knn/common/KNNConstants.java

Signed-off-by: Naveen Tatikonda <[email protected]>

heemin32 · 2024-07-23T00:27:48Z

src/main/java/org/opensearch/knn/index/Parameter.java

+                String validationErrorMsg = String.format(Locale.ROOT, "Null value provided for Double " + "parameter \"%s\".", getName());
+                return getValidationException(validationErrorMsg);
+            }
+            if (value.equals(0)) value = 0.0;


Nit. This style is error prune.

Suggested change

if (value.equals(0)) value = 0.0;

if (value.equals(0)) {

value = 0.0;

}

heemin32 · 2024-07-23T00:30:04Z

src/main/java/org/opensearch/knn/index/Parameter.java

+                String validationErrorMsg = String.format(Locale.ROOT, "Null value provided for Double " + "parameter \"%s\".", getName());
+                return getValidationException(validationErrorMsg);
+            }
+


No conversion here?

if (value.equals(0)) { value = 0.0; }

* Add support for Lucene Inbuilt Scalar Quantizer Signed-off-by: Naveen Tatikonda <[email protected]> * Refactor code Signed-off-by: Naveen Tatikonda <[email protected]> * Add Tests Signed-off-by: Naveen Tatikonda <[email protected]> * Address Review Comments Signed-off-by: Naveen Tatikonda <[email protected]> * Refactoring changes Signed-off-by: Naveen Tatikonda <[email protected]> * Remove compress as an input parameter and set default as true Signed-off-by: Naveen Tatikonda <[email protected]> * Add Constructor overloading and other refactoring changes Signed-off-by: Naveen Tatikonda <[email protected]> * Add more unit tests Signed-off-by: Naveen Tatikonda <[email protected]> * Set default encoder as encoder flat Signed-off-by: Naveen Tatikonda <[email protected]> --------- Signed-off-by: Naveen Tatikonda <[email protected]> (cherry picked from commit 71fff47)

opensearch-trigger-bot · 2024-07-23T00:37:50Z

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-1848-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 71fff475cd76f3fe7fd195a15f54138c9127ad6a
# Push it to GitHub
git push --set-upstream origin backport/backport-1848-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-1848-to-2.x.

opensearch-trigger-bot · 2024-07-23T00:37:53Z

The backport to 2.16 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.16 2.16
# Navigate to the new working tree
cd .worktrees/backport-2.16
# Create a new branch
git switch --create backport/backport-1848-to-2.16
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 71fff475cd76f3fe7fd195a15f54138c9127ad6a
# Push it to GitHub
git push --set-upstream origin backport/backport-1848-to-2.16
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.16

Then, create a pull request where the base branch is 2.16 and the compare/head branch is backport/backport-1848-to-2.16.

heemin32 · 2024-07-23T00:44:42Z

For now, we are not supporting 8 bits due to a recall issue and only supporting 7 bits.

Does that mean, user can only set 7 bits but nothing else?

naveentatikonda · 2024-07-23T00:47:40Z

For now, we are not supporting 8 bits due to a recall issue and only supporting 7 bits.

Does that mean, user can only set 7 bits but nothing else?

yes, for now only 7 bits. We will support 8 bits and 4 bits after the underlying quantization issues are resolved in lucene.

naveentatikonda added Features Introduces a new unit of functionality that satisfies a requirement backport 2.x v2.16.0 labels Jul 17, 2024

naveentatikonda self-assigned this Jul 17, 2024

naveentatikonda requested review from heemin32, navneet1v, VijayanB, vamshin, jmazanec15, junqiu-lei, martin-gaievski, ryanbogan and luyuncheng as code owners July 17, 2024 23:56

naveentatikonda force-pushed the add_support_for_lucene_inbuilt_sq branch 2 times, most recently from 7b192cf to dcbe6ad Compare July 17, 2024 23:59

Vikasht34 reviewed Jul 18, 2024

View reviewed changes

navneet1v reviewed Jul 18, 2024

View reviewed changes

jmazanec15 reviewed Jul 18, 2024

View reviewed changes

src/main/java/org/opensearch/knn/common/KNNConstants.java Outdated Show resolved Hide resolved

src/main/java/org/opensearch/knn/index/codec/BasePerFieldKnnVectorsFormat.java Outdated Show resolved Hide resolved

naveentatikonda force-pushed the add_support_for_lucene_inbuilt_sq branch from a9fd88b to d3e98cc Compare July 18, 2024 22:09

navneet1v reviewed Jul 18, 2024

View reviewed changes

src/main/java/org/opensearch/knn/index/codec/KNNVectorsFormatParams.java Outdated Show resolved Hide resolved

naveentatikonda force-pushed the add_support_for_lucene_inbuilt_sq branch from d3e98cc to 2beaba6 Compare July 18, 2024 22:31

navneet1v reviewed Jul 18, 2024

View reviewed changes

Vikasht34 reviewed Jul 21, 2024

View reviewed changes

src/main/java/org/opensearch/knn/index/codec/KNN920Codec/KNN920PerFieldKnnVectorsFormat.java Outdated Show resolved Hide resolved

navneet1v reviewed Jul 21, 2024

View reviewed changes

src/main/java/org/opensearch/knn/index/codec/KNN990Codec/KNN990PerFieldKnnVectorsFormat.java Outdated Show resolved Hide resolved

navneet1v reviewed Jul 21, 2024

View reviewed changes

src/main/java/org/opensearch/knn/common/KNNConstants.java Outdated Show resolved Hide resolved

naveentatikonda added 8 commits July 22, 2024 18:12

Add support for Lucene Inbuilt Scalar Quantizer

fdeba47

Signed-off-by: Naveen Tatikonda <[email protected]>

Refactor code

73e813b

Signed-off-by: Naveen Tatikonda <[email protected]>

Add Tests

5471768

Signed-off-by: Naveen Tatikonda <[email protected]>

Address Review Comments

d819860

Signed-off-by: Naveen Tatikonda <[email protected]>

Refactoring changes

4affc94

Signed-off-by: Naveen Tatikonda <[email protected]>

Remove compress as an input parameter and set default as true

4016e70

Signed-off-by: Naveen Tatikonda <[email protected]>

Add Constructor overloading and other refactoring changes

c66a592

Signed-off-by: Naveen Tatikonda <[email protected]>

Add more unit tests

5ff5db4

Signed-off-by: Naveen Tatikonda <[email protected]>

naveentatikonda force-pushed the add_support_for_lucene_inbuilt_sq branch from afea7cc to 4f7ae01 Compare July 22, 2024 23:17

naveentatikonda added the skip-changelog label Jul 22, 2024

Set default encoder as encoder flat

8631f19

Signed-off-by: Naveen Tatikonda <[email protected]>

naveentatikonda force-pushed the add_support_for_lucene_inbuilt_sq branch from 4f7ae01 to 8631f19 Compare July 22, 2024 23:19

jmazanec15 approved these changes Jul 23, 2024

View reviewed changes

navneet1v approved these changes Jul 23, 2024

View reviewed changes

naveentatikonda merged commit 71fff47 into opensearch-project:main Jul 23, 2024
52 checks passed

heemin32 approved these changes Jul 23, 2024

View reviewed changes

naveentatikonda added backport 2.x backport 2.16 and removed backport 2.x backport 2.16 labels Jul 23, 2024

opensearch-trigger-bot bot mentioned this pull request Jul 23, 2024

[Backport 2.x] Add support for Lucene inbuilt Scalar Quantizer #1871

Merged

opensearch-trigger-bot bot mentioned this pull request Jul 23, 2024

[Backport 2.16] Add support for Lucene inbuilt Scalar Quantizer #1872

Merged

naveentatikonda pushed a commit that referenced this pull request Jul 23, 2024

Add support for Lucene inbuilt Scalar Quantizer (#1848) (#1871)

f84caf8

naveentatikonda pushed a commit that referenced this pull request Jul 23, 2024

Add support for Lucene inbuilt Scalar Quantizer (#1848) (#1872)

bd5b633

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Lucene inbuilt Scalar Quantizer #1848

Add support for Lucene inbuilt Scalar Quantizer #1848

naveentatikonda commented Jul 17, 2024 •

edited

Loading

naveentatikonda commented Jul 18, 2024

navneet1v commented Jul 18, 2024

jmazanec15 left a comment

jmazanec15 commented Jul 18, 2024

naveentatikonda commented Jul 18, 2024

naveentatikonda commented Jul 18, 2024

heemin32 Jul 23, 2024

heemin32 Jul 23, 2024

opensearch-trigger-bot bot commented Jul 23, 2024

opensearch-trigger-bot bot commented Jul 23, 2024

heemin32 commented Jul 23, 2024 •

edited

Loading

naveentatikonda commented Jul 23, 2024

Add support for Lucene inbuilt Scalar Quantizer #1848

Add support for Lucene inbuilt Scalar Quantizer #1848

Conversation

naveentatikonda commented Jul 17, 2024 • edited Loading

Description

UX changes

Related Issues

Check List

naveentatikonda commented Jul 18, 2024

navneet1v commented Jul 18, 2024

jmazanec15 left a comment

Choose a reason for hiding this comment

jmazanec15 commented Jul 18, 2024

naveentatikonda commented Jul 18, 2024

naveentatikonda commented Jul 18, 2024

heemin32 Jul 23, 2024

Choose a reason for hiding this comment

heemin32 Jul 23, 2024

Choose a reason for hiding this comment

opensearch-trigger-bot bot commented Jul 23, 2024

opensearch-trigger-bot bot commented Jul 23, 2024

heemin32 commented Jul 23, 2024 • edited Loading

naveentatikonda commented Jul 23, 2024

naveentatikonda commented Jul 17, 2024 •

edited

Loading

heemin32 commented Jul 23, 2024 •

edited

Loading