Add map result support in neural search for non text embedding models #258

zane-neo · 2023-08-22T04:10:14Z

Description

Neural search only support text embedding model result which are usually in List<List> structure, for other models that returns non vector list like OpenAI, SPLADE etc, there's no available functions to get extract the json object result. This PR is to add several new functions to support these cases, the response are in Map<String, ?> structure and upper layer can extract more information from the map based on their purpose.

Issues Resolved

#260

Check List

New functionality includes testing.
- All tests pass
New functionality has been documented.
- New functionality has javadoc added
Commits are signed as per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

navneet1v · 2023-08-22T08:08:02Z

@zane-neo lets not skip the change log.

Also, motivation is not clear for this change. Can you add a github issue and talk about why we need this change? Is there any new feature that we are building which is going to use the new responses?

navneet1v · 2023-08-22T08:09:20Z

src/main/java/org/opensearch/neuralsearch/ml/MLCommonsClientAccessor.java

+        }
+        List<ModelTensor> tensorList = tensorOutputList.get(0).getMlModelTensors();
+        if (CollectionUtils.isEmpty(tensorList)) {
+            log.error("No tensor found!");


lets make this error message more understandable and with actions what happened wrong which resulted in this error

navneet1v · 2023-08-22T08:09:49Z

src/main/java/org/opensearch/neuralsearch/ml/MLCommonsClientAccessor.java

+        final ModelTensorOutput modelTensorOutput = (ModelTensorOutput) mlOutput;
+        final List<ModelTensors> tensorOutputList = modelTensorOutput.getMlModelOutputs();
+        if (CollectionUtils.isEmpty(tensorOutputList)) {
+            log.error("No tensor output found!");


lets make this error message more understandable and with actions what happened wrong which resulted in this error

martin-gaievski · 2023-08-22T15:42:36Z

src/main/java/org/opensearch/neuralsearch/ml/MLCommonsClientAccessor.java

@@ -144,4 +174,19 @@ private List<List<Float>> buildVectorFromResponse(MLOutput mlOutput) {
        return vector;
    }

+    private Map<String, ?> buildMapResultFromResponse(MLOutput mlOutput) {
+        final ModelTensorOutput modelTensorOutput = (ModelTensorOutput) mlOutput;


should we check the type of mlOutput before casting?

zane-neo · 2023-08-23T01:52:31Z

@zane-neo lets not skip the change log.

Also, motivation is not clear for this change. Can you add a github issue and talk about why we need this change? Is there any new feature that we are building which is going to use the new responses?

Sure, created this issue: #260. Yes, the SPLADE feature needs to use the new response.

navneet1v · 2023-08-23T06:04:50Z

@zane-neo as this is a new feature, and will probably go through multiple iterations before it is going to be released. Hence, for all new features let's not merge the changes directly in main. Lets merge keep reviewing the changes and merge them in a feature branch. Let me create a feature branch for this new feature.

So the process will go like this:

Neural Search plugin maintainer will cut a feature branch from main branch.
Contributors working on Sparse Vectors will raise the PR against the feature branch.
Once the PR is reviewed it will go in the feature branch.
Once all the changes are done and performance testing is done, commits of feature branch will be merged in the main branch.

Please let me know if there is any further questions.

cc: @vamshin

navneet1v · 2023-08-23T06:12:31Z

@zane-neo Feature branch for sparse vector support: https://github.com/opensearch-project/neural-search/tree/feature/sparseVectorSupport

Please use this branch for this and all future PRs related to SparseVector Support.

Signed-off-by: zane-neo <[email protected]>

zane-neo · 2023-09-01T08:23:16Z

Closing this PR since raised another PR to feature branch: #270

zane-neo requested review from heemin32, navneet1v, VijayanB, vamshin, jmazanec15, naveentatikonda, junqiu-lei, martin-gaievski, sean-zheng-amazon, model-collapse, wujunshen, ylwu-amzn and jngz-es as code owners August 22, 2023 04:10

zane-neo added the skip-changelog label Aug 22, 2023

navneet1v reviewed Aug 22, 2023

View reviewed changes

martin-gaievski reviewed Aug 22, 2023

View reviewed changes

zane-neo added 2 commits September 1, 2023 16:17

Add map result support in neural search for non text embedding models

feeb927

Signed-off-by: zane-neo <[email protected]>

Fix compilation failure issue

8dddbeb

Signed-off-by: zane-neo <[email protected]>

zane-neo force-pushed the map_result_support branch from 5892f92 to 8dddbeb Compare September 1, 2023 08:20

zane-neo closed this Sep 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add map result support in neural search for non text embedding models #258

Add map result support in neural search for non text embedding models #258

zane-neo commented Aug 22, 2023 •

edited

Loading

navneet1v commented Aug 22, 2023

navneet1v Aug 22, 2023

navneet1v Aug 22, 2023

martin-gaievski Aug 22, 2023 •

edited

Loading

zane-neo commented Aug 23, 2023

navneet1v commented Aug 23, 2023

navneet1v commented Aug 23, 2023

zane-neo commented Sep 1, 2023

Add map result support in neural search for non text embedding models #258

Add map result support in neural search for non text embedding models #258

Conversation

zane-neo commented Aug 22, 2023 • edited Loading

Description

Issues Resolved

Check List

navneet1v commented Aug 22, 2023

navneet1v Aug 22, 2023

Choose a reason for hiding this comment

navneet1v Aug 22, 2023

Choose a reason for hiding this comment

martin-gaievski Aug 22, 2023 • edited Loading

Choose a reason for hiding this comment

zane-neo commented Aug 23, 2023

navneet1v commented Aug 23, 2023

navneet1v commented Aug 23, 2023

zane-neo commented Sep 1, 2023

zane-neo commented Aug 22, 2023 •

edited

Loading

martin-gaievski Aug 22, 2023 •

edited

Loading