
Add batch inference API #7853

Closed · wants to merge 4 commits

Conversation

@Zhangxunmt (Contributor) commented Jul 29, 2024

Description

Add documentation for batch inference as a new API under ML Commons Model APIs.

Issues Resolved

Closes #7848

Version

List the OpenSearch version to which this PR applies, e.g. 2.14, 2.12--2.14, or all.

Frontend features

If you're submitting documentation for an OpenSearch Dashboards feature, add a video that shows how a user will interact with the UI step by step. A voiceover is optional.

Checklist

  • By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license and subject to the Developers Certificate of Origin.
    For more information on following Developer Certificate of Origin and signing off your commits, please check here.


Thank you for submitting your PR. The PR states are In progress (or Draft) -> Tech review -> Doc review -> Editorial review -> Merged.

Before you submit your PR for doc review, make sure the content is technically accurate. If you need help finding a tech reviewer, tag a maintainer.

When you're ready for doc review, tag the assignee of this PR. The doc reviewer may push edits to the PR directly or leave comments and editorial suggestions for you to address (let us know in a comment if you have a preference). The doc reviewer will arrange for an editorial review.

@hdhalter added the release-notes and v2.16.0 labels Jul 29, 2024
@kolchfa-aws assigned kolchfa-aws and unassigned hdhalter Jul 29, 2024
@hdhalter added the "4 - Doc review" label Jul 29, 2024
@hdhalter changed the title "add batch inference API" to "Add batch inference API" Jul 29, 2024
For information about user access for this API, see [Model access control considerations]({{site.url}}{{site.baseurl}}/ml-commons-plugin/api/model-apis/index/#model-access-control-considerations).


For information about connectors and remote models, see [Connecting to externally hosted models]({{site.url}}{{site.baseurl}}/ml-commons-plugin/remote-models/index/). For more details of the connector blurprints for batch predict, see [GitHub docs](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/batch_inference_openAI_connector_blueprint.md)
Contributor:

I think the best approach would be to put this blueprint under a `batch_prediction` subfolder and link to that subfolder. That way, if we add blueprints for SageMaker and Cohere later, customers will still find them there.

Right now we're saying this works for SageMaker or Cohere, but there's no example for either. Also, why should a customer follow another link to get the same blueprint that is already here?

Collaborator:

I agree with Dhrubo. This will avoid having to maintain a list of blueprints here in the documentation. @Zhangxunmt could you create a subfolder so we can link to that from the docs?

Contributor (author):

It's fine to have a subfolder for offline actions. But please note that this is the API page showing how the API works, so let's keep it straight to the point. Using OpenAI as an example for this API is enough; other details will be documented elsewhere in blueprints or tutorials.

Collaborator:

I agree that this API page should be simple. Ideally, I would even remove the prerequisite steps from this API. However, our users have a very disjointed experience when going back and forth from the doc website to the ML repo. I didn't realize blueprints contained the workflow and not just the blueprint itself. I think we should port all ML blueprints and tutorials to the doc repo and have them on the doc site. I can take this on once this version is released. For now, it's fine to leave this API page with the current information.

Collaborator @kolchfa-aws left a comment:

Thank you, @Zhangxunmt! Please see my comments below.

---
layout: default
title: Batch inference
parent: Model APIs
Collaborator:

We currently have the Predict API under the train-predict directory, not model-apis. Either we need to move this one to train-predict, or we can move the predict API into the model-apis section. What do you think?

Contributor (author):

I think it makes more sense to move Predict into the model-apis section. The training part doesn't matter much, since most use cases involve remote or pretrained models that can be used for prediction directly.

Collaborator:

@Zhangxunmt Should we move train APIs to the model-apis section as well so the train-predict and model-apis sections are combined?


# Batch inference

ML Commons can predict large datasets in an offline asynchronous mode with your remote model deployed in external model servers. To use the Batch_Predict API, the `model_id` for a remote model is required. This new API is released as an experimental feature in the OpenSearch version 2.16, and only SageMaker, Cohere, and OpenAI are verified as the external servers that support this features.
Collaborator:

Suggested change
ML Commons can predict large datasets in an offline asynchronous mode with your remote model deployed in external model servers. To use the Batch_Predict API, the `model_id` for a remote model is required. This new API is released as an experimental feature in the OpenSearch version 2.16, and only SageMaker, Cohere, and OpenAI are verified as the external servers that support this features.
ML Commons can perform inference on large datasets in an offline asynchronous mode using a model deployed on external model servers. To use the Batch Predict API, you must provide the `model_id` for an externally hosted model. This new API is released as experimental in OpenSearch version 2.16, and only Amazon SageMaker, Cohere, and OpenAI are verified as the external servers that support this feature.

grand_parent: ML Commons APIs
nav_order: 20
---

Collaborator:

Please add an experimental header https://github.com/opensearch-project/documentation-website/blob/main/templates/EXPERIMENTAL_TEMPLATE.md and provide either a link to an issue where users can track the progress of the feature or a link to the OpenSearch forum.

For information about user access for this API, see [Model access control considerations]({{site.url}}{{site.baseurl}}/ml-commons-plugin/api/model-apis/index/#model-access-control-considerations).


For information about connectors and remote models, see [Connecting to externally hosted models]({{site.url}}{{site.baseurl}}/ml-commons-plugin/remote-models/index/). For more details of the connector blurprints for batch predict, see [GitHub docs](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/batch_inference_openAI_connector_blueprint.md)
Collaborator:

Suggested change
For information about connectors and remote models, see [Connecting to externally hosted models]({{site.url}}{{site.baseurl}}/ml-commons-plugin/remote-models/index/). For more details of the connector blurprints for batch predict, see [GitHub docs](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/batch_inference_openAI_connector_blueprint.md)
For information about externally hosted models, see [Connecting to externally hosted models]({{site.url}}{{site.baseurl}}/ml-commons-plugin/remote-models/index/). For the batch predict operation connector blueprints, see:
- [Amazon SageMaker batch predict connector blueprint](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/batch_inference_sagemaker_connector_blueprint.md).
- [OpenAI batch predict connector blueprint](https://github.com/opensearch-project/ml-commons/blob/main/docs/remote_inference_blueprints/batch_inference_openAI_connector_blueprint.md).

Collaborator:

Is there a Cohere blueprint for batch predict?


"model_id": "lyjxwZABNrAVdFa9zrcZ"
}
```

Collaborator:

Suggested change
To check the status of the operation, provide the task ID to the [Tasks API]({{site.url}}{{site.baseurl}}/ml-commons-plugin/api/tasks-apis/get-task/). Once the registration is complete, the task `state` changes to `COMPLETED`.

```

#### Example request

Collaborator @kolchfa-aws commented Aug 1, 2024:

Suggested change
Once you have completed the prerequisite steps, you can call the Batch Predict API. The parameters in the batch predict request override those defined in the connector:

POST /_plugins/_ml/models/lyjxwZABNrAVdFa9zrcZ/_batch_predict
{
"parameters": {
"model": "text-embedding-ada-002"
Collaborator:

This parameter has the same value as the one in the connector. Can we show the users how to change this or any other parameters to a different value?

}
```
{% include copy-curl.html %}
The parameters in the batch_predict request will override those defined in the connector.
Collaborator:

Suggested change
The parameters in the batch_predict request will override those defined in the connector.

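A reviewer above asks how a user would change a connector-defined parameter in the batch predict request. As a minimal sketch of the override semantics described in this thread, assuming a simple dictionary merge (the helper function is hypothetical, not the actual ML Commons implementation; the model names are illustrative):

```python
def resolve_parameters(connector_params, request_params):
    """Request-level parameters take precedence over connector defaults."""
    merged = dict(connector_params)  # start from the connector's defaults
    merged.update(request_params)    # any key present in the request wins
    return merged

# The connector defines a default model; the batch predict request overrides it.
connector = {"model": "text-embedding-ada-002", "completion_window": "24h"}
request = {"model": "text-embedding-3-small"}

params = resolve_parameters(connector, request)
# params["model"] is now "text-embedding-3-small";
# "completion_window" keeps the connector's default of "24h".
```

Any parameter omitted from the request keeps the value defined in the connector.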
{
"inference_results": [
{
"output": [
Collaborator:

We normally need to provide the descriptions of all response fields in the API doc. Is this the format of all batch predict responses? And where is the actual predict result? Maybe this API page should just show the API itself, and we need to add a complete end-to-end example under the remote-models section?

Contributor (author):

The actual prediction results are referenced by the `output_file_id` in the response, since this is offline async prediction. I provided some description of these results in the OpenAI blueprint linked from this API page. I think this page should just show the API itself and stay simple and straightforward. Should the end-to-end example/explanation go in a separate tutorial page somewhere else?

Collaborator:

I think we should have a complete example in a file under the remote-models directory.
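Because batch predict runs asynchronously, a client typically polls the Tasks API until the task leaves its in-progress states. A hedged sketch of such a polling loop follows; `fetch_task` is a placeholder for whatever issues `GET /_plugins/_ml/tasks/<task_id>` and parses the JSON, and the state names (`CREATED`, `RUNNING`, `COMPLETED`) are assumptions based on the states mentioned in this PR:

```python
import time

def wait_for_task(fetch_task, interval=1.0, timeout=60.0):
    """Poll a task until it leaves the in-progress states.

    fetch_task: zero-argument callable returning the task document as a
    dict, e.g. the parsed JSON of GET /_plugins/_ml/tasks/<task_id>.
    """
    deadline = time.monotonic() + timeout
    while True:
        task = fetch_task()
        if task.get("state") not in ("CREATED", "RUNNING"):
            return task  # e.g. COMPLETED, FAILED, or CANCELLED
        if time.monotonic() >= deadline:
            raise TimeoutError("task did not finish before the timeout")
        time.sleep(interval)
```

In a real client, `fetch_task` would wrap an HTTP GET against the cluster; keeping it as a callable makes the loop easy to test with stubbed responses.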

"request_body": "{ \"input_file_id\": \"${parameters.input_file_id}\", \"endpoint\": \"${parameters.endpoint}\", \"completion_window\": \"24h\" }"
}
]
}
Collaborator:

Suggested change
}
}
{% include copy-curl.html %}
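The connector `request_body` shown above is a template in which `${parameters.<name>}` placeholders are filled in from the resolved parameters at request time. A minimal sketch of that kind of substitution (illustrative only, not the actual ML Commons templating code; the file ID is a made-up value):

```python
import re

def fill_template(template, parameters):
    """Replace each ${parameters.<name>} placeholder with its value."""
    def replace(match):
        return str(parameters[match.group(1)])
    return re.sub(r"\$\{parameters\.(\w+)\}", replace, template)

# Filling the input file placeholder, as in the connector example above
body = fill_template(
    '{ "input_file_id": "${parameters.input_file_id}", "completion_window": "24h" }',
    {"input_file_id": "file-abc123"},
)
```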

"function_name": "remote",
"description": "OpenAI text embedding model",
"connector_id": "XU5UiokBpXT9icfOM0vt"
}
Collaborator:

Suggested change
}
}
{% include copy-curl.html %}

Signed-off-by: Xun Zhang <[email protected]>
@Zhangxunmt mentioned this pull request Aug 2, 2024
@Zhangxunmt closed this Aug 2, 2024
@hdhalter added the "Closed - Duplicate or Cancelled" label and removed the "4 - Doc review", release-notes, v2.16.0, and experimental labels Aug 5, 2024