add doc
Signed-off-by: zhichao-aws <[email protected]>
zhichao-aws committed Jun 7, 2024
1 parent a7cf8d8 commit 60dd58e
Showing 2 changed files with 17 additions and 2 deletions.
6 changes: 5 additions & 1 deletion _ml-commons-plugin/remote-models/blueprints.md
Expand Up @@ -87,9 +87,13 @@ The `client_config` parameter supports the following options.

| Field | Data type | Description |
|:---------------------|:----------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `max_connection` | Integer | The maximum number of concurrent connections that the client can establish with the server. |
| `max_connection` | Integer | The maximum number of concurrent connections that the client can establish with the server. Some remote services, like SageMaker, limit the number of concurrent connections and throw a throttling exception if the number of concurrent connections exceeds the threshold. The maximum number of concurrent connections from OpenSearch is `max_connection`*`node_number_for_connector`. To mitigate this issue, try decreasing the value of this parameter and modifying the retry settings in `client_config`. Default is `30`. |
| `connection_timeout` | Integer | The maximum amount of time (in seconds) that the client will wait while trying to establish a connection to the server. A timeout prevents the client from waiting indefinitely and allows it to recover from unreachable network endpoints. |
| `read_timeout` | Integer | The maximum amount of time (in seconds) that the client will wait for a response from the server after sending a request. Useful when the server is slow to respond or encounters issues while processing a request. |
| `retry_backoff_policy` | String | The backoff policy used when retrying requests to the remote connector. Useful when a traffic spike causes throttling exceptions. Supported policies are `constant`, `exponential_equal_jitter`, and `exponential_full_jitter`. Default is `constant`. |
| `max_retry_times` | Integer | The maximum number of times that a single remote inference request is retried. Useful when a traffic spike causes throttling exceptions. When set to `0`, retrying is disabled. When set to `-1`, the number of retries is unlimited. When set to a positive integer, that value is the maximum number of retries. Default is `0`. |
| `retry_backoff_millis` | Integer | The base backoff time, in milliseconds, for the retry policy. The wait time between two retries is determined by this parameter and `retry_backoff_policy`. Default is `200`. |
| `retry_timeout_seconds` | Integer | The timeout, in seconds, for retrying. If a retry does not succeed within this time, the connector stops retrying and throws an exception. Default is `30`. |
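
As an illustration only, a `client_config` fragment that lowers `max_connection` and turns on retries might look like the following sketch. The values are assumptions chosen for the example, not recommendations, and the rest of the connector definition is omitted. With a hypothetical three nodes running the connector, a `max_connection` of `10` allows up to 10 × 3 = 30 concurrent connections to the remote service.

```json
{
  "client_config": {
    "max_connection": 10,
    "retry_backoff_policy": "exponential_full_jitter",
    "max_retry_times": 5,
    "retry_backoff_millis": 200,
    "retry_timeout_seconds": 30
  }
}
```

Here `retry_backoff_millis` and `retry_timeout_seconds` are left at their documented defaults, while `max_retry_times` is raised from its default of `0` so that retries are actually attempted.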

## Built-in pre- and post-processing functions

13 changes: 12 additions & 1 deletion _search-plugins/neural-sparse-search.md
Expand Up @@ -390,4 +390,15 @@ The response contains both documents:

## Next steps

- To learn more about splitting long text into passages for neural search, see [Text chunking]({{site.url}}{{site.baseurl}}/search-plugins/text-chunking/).
- To learn more about splitting long text into passages for neural search, see [Text chunking]({{site.url}}{{site.baseurl}}/search-plugins/text-chunking/).

## FAQ

1. When using a connector to call a remote service such as SageMaker, ingestion or search fails because of a remote connector throttling exception. How can this be mitigated?

- Modify the connector [`client_config`]({{site.url}}{{site.baseurl}}/ml-commons-plugin/remote-models/blueprints/#configuration-parameters) to decrease `max_connection` so that the number of concurrent connections does not exceed the threshold of the remote service. You can also modify the retry settings to smooth out request spikes during ingestion.

> **_NOTE:_** In OpenSearch versions earlier than 2.15, the SageMaker throttling exception is returned as `"error": {"type": "status_exception", "reason": "Error from remote service: {\"message\":null}"}`.
