[ML] Preserve order of inference results #100143

davidkyle · 2023-10-02T14:45:36Z

When a request contains multiple inputs, the order in which those inputs are processed is not deterministic if the C++ process is using more than one allocation. This change ensures the inference results are returned in the same order as the request inputs, so a caller knows that result 1 is for input 1, and so on.

Another change is to return all results even if there was a failure. Failures are returned as ErrorInferenceResults; the caller should check for instances of ErrorInferenceResults and handle them appropriately.

This is labelled as a bug because the _infer API accepts multiple inputs and previously the returned order was not guaranteed to match the input order.
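To illustrate the idea described above, here is a minimal sketch of collecting asynchronous results into slots indexed by input position, with failures stored as results instead of aborting the request. It is not the code from this PR: `OrderedResults`, `Collector`, `SlotListener`, `TextResult` and `ErrorResult` are hypothetical stand-ins rather than Elasticsearch types, and the sketch assumes at least one input and exactly one callback per input.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.AtomicReferenceArray;
import java.util.function.Consumer;

// Hypothetical stand-ins for illustration only; not the Elasticsearch classes.
final class OrderedResults {
    interface Result {}
    record TextResult(String value) implements Result {}
    record ErrorResult(Exception exception) implements Result {}

    // Minimal async callback, loosely modelled on a response/failure listener.
    interface SlotListener {
        void onResponse(Result result);
        void onFailure(Exception e);
    }

    static final class Collector {
        private final AtomicReferenceArray<Result> slots;
        private final AtomicInteger remaining;
        private final Consumer<List<Result>> onComplete;

        // Assumes inputCount >= 1; with zero inputs onComplete would never fire.
        Collector(int inputCount, Consumer<List<Result>> onComplete) {
            this.slots = new AtomicReferenceArray<>(inputCount);
            this.remaining = new AtomicInteger(inputCount);
            this.onComplete = onComplete;
        }

        // Returns the listener for the input at position `slot`.
        SlotListener listenerFor(int slot) {
            return new SlotListener() {
                @Override
                public void onResponse(Result result) {
                    complete(slot, result);
                }

                @Override
                public void onFailure(Exception e) {
                    // A failure occupies its slot as an error result.
                    complete(slot, new ErrorResult(e));
                }
            };
        }

        private void complete(int slot, Result result) {
            slots.set(slot, result);
            // The last callback to arrive publishes the list in input order.
            if (remaining.decrementAndGet() == 0) {
                List<Result> ordered = new ArrayList<>(slots.length());
                for (int i = 0; i < slots.length(); i++) {
                    ordered.add(slots.get(i));
                }
                onComplete.accept(ordered);
            }
        }
    }
}
```

Each input gets its own listener via `listenerFor(i)`, so the position of a result in the final list depends only on `i` and not on which callback fires first.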

elasticsearchmachine · 2023-10-02T14:46:00Z

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine · 2023-10-02T15:02:06Z

Hi @davidkyle, I've created a changelog YAML for you.

docs/changelog/100143.yaml

jonathan-buttner

Not sure how hard it'd be, but would it be worth adding a test for the ordering?

jonathan-buttner · 2023-10-02T15:48:14Z

.../main/java/org/elasticsearch/xpack/ml/action/TransportInferTrainedModelDeploymentAction.java

+     * the listener will never call {@code finalListener::onFailure}
+     * instead failures are returned as inference results.
+     */
+    private ActionListener<InferenceResults> orderedListener(


nit: Can we make this static?

👍 and I've added a test

jonathan-buttner · 2023-10-02T15:54:22Z

...in/ml/src/main/java/org/elasticsearch/xpack/ml/action/TransportInternalInferModelAction.java

+                                if (result instanceof ErrorInferenceResults errorResult) {
+                                    // Any failure fails all requests
+                                    // TODO is this the correct behaviour for batched requests?
+                                    finalListener.onFailure(errorResult.getException());


I don't know the code well enough, but maybe in the future we could make the response similar to a bulk response, where an entry in the results array can be either a failure or a successful result?

That's the idea. The REST response does not have to change, but internal users (such as ingest) can make better decisions about how to handle a partially successful response.
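As a rough sketch of what handling a partially successful response could look like for an internal caller, the example below splits an ordered result list into successes and failures while keeping the input index of each failure. All names here (`PartialResults`, `Summary`, `Failure`, `TextResult`, `ErrorResult`) are hypothetical and only stand in for the real result types; this is one possible shape, not the behaviour implemented in this PR.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical types for illustration; not the Elasticsearch classes.
final class PartialResults {
    interface Result {}
    record TextResult(String value) implements Result {}
    record ErrorResult(Exception exception) implements Result {}

    // A failure remembers which request input it belongs to.
    record Failure(int inputIndex, Exception cause) {}

    record Summary(List<Result> successes, List<Failure> failures) {
        boolean partiallySuccessful() {
            return !successes.isEmpty() && !failures.isEmpty();
        }
    }

    // Assumes orderedResults has one entry per input, in input order.
    static Summary summarize(List<Result> orderedResults) {
        List<Result> successes = new ArrayList<>();
        List<Failure> failures = new ArrayList<>();
        for (int i = 0; i < orderedResults.size(); i++) {
            Result result = orderedResults.get(i);
            if (result instanceof ErrorResult error) {
                failures.add(new Failure(i, error.exception()));
            } else {
                successes.add(result);
            }
        }
        return new Summary(successes, failures);
    }
}
```

A caller could then choose to fail the whole request when `failures()` is non-empty, as the quoted diff does, or to continue with the successful entries and report the failed input indices.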

Preserve order of inference results

cf87694

davidkyle added >bug >non-issue :ml Machine learning v8.11.0 labels Oct 2, 2023

elasticsearchmachine added the Team:ML label Oct 2, 2023

davidkyle removed the >non-issue label Oct 2, 2023

Update docs/changelog/100143.yaml

f9f888f

davidkyle commented Oct 2, 2023

docs/changelog/100143.yaml (outdated)

Update docs/changelog/100143.yaml

8a8b162

davidkyle commented Oct 2, 2023

docs/changelog/100143.yaml (outdated)

Update docs/changelog/100143.yaml

5d00694

jonathan-buttner approved these changes Oct 2, 2023

davidkyle and others added 2 commits October 2, 2023 17:42

Update x-pack/plugin/ml/src/main/java/org/elasticsearch/xpack/ml/acti…

6bfbe0b

…on/TransportInferTrainedModelDeploymentAction.java Co-authored-by: Jonathan Buttner <[email protected]>

add a test

2177f6b

davidkyle merged commit d721d6f into elastic:main Oct 2, 2023

davidkyle deleted the order-multiple-inputs branch October 2, 2023 20:57

davidkyle mentioned this pull request Oct 6, 2023

[ML] Fix empty requests being sent to nodes with the model allocations #100388

Merged

droberts195 mentioned this pull request Oct 9, 2023

[CI] IndexingIT testIndexing failing #100371

Closed
