[ML] improve trained model stats API performance #87978

benwtrent · 2022-06-23T15:01:50Z

Previous, get trained model stats API would build every pipeline defined in cluster state.

This is problematic when MANY pipelines are defined. Especially if those pipelines take some time to parse (consider GROK).

This improvement is part of fixing: #87931

elasticsearchmachine · 2022-06-23T15:02:22Z

Hi @benwtrent, I've created a changelog YAML for you.

elasticmachine · 2022-06-23T15:02:35Z

Pinging @elastic/ml-core (Team:ML)

benwtrent · 2022-06-23T15:02:45Z

Reviewer, issue #87931 is focused on making the API cancellable.

I will do that in a separate PR to reduce the review space and code churn :).

benwtrent · 2022-06-24T17:25:21Z

@elasticmachine update branch

droberts195 · 2022-06-27T10:38:39Z

I know we have a report saying that this got slow after upgrading from 7.17 to 8.x, but I cannot see why this couldn't have been just as slow in 7.17 assuming the same ingest pipelines existed.

Is there a good explanation for why the 7.17 get trained model stats code would not have suffered from the problem of slow Grok processor instantiation? If there isn't then I think this fix should be backported to 7.17.6 (and 8.3.1 too, to get it out to 8.x users sooner).

benwtrent · 2022-06-27T11:42:30Z

@droberts195 the only thing I can think of is that there may be more solutions/apps in Kibana and Elasticsearch making more "system managed" ingest pipelines in 8 vs 7. So, the increase in the number of pipelines pushed this over the edge.

Doing a quick search of any "grok" specific changes, indicates nothing that would make stuff incredibly slower.

It is true, any user with a large number of pipelines (especially complicated GROK ones), may run into this problem. I am good with backporting it to 7.17

droberts195

LGTM apart from a couple of minor nits

.../plugin/ml/src/main/java/org/elasticsearch/xpack/ml/inference/ingest/InferenceProcessor.java

.../ml/src/test/java/org/elasticsearch/xpack/ml/utils/InferenceProcessorInfoExtractorTests.java

.../plugin/ml/src/main/java/org/elasticsearch/xpack/ml/inference/ingest/InferenceProcessor.java

benwtrent · 2022-06-27T18:26:58Z

@elasticmachine update branch

benwtrent · 2022-06-27T19:00:36Z

@elasticmachine update branch

benwtrent · 2022-06-28T11:33:59Z

@elasticmachine update branch

Previous, get trained model stats API would build every pipeline defined in cluster state. This is problematic when MANY pipelines are defined. Especially if those pipelines take some time to parse (consider GROK). This improvement is part of fixing: elastic#87931

elasticsearchmachine · 2022-06-28T12:40:05Z

💔 Backport failed

Status	Branch	Result
❌	7.17	Commit could not be cherrypicked due to conflicts
✅	8.3

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 87978

Previous, get trained model stats API would build every pipeline defined in cluster state. This is problematic when MANY pipelines are defined. Especially if those pipelines take some time to parse (consider GROK). This improvement is part of fixing: #87931

Previous, get trained model stats API would build every pipeline defined in cluster state. This is problematic when MANY pipelines are defined. Especially if those pipelines take some time to parse (consider GROK). This improvement is part of fixing: elastic#87931

Previous, get trained model stats API would build every pipeline defined in cluster state. This is problematic when MANY pipelines are defined. Especially if those pipelines take some time to parse (consider GROK). This improvement is part of fixing: #87931

[ML] improve trained model stats API performance

86627ce

benwtrent added >bug :ml Machine learning v8.4.0 labels Jun 23, 2022

Update docs/changelog/87978.yaml

b18891c

elasticmachine added the Team:ML Meta label for the ML team label Jun 23, 2022

Merge branch 'master' into feature/ml-fix-get-trained-model-stats

cf887d8

benwtrent added v7.17.5 v8.3.1 auto-backport-and-merge labels Jun 27, 2022

droberts195 approved these changes Jun 27, 2022

View reviewed changes

.../plugin/ml/src/main/java/org/elasticsearch/xpack/ml/inference/ingest/InferenceProcessor.java Show resolved Hide resolved

.../ml/src/test/java/org/elasticsearch/xpack/ml/utils/InferenceProcessorInfoExtractorTests.java Outdated Show resolved Hide resolved

benwtrent commented Jun 27, 2022

View reviewed changes

.../ml/src/test/java/org/elasticsearch/xpack/ml/utils/InferenceProcessorInfoExtractorTests.java Outdated Show resolved Hide resolved

benwtrent commented Jun 27, 2022

View reviewed changes

.../plugin/ml/src/main/java/org/elasticsearch/xpack/ml/inference/ingest/InferenceProcessor.java Outdated Show resolved Hide resolved

Apply suggestions from code review

caee509

Merge branch 'master' into feature/ml-fix-get-trained-model-stats

b787124

Merge branch 'master' into feature/ml-fix-get-trained-model-stats

658a07a

Merge branch 'master' into feature/ml-fix-get-trained-model-stats

b64135e

benwtrent merged commit 6847c0b into elastic:master Jun 28, 2022

benwtrent deleted the feature/ml-fix-get-trained-model-stats branch June 28, 2022 12:38

benwtrent mentioned this pull request Jun 28, 2022

[8.3] [ML] improve trained model stats API performance (#87978) #88129

Merged

davidkyle mentioned this pull request May 3, 2023

[CI] MlTrainedModelsUpgradeIT testTrainedModelInference and MLModelDeploymentsUpgradeIT failing #95360

Closed

droberts195 mentioned this pull request Feb 1, 2024

[ML] Deleting a trained model can emit deprecation warnings related to ingest pipeline configs #105004

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] improve trained model stats API performance #87978

[ML] improve trained model stats API performance #87978

benwtrent commented Jun 23, 2022

elasticsearchmachine commented Jun 23, 2022

elasticmachine commented Jun 23, 2022

benwtrent commented Jun 23, 2022

benwtrent commented Jun 24, 2022

droberts195 commented Jun 27, 2022

benwtrent commented Jun 27, 2022

droberts195 left a comment

benwtrent commented Jun 27, 2022

benwtrent commented Jun 27, 2022

benwtrent commented Jun 28, 2022

elasticsearchmachine commented Jun 28, 2022

[ML] improve trained model stats API performance #87978

[ML] improve trained model stats API performance #87978

Conversation

benwtrent commented Jun 23, 2022

elasticsearchmachine commented Jun 23, 2022

elasticmachine commented Jun 23, 2022

benwtrent commented Jun 23, 2022

benwtrent commented Jun 24, 2022

droberts195 commented Jun 27, 2022

benwtrent commented Jun 27, 2022

droberts195 left a comment

Choose a reason for hiding this comment

benwtrent commented Jun 27, 2022

benwtrent commented Jun 27, 2022

benwtrent commented Jun 28, 2022

elasticsearchmachine commented Jun 28, 2022

💔 Backport failed