
[ML] track inference model feature usage per node #79752

Conversation

benwtrent (Member) commented Oct 25, 2021

This adds feature usage tracking for deployed inference models. The models are tracked under the existing inference feature, and the usage entries contain context related to the model ID.

I decided to track the feature via the allocation task to keep the logic similar between allocation tasks and licensed persistent tasks.

closes: #76452
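
For illustration only, a minimal sketch of how per-node usage tracking can be tied to a task's lifecycle. LicensedFeature and XPackLicenseState are the real x-pack license classes; the task class, field names, lifecycle hooks, and context value below are assumptions, not the actual code in this PR.

    import org.elasticsearch.license.LicensedFeature;
    import org.elasticsearch.license.XPackLicenseState;

    // Hypothetical sketch: start tracking the licensed feature when the model deployment
    // starts on this node, and stop tracking when the deployment is torn down.
    class ExampleDeploymentTask {
        private final XPackLicenseState licenseState;
        private final LicensedFeature.Persistent inferenceFeature;
        private final String modelId;

        ExampleDeploymentTask(XPackLicenseState licenseState,
                              LicensedFeature.Persistent inferenceFeature,
                              String modelId) {
            this.licenseState = licenseState;
            this.inferenceFeature = inferenceFeature;
            this.modelId = modelId;
        }

        void onStart() {
            // Record that inference is in use on this node, with the model ID as the context.
            inferenceFeature.startTracking(licenseState, modelId);
        }

        void onStopped() {
            // Clear the usage entry for this model when the deployment goes away.
            inferenceFeature.stopTracking(licenseState, modelId);
        }
    }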

@elasticmachine elasticmachine added the Team:ML (Meta label for the ML team) label Oct 25, 2021
@elasticmachine (Collaborator) commented:

Pinging @elastic/ml-core (Team:ML)

@@ -406,13 +406,13 @@ void featureUsed(LicensedFeature feature) {
         usage.put(new FeatureUsage(feature, null), epochMillisProvider.getAsLong());
     }
 
-    void enableUsageTracking(LicensedFeature feature, String contextName) {
+    public void enableUsageTracking(LicensedFeature feature, String contextName) {
benwtrent (Member, Author) commented:

@rjernst I made these public for testing purposes; I need to check whether tracking is enabled/disabled for inference from a different package.

davidkyle (Member) replied:

It's a shame to do this just so TrainedModelDeploymentTaskTests can verify that TrainedModelDeploymentTask calls start/stopTracking. Instead of statically importing MachineLearning.ML_MODEL_INFERENCE_FEATURE, you could pass the LicensedFeature in as a ctor parameter and mock a LicensedFeature in the tests, which can then be verified.
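
Concretely, the suggestion could look roughly like the following test sketch. It reuses the illustrative ExampleDeploymentTask from the description above; the test name and assertions are assumptions (not the actual TrainedModelDeploymentTaskTests), and it assumes LicensedFeature.Persistent is mockable.

    import static org.mockito.Mockito.mock;
    import static org.mockito.Mockito.verify;

    import org.elasticsearch.license.LicensedFeature;
    import org.elasticsearch.license.XPackLicenseState;
    import org.junit.Test;

    public class ExampleDeploymentTaskTests {

        @Test
        public void testStartAndStopTrackingAreCalled() {
            XPackLicenseState licenseState = mock(XPackLicenseState.class);
            // Because the feature is injected through the constructor, the test can pass a
            // mock and verify the interactions instead of relying on a static constant.
            LicensedFeature.Persistent feature = mock(LicensedFeature.Persistent.class);

            ExampleDeploymentTask task = new ExampleDeploymentTask(licenseState, feature, "my-model");

            task.onStart();
            verify(feature).startTracking(licenseState, "my-model");

            task.onStopped();
            verify(feature).stopTracking(licenseState, "my-model");
        }
    }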

benwtrent (Member, Author) replied:

Fixed, I am passing in the value now as @davidkyle suggests, so these changes are removed.

A project member replied:

Note for future reference: there is a test class called MockLicenseState that makes these package-private methods public for testing purposes, so there shouldn't ever be a need to make them public on XPackLicenseState.
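
The pattern that note describes might look roughly like this. This is a hedged sketch: MockLicenseState.createMock(), the verified call, and the feature/context values are assumptions used to illustrate the idea; check the x-pack test framework for the exact API.

    import static org.mockito.Mockito.verify;

    import org.elasticsearch.license.MockLicenseState;
    import org.elasticsearch.xpack.ml.MachineLearning;

    public class MockLicenseStateUsageExample {

        public void exampleVerification() {
            // MockLicenseState is a test helper that widens the package-private tracking
            // methods, so XPackLicenseState itself can keep them package-private.
            MockLicenseState licenseState = MockLicenseState.createMock();

            // ... exercise the code under test with licenseState ...

            // The feature constant and context value here are placeholders for illustration.
            verify(licenseState).enableUsageTracking(MachineLearning.ML_MODEL_INFERENCE_FEATURE, "my-model");
        }
    }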

@davidkyle davidkyle (Member) left a comment

Looks good!

I think it would be useful to track usage of DFA models and PyTorch models separately. I know we consider both to be inference and the code is structured that way, but for reporting it's nice to know which one is used. Can you add a MachineLearning.ML_PYTORCH_INFERENCE_FEATURE field or similar pls?
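
If that split is made, the new constant might be declared roughly like this. A sketch only: the family/name strings and the license level are assumptions, not necessarily the values used in the codebase.

    import org.elasticsearch.license.License;
    import org.elasticsearch.license.LicensedFeature;

    public class MlFeatureConstantsSketch {
        // Hypothetical second feature so PyTorch model inference usage can be reported
        // separately from DFA model inference.
        public static final LicensedFeature.Persistent ML_PYTORCH_MODEL_INFERENCE_FEATURE =
            LicensedFeature.persistent("machine-learning", "pytorch-model-inference", License.OperationMode.PLATINUM);
    }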


@benwtrent benwtrent requested a review from davidkyle October 26, 2021 11:46
@davidkyle davidkyle (Member) left a comment

LGTM

@benwtrent benwtrent added the auto-merge-without-approval (Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!)) label Oct 26, 2021
@elasticsearchmachine elasticsearchmachine merged commit 5ffc4b5 into elastic:master Oct 26, 2021
weizijun added a commit to weizijun/elasticsearch that referenced this pull request Oct 26, 2021
* upstream/master: (209 commits)
  Enforce license expiration (elastic#79671)
  TSDB: Automatically add timestamp mapper (elastic#79136)
  [DOCS]  `_id` is required for bulk API's `update` action (elastic#79774)
  EQL: Add optional fields and limit joining keys on non-null values only (elastic#79677)
  [DOCS] Document range enrich policy (elastic#79607)
  [DOCS] Fix typos in 8.0 security migration (elastic#79802)
  Allow listing older repositories (elastic#78244)
  [ML] track inference model feature usage per node (elastic#79752)
  Remove IncrementalClusterStateWriter & related code (elastic#79738)
  Reuse previous indices lookup when possible (elastic#79004)
  Reduce merging in PersistedClusterStateService (elastic#79793)
  SQL: Adjust JDBC docs to use milliseconds for timeouts (elastic#79628)
  Remove endpoint for freezing indices (elastic#78918)
  [ML] add timeout parameter for DELETE trained_models API (elastic#79739)
  [ML] wait for .ml-state-write alias to be readable (elastic#79731)
  [Docs] Update edgengram-tokenizer.asciidoc (elastic#79577)
  [Test][Transform] fix UpdateTransformActionRequestTests failure (elastic#79787)
  Limit CS Update Task Description Size (elastic#79443)
  Apply the reader wrapper on can_match source (elastic#78988)
  [DOCS] Adds new transform limitation item and a note to the tutorial (elastic#79479)
  ...

# Conflicts:
#	server/src/main/java/org/elasticsearch/index/IndexMode.java
#	server/src/test/java/org/elasticsearch/index/TimeSeriesModeTests.java
@benwtrent benwtrent deleted the feature/ml-track-model-inference-deployments branch October 26, 2021 16:55
lockewritesdocs pushed a commit to lockewritesdocs/elasticsearch that referenced this pull request Oct 28, 2021
This adds feature usage tracking for deployed inference models. The models are tracked under the existing inference feature and contain context related to the model ID. I decided to track the feature via the allocation task to keep the logic similar between allocation tasks and licensed persistent tasks. closes: elastic#76452
Labels
auto-merge-without-approval (Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!)), :ml (Machine learning), >non-issue, Team:ML (Meta label for the ML team), v8.0.0-beta1
Development

Successfully merging this pull request may close these issues.

[ML] track supervised model usage in feature tracking
6 participants