[ML] Merge the pytorch-inference feature branch #73660
Conversation
Initial start/stop trained model deployment actions.
Adds the model_type field to TrainedModelConfig for distinguishing between models that can be loaded via the model loading service and those that require a native process.
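A minimal sketch of how a client might branch on the new field; the config shape and the field values shown (`"pytorch"`, `"tree_ensemble"`) are assumptions for illustration, not taken from this PR:

```python
# Hypothetical TrainedModelConfig fragment; field values are assumptions.
trained_model_config = {
    "model_id": "my-bert-model",   # illustrative id
    "model_type": "pytorch",       # assumed to require the native inference process
}

def needs_native_process(config):
    """True when the model cannot be loaded by the model loading service
    and must run in a native process (assumed convention)."""
    return config.get("model_type") == "pytorch"
```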
This adds a temporary API for doing inference against a trained model deployment.
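The deployment lifecycle described above might look like the following sketch; the URL paths are assumptions for illustration and are not taken from the PR:

```python
# Hypothetical endpoint paths for the deployment lifecycle; the actual
# routes registered by the PR may differ.
BASE = "_ml/trained_models"

def start_deployment_path(model_id: str) -> str:
    return f"{BASE}/{model_id}/deployment/_start"

def infer_path(model_id: str) -> str:
    return f"{BASE}/{model_id}/deployment/_infer"

def stop_deployment_path(model_id: str) -> str:
    return f"{BASE}/{model_id}/deployment/_stop"
```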
Introduces code for reassembling the individual chunks a model is stored in and streaming those chunks to the inference process. It reuses the TrainedModelDefinitionDoc format already defined for boosted tree models.
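The reassembly step can be sketched as sorting chunk documents by position and concatenating their decoded payloads; the `doc_num` and `definition` field names here are assumptions modelled on the description above, not the actual document mapping:

```python
import base64

def reassemble_definition(chunks):
    """Reassemble a model definition from its stored chunk documents.
    The doc_num/definition field names are assumptions for illustration."""
    ordered = sorted(chunks, key=lambda doc: doc["doc_num"])
    return b"".join(base64.b64decode(doc["definition"]) for doc in ordered)

# Example: two out-of-order chunks of a tiny "model".
chunks = [
    {"doc_num": 1, "definition": base64.b64encode(b" world").decode("ascii")},
    {"doc_num": 0, "definition": base64.b64encode(b"hello").decode("ascii")},
]
# reassemble_definition(chunks) -> b"hello world"
```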
Binary data is stored in Lucene base64 encoded; the same data held in a Java String uses 2 bytes (UTF-16) per base64 character, consuming twice the memory required. The compressed binary representation of the models can be stored in BytesReferences more efficiently. For BWC, a new field mapping binary_definition is added to .ml-inference-* and the index version is incremented.
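The memory argument can be checked with a small Python sketch: base64 text is roughly 4/3 the size of the raw bytes, and holding it as a UTF-16 string doubles that again:

```python
import base64

raw = bytes(range(256)) * 4            # stand-in for compressed model bytes (1024 B)
b64_bytes = base64.b64encode(raw)      # compact byte form, akin to a BytesReference
b64_str = b64_bytes.decode("ascii")    # String-like representation of the same text

# UTF-16 spends 2 bytes per base64 character, doubling the footprint.
utf16_size = len(b64_str.encode("utf-16-le"))
assert utf16_size == 2 * len(b64_bytes)
assert len(b64_bytes) > len(raw)       # base64 already inflates by ~4/3
```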
This adds a location field to TrainedModelConfig for large models that cannot be PUT inline with the config. Large models are reassembled from their location.
Adds tokenisation for BERT models via the WordPiece algorithm, using the vocabulary defined with the model, and introduces the concept of NLP tasks. Each task is configured with a BERT model supporting that task; pre-processing and post-processing are defined by the task. Named Entity Recognition and Fill Mask are the two task types supported by this PR.
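The WordPiece step can be sketched as a greedy longest-match-first loop over each word; this is a simplified illustration following the BERT convention (`##` marks continuation pieces), not the Elasticsearch implementation:

```python
def wordpiece_tokenize(word, vocab, unk="[UNK]", max_chars=100):
    """Greedy longest-match-first WordPiece over a single word.
    Simplified sketch; vocab is a set of pieces, '##' prefixes continuations."""
    if len(word) > max_chars:
        return [unk]
    tokens, start = [], 0
    while start < len(word):
        end, piece = len(word), None
        while start < end:
            sub = word[start:end]
            if start > 0:
                sub = "##" + sub       # continuation pieces carry the prefix
            if sub in vocab:
                piece = sub
                break
            end -= 1                    # shrink the candidate and retry
        if piece is None:
            return [unk]                # no piece matches: whole word is unknown
        tokens.append(piece)
        start = end
    return tokens

vocab = {"un", "##aff", "##able"}
# wordpiece_tokenize("unaffable", vocab) -> ["un", "##aff", "##able"]
```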
Pinging @elastic/ml-core (Team:ML)
Pinging @elastic/clients-team (Team:Clients)
Early review comments, very excited for this functionality.
Resolved review comments (now outdated) on:
- rest-api-spec/src/main/resources/rest-api-spec/api/ml.start_deployment.json
- rest-api-spec/src/main/resources/rest-api-spec/api/ml.stop_deployment.json
Thanks for jumping in with an early review, @sethmlarson.
👍 This makes sense to me, I'll raise it with the team. I've missed out the spec of the These APIs may be in flux for a short while as we work through all the use cases. Is that a problem for the clients team? Would you prefer us to tell you when we've settled on something we like?
@davidkyle It's no problem for us that these APIs may change, especially if they're experimental/on
Looks good from an API spec perspective 🎉 One comment I was unsure about.
Resolved review comment on:
- rest-api-spec/src/main/resources/rest-api-spec/api/ml.infer_trained_model_deployment.json
LGTM
The feature branch contains changes to configure PyTorch models with a TrainedModelConfig and defines a format to store the binary models. The _start and _stop deployment actions control the model lifecycle, and the model can be directly evaluated with the _infer endpoint. Two types of NLP tasks are supported: Named Entity Recognition and Fill Mask.

The feature branch consists of these PRs: