
Define future approach to metadata and metrics with new Executor #1362

Closed
RafalSkolasinski opened this issue Jan 23, 2020 · 5 comments
RafalSkolasinski commented Jan 23, 2020

What this issue is about

Very soon the Executor, a Go replacement for the currently used Java Engine, will be merged into master.

In this context we need to define our future approach to metadata and metrics: how they will ideally be handled in the future, and how we will avoid breaking backward compatibility during the period when we support both the Executor and the Engine.

To help the discussion, I have tried to put together a summary of the current state of affairs.
As the topic seems to cause some confusion, especially on the semantic level, the description of this issue may be a bit lengthy; apologies for that.

Definitions

Metadata (model):

  • Model-specific information: model name, author, training dataset, input shape and type, etc.
  • May or may not be dynamic (depending on whether it is exposed)
  • Does not depend on incoming requests

Metadata (request):

  • This is the metadata that is sent with each request / response
  • This is the metadata of the request itself (e.g. sender or unique identifier)
  • May depend on external resources (e.g. an external API version at the time of request processing)
  • In the future may be included in the header of the HTTP request
  • It is part of the SeldonMessage

Metrics:

  • Should represent current state of model / deployment
  • Standard DevOps metrics (like number of requests per second, CPU usage) are provided by default and exposed via an endpoint for Prometheus
  • User-defined custom metrics are returned with responses (Seldon Engine)
  • In the Executor each container will expose its own metrics endpoint (picked up by Prometheus)

Now: only Java Engine

User-provided models can define two methods: metrics and tags.
Both of these are included in response.meta when the /predict endpoint is called.
All of that logic is defined in the Python wrapper, not in the Java Engine.

Metrics method:

  • Return type: list of dictionaries
  • Directly used in user_model.py::client_custom_metrics method
  • Returned in response.meta.metrics of SeldonMessage
  • Commonly used for user-defined custom metrics

Tags method:

  • Return type: dictionary
  • Directly used in user_model.py::client_custom_tags method
  • Returned in response.meta.tags of SeldonMessage
  • Commonly used to return request metadata
  • Problem: the predict method must modify the model's current state in order to provide the data returned by this method
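As a sketch (class, metric, and tag names here are illustrative, not from the source), a user model using these two hooks might look like the following. Note how predict has to stash request-derived data on self so that tags can later return it, which is exactly the state-mutation problem noted above:

```python
import threading

class MyModel:
    """Illustrative user model for the Python wrapper (names are hypothetical)."""

    def __init__(self):
        self._lock = threading.Lock()
        self._last_tags = {}

    def predict(self, X, feature_names=None):
        # predict must mutate model state so that tags() can later
        # return per-request information -- the problem noted above.
        with self._lock:
            self._last_tags = {"request_shape": str(len(X))}
        return X

    def metrics(self):
        # Return type: list of dictionaries (ends up in response.meta.metrics).
        return [{"type": "COUNTER", "key": "my_requests_total", "value": 1}]

    def tags(self):
        # Return type: dictionary (ends up in response.meta.tags).
        return self._last_tags
```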

Future: compatibility with both Executor and Engine

For some time we will be supporting both the Executor and the Engine.
The Python wrapper must therefore be compatible with both.
The wrapper will know whether it is working with the Executor or the Engine through environment variables.

Go Executor:

Metadata:

  • User-provided models can define a metadata method.
  • Output from that method will be exposed via /metadata endpoint.
  • It contains pure model metadata in the context of the above definitions.
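A hedged sketch of what such a metadata method could return (the field names and schema here are illustrative; the source does not fix a schema):

```python
class MyModel:
    def metadata(self):
        # Static, request-independent model metadata, served via /metadata.
        # Field names below are an assumption for illustration only.
        return {
            "name": "my-model",
            "versions": ["v0.1"],
            "inputs": [{"name": "input", "datatype": "FP32", "shape": [1, 4]}],
            "outputs": [{"name": "output", "datatype": "FP32", "shape": [1]}],
        }
```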

Metrics:

  • The Executor's /metrics endpoint will only contain general metrics
  • Custom metrics will be exposed on the /metrics endpoint of each container (and consumed by Prometheus)
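For illustration, here is a minimal stdlib-only sketch of a container exposing a custom metric in the Prometheus text exposition format (in practice one would use a client library such as prometheus_client; the metric name and port are hypothetical):

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
import threading

# Hypothetical custom counter, hand-rolled for illustration only.
_predictions_total = 0
_lock = threading.Lock()

def record_prediction():
    global _predictions_total
    with _lock:
        _predictions_total += 1

def render_metrics():
    # Prometheus text exposition format: "name value" lines.
    return ("# TYPE model_predictions_total counter\n"
            f"model_predictions_total {_predictions_total}\n")

class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path == "/metrics":
            body = render_metrics().encode()
            self.send_response(200)
            self.send_header("Content-Type", "text/plain; version=0.0.4")
            self.end_headers()
            self.wfile.write(body)
        else:
            self.send_response(404)
            self.end_headers()

# To serve the endpoint for Prometheus to scrape, one would run:
#   HTTPServer(("", 8000), MetricsHandler).serve_forever()
```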

Java Engine:

Maintain compatibility with the current functionality until we deprecate the Java Engine.

Problems / objectives

  • What to do with the current metrics and tags methods? Shall these get deprecated?
  • How to provide backward compatibility with models defined to work with pre-Executor seldon-core? These may use metrics and tags in a non-trivial manner.
  • Shall per-request metadata be appended to the response?

Main questions:

  • How do we want to handle these things (especially custom metrics) in the future?
  • How do we keep compatibility with earlier defined models?

P.S. I did my best to get these points right. However, if something is wrong or confusing, please leave a comment or let me know and we'll try to clarify it.

@RafalSkolasinski added the "triage" label Jan 23, 2020
@RafalSkolasinski

Ideally, this issue will lead us to a clear new picture of how we want to handle these things, or to a set of well-defined issues that precisely describe the desired functionality.

@RafalSkolasinski

RafalSkolasinski commented Jan 27, 2020

I believe the equivalent of our request metadata in the kfserving dataplane proposal would be $parameters, present in $inference_request, $request_input, $request_output and $inference_response.

Actually, I think I was not exactly correct. It seems that $parameters are more about controlling the inference process.

I am not sure, then, whether request metadata is present in this dataplane proposal at all.

@RafalSkolasinski

The request metadata is in principle metadata about the request itself.

It can be immutable data that comes as part of the request, for example:

  • some ID of the user who sent the request (for audit purposes)
  • a unique request identifier (could be useful in batch processing)

This data should be exposed to the predict method but guaranteed to be immutable.

The other example of request metadata, provided by @adriangonz, is recording the versions of external libraries accessed while processing the request. This type of data is:

  • unknown to the sender at the time of the request
  • liable to change between requests

We should provide an easy and thread-safe way to include that kind of metadata in the request's response.
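One possible thread-safe approach (a sketch under my own assumptions, not a design from this thread) is to keep per-request metadata in a contextvars.ContextVar, so each request sees its own isolated copy and predict never has to mutate shared model state:

```python
import contextvars

# Hypothetical per-request metadata store; contextvars gives each
# thread or asyncio task its own value, isolating concurrent requests.
_request_meta = contextvars.ContextVar("request_meta")

def start_request(incoming_meta):
    # Immutable metadata that arrived with the request
    # (e.g. sender ID, unique request identifier).
    _request_meta.set(dict(incoming_meta))

def add_response_meta(key, value):
    # Metadata discovered during processing
    # (e.g. an external library version used for this request).
    meta = dict(_request_meta.get({}))
    meta[key] = value
    _request_meta.set(meta)

def response_meta():
    # Collected metadata to attach to the outgoing response.
    return _request_meta.get({})
```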

@RafalSkolasinski

Reference: kfserving dataplane proposal

@ukclivecox ukclivecox removed this from the 1.2 milestone Apr 23, 2020
@ukclivecox

already in progress in other issues
