-
Notifications
You must be signed in to change notification settings - Fork 834
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allow access to puid within the predict API #795
Comments
It might be helpful to pass opentracing active span via the context also. For example, in the python model wrapper, I can't find a clean way to pass the span that flask created to the model's functions, so we end up with two disjoint set of traces. |
@pisymbol there is an optional This |
@phsiao the For example, you can see the following trace where the context was created on the orchestrator (first two traces in yellow) and a 3rd trace was submitted from the Python wrapper (the last one in blue). |
@pisymbol have you had a chance to check if the |
Please reopen if still an issue |
Reopen please! As of 0.5.1, puid is still not available in 'meta' within predict. Verified today. |
Every prediction should come with a transaction ID. This ID could be used for statistics as well as a way to uniquely identify a thread of inference execution (worker process under gunicorn) etc.
Right now, that transaction ID is essentially the puid under the raw Seldon Message which is only accessible via the predict_raw entry point NOT the predict one. It does get returned in the 'meta' but that is not helpful at run-time.
Seldon Core should expose the puid (or some other transaction ID) that model developers can read at the time of each predict call.
Small but related corollary: I actually think an entire Seldon Context should be created and be available at the time of a "predict" call and it could house information such as this transaction ID as well as other useful information such as request IP, custom parameters added via deployment annotations etc. I envision a world for instance where I deploy a model under Seldon and depending on some deployment annotation (say a version string or a debug mode) I can change the model behavior without ever making a single code change. Currently, the only real way to dynamically do this is via sending a model version string as a feature or a canary type deployment (kinda lame if you ask me).
The text was updated successfully, but these errors were encountered: