Updating inference logic to add node level request-response logging #3874

Merged
3 commits merged into SeldonIO:master on Feb 2, 2022

Conversation

SachinVarghese
Contributor

What this PR does / why we need it:
This PR corrects the payload logging mechanism in the Seldon executor so that node-level request-response pairs are logged in complex inference graphs.
Which issue(s) this PR fixes:
Fixes #3873
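
To make the intended behaviour concrete, here is a minimal, self-contained Go sketch of the idea (the types are simplified stand-ins, not the executor's actual API): each node in the inference graph carries an optional Logger, and the executor emits that node's request and response payloads according to the logger's configured mode. The point of the fix is that this guard runs per node, so intermediate request-response pairs in a multi-node graph are logged, not just the top-level ones.

package main

import "fmt"

type LogMode string

const (
	LogAll      LogMode = "all"
	LogRequest  LogMode = "request"
	LogResponse LogMode = "response"
)

type Logger struct{ Mode LogMode }

type Node struct {
	Name     string
	Logger   *Logger // nil when payload logging is disabled for this node
	Children []*Node
}

// logPayload stands in for the executor's payload logger.
func logPayload(nodeName, kind string, payload []byte, puid string) error {
	fmt.Printf("node=%s kind=%s puid=%s bytes=%d\n", nodeName, kind, puid, len(payload))
	return nil
}

// predict walks the graph and logs request/response pairs per node.
func predict(node *Node, msg []byte, puid string) ([]byte, error) {
	if node.Logger != nil && (node.Logger.Mode == LogRequest || node.Logger.Mode == LogAll) {
		if err := logPayload(node.Name, "request", msg, puid); err != nil {
			return nil, err
		}
	}
	resp := msg // stand-in for the actual model or transformer call
	if node.Logger != nil && (node.Logger.Mode == LogResponse || node.Logger.Mode == LogAll) {
		if err := logPayload(node.Name, "response", resp, puid); err != nil {
			return nil, err
		}
	}
	for _, child := range node.Children {
		var err error
		if resp, err = predict(child, resp, puid); err != nil {
			return nil, err
		}
	}
	return resp, nil
}

func main() {
	graph := &Node{
		Name:   "transformer",
		Logger: &Logger{Mode: LogAll},
		Children: []*Node{
			{Name: "model", Logger: &Logger{Mode: LogResponse}},
		},
	}
	if _, err := predict(graph, []byte(`{"data":[1,2,3]}`), "puid-123"); err != nil {
		fmt.Println("error:", err)
	}
}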

@seldondev
Collaborator

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To complete the pull request process, please assign sachinvarghese
You can assign the PR to them by writing /assign @sachinvarghese in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Comment on lines +103 to +108

// Log Request: emit this node's request payload when its logger is
// configured for request or all logging.
if node.Logger != nil && (node.Logger.Mode == v1.LogRequest || node.Logger.Mode == v1.LogAll) {
	err := p.logPayload(node.Name, node.Logger, payloadLogger.InferenceRequest, msg, puid)
	if err != nil {
		return nil, err
	}
}
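
The quoted diff shows only the request side; the response side presumably has a symmetric guard. A sketch, assuming a v1.LogResponse mode and a payloadLogger.InferenceResponse constant analogous to those above (the response variable is illustrative):

// Log Response (sketch, not part of the quoted diff)
if node.Logger != nil && (node.Logger.Mode == v1.LogResponse || node.Logger.Mode == v1.LogAll) {
	err := p.logPayload(node.Name, node.Logger, payloadLogger.InferenceResponse, response, puid)
	if err != nil {
		return nil, err
	}
}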
Contributor

I'm not a fan of logging inside transformInput. It is not clear from the Predict function, where transformInput is called, that the message may or may not be logged.

Edit: unfortunately, due to the way the code has been structured, this is the easiest way to do what's needed, because the callTransformInput and callModel flags are determined in here.

@SachinVarghese
Contributor (Author)

Yeah, I also tried structuring the code in a different manner to start with, but this seemed easiest considering other aspects of the implementation.
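
For illustration, the restructuring both commenters considered would hoist the logging decision to the Predict call site. A hypothetical sketch only: shouldLogRequest and the surrounding shape are invented here, and only transformInput, node.Logger, and p.logPayload come from the diff and discussion above.

// Hypothetical alternative: decide on logging where transformInput is
// called, so the Predict flow makes it explicit. Not viable as-is, because
// the callTransformInput and callModel flags are determined inside
// transformInput, per the review thread above.
if shouldLogRequest(node) { // invented helper wrapping the node.Logger mode checks
	if err := p.logPayload(node.Name, node.Logger, payloadLogger.InferenceRequest, msg, puid); err != nil {
		return nil, err
	}
}
resp, err := p.transformInput(node, msg) // argument shape is a guess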

@ukclivecox
Contributor

/test integration

@ukclivecox
Contributor

/test notebooks

@RafalSkolasinski
Contributor

/test integration

@RafalSkolasinski
Contributor

The logs of the integration tests are quite confusing; it is not clear whether there was an actual failure or not.

===Flaky Test Report===

test_namespace_update[1.10.0] passed 1 out of the required 1 times. Success!
test_namespace_update[1.11.0] passed 1 out of the required 1 times. Success!
test_namespace_update[1.12.0] passed 1 out of the required 1 times. Success!
test_rolling_update6[ambas] passed 1 out of the required 1 times. Success!
test_rolling_update6[istio] passed 1 out of the required 1 times. Success!
test_rolling_update7[ambas] passed 1 out of the required 1 times. Success!
test_rolling_update7[istio] passed 1 out of the required 1 times. Success!
test_rolling_update8[ambas] failed (4 runs remaining out of 5).
	<class 'AssertionError'>
	assert 503 == 200
  -503
  +200
	[<TracebackEntry /workspace/source/testing/scripts/test_rolling_updates.py:140>]
test_rolling_update8[ambas] failed (3 runs remaining out of 5).
	<class 'AssertionError'>
	assert 503 == 200
  -503
  +200
	[<TracebackEntry /workspace/source/testing/scripts/test_rolling_updates.py:140>]
test_rolling_update8[ambas] failed (2 runs remaining out of 5).
	<class 'AssertionError'>
	assert 500 == 200
  -500
  +200
	[<TracebackEntry /workspace/source/testing/scripts/test_rolling_updates.py:140>]
test_rolling_update8[ambas] passed 1 out of the required 1 times. Success!
test_rolling_update8[istio] passed 1 out of the required 1 times. Success!
test_rolling_update9[ambas] passed 1 out of the required 1 times. Success!
test_rolling_update9[istio] passed 1 out of the required 1 times. Success!

===End Flaky Test Report===

========== 52 passed, 6 skipped, 67 deselected in 6307.24s (1:45:07) ===========
Test returned errors
kind delete cluster
Deleting cluster "kind" ...
Stopping Docker: dockerProgram process in pidfile '/var/run/docker-ssd.pid', 1 process(es), refused to die.
Pipeline failed on stage 'integration-test-task': container 'step-integration-step'. The execution of the pipeline has stopped.

If there was, it could have been the test_rolling_update8[ambas] failure, but why does it not count towards the total number in the summary?

@RafalSkolasinski
Contributor

Actually, I also see a failure on TestPrepack.test_mlflow_v2.

Triggered the tests again anyway, to rule out flakiness first.

@RafalSkolasinski
Contributor

/test integration

@RafalSkolasinski
Contributor

TestPrepack.test_mlflow_v2 seems to have passed; I now see a failure on test_rolling_deployment[graph1.json-graph4.json-True-ambas].

@ukclivecox merged commit bfb30c1 into SeldonIO:master on Feb 2, 2022
Linked issue (#3873): The seldon executor doesn't log node level request-response pairs in complex inference graphs