-
Notifications
You must be signed in to change notification settings - Fork 834
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Seldon not creating services for NVIDIA TRT Deployment #826
Comments
Does it create a deployment and do all the containers in that deployment run? |
It creates a deployment and all the containers run including the seldon-engine and istio proxies. The only thing I can see that is different is the services are not created right. As you mentioned, it should create two services but I only see one service with seldon-{hashcode}. Here are the logs from the manager: https://gist.github.com/damitkwr/a25c9bc0ca5361ccddd662a427fe5ee9 What resources do you want for the |
for the pod created
…On Thu, 29 Aug 2019, 18:34 damitkwr, ***@***.***> wrote:
It creates a deployment and all the containers run including the
seldon-engine and istio proxies. The only thing I can see that is different
is the services are not created right. As you mentioned, it should create
two services but I only see one service with seldon-{hashcode}.
Here are the logs from the manager:
https://drive.google.com/file/d/1ylf4I6kz_VdWr-nyo1U51sqAFivu1r8-/view?usp=sharing
What resources do you want for the kubectl describe command?
—
You are receiving this because you were assigned.
Reply to this email directly, view it on GitHub
<#826?email_source=notifications&email_token=ACQS4AZA57Y3O5PGX2A4E43QHACD5A5CNFSM4IRPTMZKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD5PH5GY#issuecomment-526286491>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ACQS4A6YN7JSYI557J4OKGLQHACD5ANCNFSM4IRPTMZA>
.
|
@cliveseldon I put the logs in a gist now. You should be able to view it. |
Your container failed. |
The was the previous state. If you look at the state now it says its running |
Sorry yes. Are you able to get the raw logs using |
For some reason, the seldon-operator-controller-manager pod is not found as a pod with |
There should be a pod in seldon-system. |
Nvm, I was in the wrong namespace. Here are the logs: https://gist.github.com/damitkwr/1d151c6c7786967ad04c1b0f6eeaaaad |
ok - found the issue. There is a |
For further background. We need to compare the Deployment we want with what is running and if different then update. This is made complex by Kubernetes adding defaults to some fields. |
Thanks for helping to find this issue. Hopefull we can do a fix tomorrow. |
No worries, it's awesome that you guys can respond this fast! |
I have updated - can you check with seldonio/seldon-core-operator:0.4.1-SNAPSHOT latest |
It works now. Awesome! Thank you! |
Hi, here is the yaml to reproduce the action and feel free to substitute the images for actual NVIDIA TRT deployments. This exact YAML works on seldon v0.3.1 but does not work for v0.3.2 and above. This YAML will only create one service. The issue is discussed in detail in: https://seldondev.slack.com/archives/C8Y9A8G0Y/p1567009614010500
The text was updated successfully, but these errors were encountered: