-
Notifications
You must be signed in to change notification settings - Fork 710
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
run tfjob failded with self build image #840
Comments
Can you give us the detail of the pod: |
the description of master pod: Name: example-job1-master-0 |
I think the error is from your code, since |
Maybe you could output something in the container, to debug. |
my code is the same as the file |
I think the error is from the image, then. |
But when I use the sample image from the official |
I have built this Dockerfile before I open this issue. My error is caused by this image. So how to build an useful image for tfjob? |
I think you just need to make sure that the tensorflow is installed correctly in the image. |
Thank you,I solve this problem. After I used the |
I build an image by a Dockerfile which is the same as tf-operator/examples/tf_sample/Dockerfile. When I try to apply and run a tfjob. The container is failed without any logs.....
The text was updated successfully, but these errors were encountered: