no matches for tensorflow.org/, Kind=TfJob #173
Comments
Thanks for replying. Let me have a look at the comment and refs.
@jlewi I checked my case against your comment in #149. The TF Job CRD was not actually created, but the TF Job deployment and pod are running. I looked at the logs of the TF Job pod and found that it may be a namespace permission problem.
I forgot to set
I confirmed the Docker image exists. I also confirmed the bucket is public. Could this be an issue with gcr.io being blocked by the firewall? In the meantime you should be able to build the Docker image from the repo using https://github.com/tensorflow/k8s/blob/master/examples/tf_sample/build_and_push.py
Ok, I'm going to build the Docker image. Thanks!
@jlewi I noticed that the build_and_push.py file calls subprocess directly to build and push the Docker image. I think replacing that with the Docker SDK would be better and more portable. What do you think?
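For illustration, a minimal sketch of what that could look like with the Docker SDK for Python (the `docker` package, version 3.0 or later assumed, where `images.build` returns an image and a log generator). The function name `build_and_push`, its parameters, and the repository name in the usage note are hypothetical and not taken from the script in the repo.

```python
def build_and_push(client, context_dir, repository, tag="latest"):
    """Build an image from context_dir and push it to its registry.

    `client` is assumed to behave like docker.DockerClient
    (for example the object returned by docker.from_env()).
    """
    reference = f"{repository}:{tag}"

    # images.build returns the built image plus an iterator of log entries.
    _image, build_logs = client.images.build(path=context_dir, tag=reference)
    for entry in build_logs:
        if "stream" in entry:
            print(entry["stream"], end="")

    # With stream=True and decode=True, push yields dicts; a failed push
    # is reported as an entry carrying an "error" key.
    for entry in client.images.push(repository, tag=tag, stream=True, decode=True):
        if "error" in entry:
            raise RuntimeError(entry["error"])

    return reference
```

It would be called as `build_and_push(docker.from_env(), ".", "example.invalid/tf_sample", "dev")` after `import docker`. Passing the client in as an argument keeps the function free of any dependency on a local `docker` binary and makes it easy to exercise with a stub.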
Anyway, the issue of
Hi @jlewi, is there a solution for the same issue when tf-job-operator is deployed by Kubeflow instead of Helm?
I have the same problem. How can I fix it? describe:
I set up a configMap for a bare-metal environment and ran into a problem when I created a TF job. I'm sure that I followed the instructions in the README.md file step by step. The error log is attached below: