-
Notifications
You must be signed in to change notification settings - Fork 221
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MPI-Operator run example failed #464
Comments
I try to run the mpi-operator-0.3.0/examples/horovod/tensorflow-mnist-elastic.yaml kubectl logs tensorflow-mnist-elastic-launcher-rjlbk |
This is a duplicate of #445 If you can send a fix, I'm happy to review. Otherwise, you will have to resort to the v1 controller. |
I setup the mpi-operator v0.3.0
and try to deploy the example:
mpi-operator-0.3.0/examples/horovod/tensorflow-mnist.yaml
but it seem can not run it correctly:
NAME READY STATUS RESTARTS AGE
tensorflow-mnist-launcher-ffkrh 0/1 Error 4 2m39s
tensorflow-mnist-worker-0 1/1 Running 0 2m39s
tensorflow-mnist-worker-1 1/1 Running 0 2m39s
#kubectl logs tensorflow-mnist-launcher-ffkrh
Failed to add the host to the list of known hosts (/root/.ssh/known_hosts).
Failed to add the host to the list of known hosts (/root/.ssh/known_hosts).
Permission denied, please try again.
Permission denied, please try again.
[email protected]: Permission denied (publickey,password).
The text was updated successfully, but these errors were encountered: