-
Notifications
You must be signed in to change notification settings - Fork 791
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update TFJob docs with information about monitoring. #107
Conversation
* Move over the documentation in kubeflow/tf-operator/README.md and update it for v1alpha2. * Add a what is TFJob section that describes the TFJob * Provide information about the spec * Provide information about exit codes. * Provide stackdriver sections for both the legacy and new Kubernetes stackdriver. * Add a troubleshooting section.
* We should direct users to the official documentation and remove the documentation in the README.md * kubeflow/website#107 will update the website to include all information currently in the README as well as other information * README.md is outdated; e.g. it is still referring to v1alpha1 Fix kubeflow#760
/assign @kunmingg |
* We should direct users to the official documentation and remove the documentation in the README.md * kubeflow/website#107 will update the website to include all information currently in the README as well as other information * README.md is outdated; e.g. it is still referring to v1alpha1 Fix #760
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This document in general looks great and really informative.
@@ -74,14 +204,14 @@ Typically you will want to change the following values | |||
|
|||
1. Attach PV's if you want to use PVs for storage. | |||
|
|||
## Submitting a TensorFlow training job | |||
### Submitting a TensorFlow training job |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Lines 207 - 271 seem to be duplicate of lines 143-206.
/lgtm |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jlewi The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Move over the documentation in kubeflow/tf-operator/README.md and
update it for v1alpha2.
Add a what is TFJob section that describes the TFJob
Provide stackdriver sections for both the legacy and new Kubernetes
stackdriver.
Add a troubleshooting section.
This change is