[v1alpha2] Pods not deleted when job finishes #671

jlewi · 2018-06-15T04:06:03Z

Doesn't look like the pods are being deleted when the job finishes.

This means the pods will still be running and continue to consume resources.

For v1alpha1 we decided that was undesirable and that pods should be deleted when the job finishes.

I think we should preserve that behavior even though it means access to logs depends on cluster level logging.

If a user wants to leave pods running until job finishes they could do this by keeping the processes alive via their code.

jlewi · 2018-06-15T04:06:24Z

/assign @gaocegege

gaocegege · 2018-06-15T04:07:24Z

Is this for v1alpha1 or v1alpha2?

jlewi · 2018-06-15T04:15:53Z

v1alpha2.

Here's the issue related to changing the behavior in v1alpha1 to delete pods when job finishes.
#128

Per that issue, I think this is a real issue for using TFJob. If you launch a job which uses GPUs then that job will continue to consume GPUs even after the job finishes.

I think this is a major barrier to doing actual work.

Bumping this to P0 for this reason and to match priority with which we rated it in v1alpha1.

jlewi · 2018-06-19T12:20:50Z

@yph152 and @gaocegege Any update on when the PR for this issue will be ready? Do you need help?

jlewi added priority/p1 api/v1alpha1 area/0.2.0 labels Jun 15, 2018

k8s-ci-robot assigned gaocegege Jun 15, 2018

jlewi added priority/p0 and removed priority/p1 labels Jun 15, 2018

gaocegege added api/v1alpha2 and removed api/v1alpha1 labels Jun 15, 2018

gaocegege mentioned this issue Jun 19, 2018

[v1alpha2] Need conditions Succeeded and Failed indicating when job is done #673

Closed

gaocegege closed this as completed Jun 20, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[v1alpha2] Pods not deleted when job finishes #671

[v1alpha2] Pods not deleted when job finishes #671

jlewi commented Jun 15, 2018

jlewi commented Jun 15, 2018

gaocegege commented Jun 15, 2018

jlewi commented Jun 15, 2018

jlewi commented Jun 19, 2018

[v1alpha2] Pods not deleted when job finishes #671

[v1alpha2] Pods not deleted when job finishes #671

Comments

jlewi commented Jun 15, 2018

jlewi commented Jun 15, 2018

gaocegege commented Jun 15, 2018

jlewi commented Jun 15, 2018

jlewi commented Jun 19, 2018