Skip to content
This repository has been archived by the owner on Sep 12, 2023. It is now read-only.

Skip some logics for terminated job and add PodGroup reconcile loop #93

Merged
merged 2 commits into from
May 17, 2020

Conversation

Jeffwan
Copy link
Member

@Jeffwan Jeffwan commented May 17, 2020

Resolve #92 #94

  1. Skip checking activeDeadline or backoffLimit if job terminated
  2. Add PodGroup reconcile logic

This is missing in kubeflow/common. We need this to make sure minAvailableReplicas is correct in PodGroup for each training job

Signed-off-by: Jiaxin Shan <[email protected]>
@kubeflow-bot
Copy link

This change is Reviewable

@ChanYiLin
Copy link
Member

/LGTM

Copy link
Member

@terrytangyuan terrytangyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm
/approve

@k8s-ci-robot
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: terrytangyuan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit a63fa3d into kubeflow:master May 17, 2020
georgkaleido pushed a commit to georgkaleido/common that referenced this pull request Jun 9, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Skip check active deadline and backoff limit if job is terminated
5 participants