Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature] Support SuccessPolicy/FailurePolicy Based on % of Succeeded/Failed Workers #1188

Closed
gaocegege opened this issue Sep 9, 2020 · 4 comments

Comments

@gaocegege
Copy link
Member

We now support allworkers in success/failure policy. Users want to make the Job succeeded/failed based on the percentage of the succeeded/failed workers. I think we can support it in the operator side.

/cc @DeliangFan @terrytangyuan @ChanYiLin

@issue-label-bot
Copy link

Issue-Label Bot is automatically applying the labels:

Label Probability
area/operator 0.61
feature 0.99

Please mark this comment with 👍 or 👎 to give our bot feedback!
Links: app homepage, dashboard and code for this bot.

@gaocegege
Copy link
Member Author

/priority p1

@gaocegege
Copy link
Member Author

We think a Job failed when:

  • Chief fails (it reaches backofflimit) or
  • Any PS fails (it reaches backofflimit) or
  • percentage of workers (they all reach backofflimit) fail

@jlewi jlewi added kind/feature and removed feature labels Sep 9, 2020
@stale
Copy link

stale bot commented Dec 11, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot closed this as completed Dec 19, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants