-
Notifications
You must be signed in to change notification settings - Fork 262
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix Pod finalizers for succeeded groups #1905
Fix Pod finalizers for succeeded groups #1905
Conversation
✅ Deploy Preview for kubernetes-sigs-kueue ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
76993cb
to
24ca225
Compare
Change-Id: I7f17d51df66de6766af44e48f45b4165e8bd3e30
24ca225
to
b59df4c
Compare
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: alculquicondor The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Basically, lgtm
if err := r.finalizeJob(ctx, job); err != nil { | ||
return ctrl.Result{}, err | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since this PR, we always finalize the Job when the Job has WorkloadFinished condition.
So, is this workload-controller mechanism still needed?
kueue/pkg/controller/core/workload_controller.go
Lines 158 to 160 in 5023dd2
if len(wl.ObjectMeta.OwnerReferences) == 0 && !wl.DeletionTimestamp.IsZero() { | |
return ctrl.Result{}, workload.RemoveFinalizer(ctx, r.client, &wl) | |
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It still helps in the case of a rogue user manually deleting finalizers.
But I'm also never sure which cases we are missing when dealing with finalizers. So I prefer to keep it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That makes sense.
Thank you for the clarification!
/lgtm
/cherry-pick release-0.6 |
@alculquicondor: once the present PR merges, I will cherry-pick it on top of release-0.6 in a new PR and assign it to you. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
LGTM label has been added. Git tree hash: 7cd9ff8e054e24d92b08bacc57feb6e3349a5703
|
Let's do a few runs 1/5 passed. /test pull-kueue-test-integration-main |
/hold |
/cherry-pick website |
@alculquicondor: once the present PR merges, I will cherry-pick it on top of website in a new PR and assign it to you. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Oops, I put the |
2/5 passed. /test pull-kueue-test-integration-main |
3/5 passed. /test pull-kueue-test-integration-main |
4/5 passed. /test pull-kueue-test-integration-main |
@alculquicondor: new pull request created: #1908 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@alculquicondor: new pull request created: #1909 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-0.6 |
@alculquicondor: new pull request created: #1916 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Change-Id: I7f17d51df66de6766af44e48f45b4165e8bd3e30
/release-note-edit
|
Change-Id: I7f17d51df66de6766af44e48f45b4165e8bd3e30
What type of PR is this?
/kind bug
What this PR does / why we need it:
The initial attempt to clear finalizers after declaring a Workload as finished might fail partially. At this point, some succeeded pods might be gone, so the group can no longer be interpreted as finished. But we shouldn't be checking that again.
Which issue(s) this PR fixes:
Fixes #1898
Special notes for your reviewer:
Does this PR introduce a user-facing change?