[workload] WaitForPodsReady: Requeue at the back of the queue after timeout #689
Hi @trasc. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
2285aad to 6464e21
/unhold
/ok-to-test
cc @mimowo
68edd48 to 3e4fa75
c981389 to cc766a8
CHANGELOG/CHANGELOG-0.4.md (Outdated)
- Move the workloads evicted due to pods ready timeout to the end of the queue. #689

## Production Readiness
uhm... this was a mistake... they should all be ###
/hold
cc766a8 to 61eb227
/hold cancel
c3d9155 to 749d43c
/hold cancel
I thought we had configuration that prevented merge commits.
6c695a3 to 4f3018e
@@ -127,6 +127,17 @@ func (r *WorkloadReconciler) Reconcile(ctx context.Context, req ctrl.Request) (c
	ctx = ctrl.LoggerInto(ctx, log)
	log.V(2).Info("Reconciling Workload")

	// if a pods ready timeout eviction is ongoing.
	if evictionCond := apimeta.FindStatusCondition(wl.Status.Conditions, kueue.WorkloadEvicted); evictionCond != nil && evictionCond.Status == metav1.ConditionTrue &&
		evictionCond.Reason == kueue.WorkloadEvictedByPodsReadyTimeout &&
Maybe I said this before, but I think it shouldn't be conditional on the reason.
But we can standardize with preemption in a follow up, and based on the outcome of #510
Leaving this comment for future reference.
The new condition is set when a workload is preempted or its pods ready timeout expires. In the case of a pods ready timeout, the condition's transition timestamp is used to order the workloads in the scheduling queues.
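To illustrate the ordering described above, here is a minimal, self-contained sketch. It is not the code from this PR: `Condition`, `Workload`, `queueOrderTimestamp`, `sortForQueue`, and `demoWorkloads` are hypothetical stand-ins, the reason string is an assumed value, and the sketch deliberately ignores everything else a real queue would consider (for example priority). It only shows how substituting the eviction condition's transition time for the creation time pushes a timed-out workload behind workloads created earlier than the eviction.

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// Condition is a simplified, hypothetical stand-in for a status condition.
type Condition struct {
	Type               string
	Status             string // "True", "False" or "Unknown"
	Reason             string
	LastTransitionTime time.Time
}

// Workload is a simplified, hypothetical stand-in for a queued workload.
type Workload struct {
	Name       string
	Created    time.Time
	Conditions []Condition
}

const (
	conditionEvicted       = "Evicted"
	reasonPodsReadyTimeout = "PodsReadyTimeout" // assumed value, for illustration only
)

// queueOrderTimestamp returns the timestamp used for ordering: the eviction
// transition time when the workload was evicted due to a pods ready timeout,
// and the creation time otherwise.
func queueOrderTimestamp(w Workload) time.Time {
	for _, c := range w.Conditions {
		if c.Type == conditionEvicted && c.Status == "True" && c.Reason == reasonPodsReadyTimeout {
			return c.LastTransitionTime
		}
	}
	return w.Created
}

// sortForQueue orders workloads oldest-first by their ordering timestamp.
func sortForQueue(ws []Workload) {
	sort.SliceStable(ws, func(i, j int) bool {
		return queueOrderTimestamp(ws[i]).Before(queueOrderTimestamp(ws[j]))
	})
}

// demoWorkloads builds three workloads; "a" is the oldest but was evicted
// ten minutes after its creation because its pods did not become ready.
func demoWorkloads() []Workload {
	t0 := time.Date(2023, 1, 1, 0, 0, 0, 0, time.UTC)
	return []Workload{
		{Name: "a", Created: t0, Conditions: []Condition{{
			Type:               conditionEvicted,
			Status:             "True",
			Reason:             reasonPodsReadyTimeout,
			LastTransitionTime: t0.Add(10 * time.Minute),
		}}},
		{Name: "b", Created: t0.Add(1 * time.Minute)},
		{Name: "c", Created: t0.Add(2 * time.Minute)},
	}
}

func main() {
	ws := demoWorkloads()
	sortForQueue(ws)
	for _, w := range ws {
		fmt.Println(w.Name) // "a" comes last despite being created first
	}
}
```

In this sketch a workload evicted for any other reason keeps its creation-time position, which mirrors the reason check in the diff above; the review comment suggests that this restriction may be revisited in a follow-up.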
4f3018e to 468cc41
/lgtm
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: alculquicondor, trasc The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind feature
What this PR does / why we need it:
A new workload condition, Evicted, is added, which gets set when a workload is preempted or its pods ready timeout expires. In the case of a PodsReady timeout, the condition's transition timestamp is used in scheduler sorting, so the workload is moved to the end of the queue.
Which issue(s) this PR fixes:
Fixes #599
Special notes for your reviewer:
The preemption-based Eviction will be used in solving #510.