
scheduler: impose a backoff penalty on gated Pods #126029

Merged
merged 1 commit into from
Aug 28, 2024

Conversation

sanposhiho
Member

What type of PR is this?

/kind feature

What this PR does / why we need it:

To start with the concept of backoff in the scheduler: backoff time is a penalty imposed on Pods that consumed a scheduling cycle but failed to get scheduled and came back to the queue.

Currently, however, all gated Pods are always regarded as not backing off.
That is only correct for a vanilla scheduler, where every Pod gated by SchedulingGates has not been through any scheduling cycle yet and therefore is certainly not backing off.
But a custom PreEnqueue plugin might gate Pods after they have been through some scheduling cycles; that means:

  1. Pods have been through some scheduling cycles.
  2. They get gated by a custom PreEnqueue plugin.
  3. They get un-gated for some reason.
  4. 💥 Whoa! They're moved to activeQ without a backoff penalty.

Gated or not, a Pod is supposed to pay the penalty if it wasted scheduling cycles before.
That is the law in the scheduler: an obligation every Pod must meet before retrying a schedule.

This PR changes isPodBackingoff() to no longer skip gated Pods, so such Pods cannot exploit the loophole, ignore the law, and escape the penalty.

Which issue(s) this PR fixes:

Fixes #125538

Special notes for your reviewer:

Does this PR introduce a user-facing change?

The scheduler retries gated Pods more appropriately, giving them a backoff penalty too.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:


@k8s-ci-robot
Contributor

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@k8s-ci-robot k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. release-note Denotes a PR that will be considered when it comes time to generate release notes. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. kind/feature Categorizes issue or PR as related to a new feature. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jul 11, 2024
@k8s-ci-robot
Contributor

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@k8s-ci-robot k8s-ci-robot added the needs-priority Indicates a PR lacks a `priority/foo` label and requires one. label Jul 11, 2024
@sanposhiho sanposhiho marked this pull request as ready for review July 11, 2024 11:51
@k8s-ci-robot k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 11, 2024
@k8s-ci-robot k8s-ci-robot requested review from damemi and denkensk July 11, 2024 11:52
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sanposhiho

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 11, 2024
@k8s-ci-robot k8s-ci-robot requested a review from kerthcet July 11, 2024 11:52
@k8s-ci-robot k8s-ci-robot added sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. and removed do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Jul 11, 2024
@sanposhiho
Member Author

/hold

to go thru an approver.

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jul 11, 2024
@sanposhiho
Member Author

/cc @alculquicondor

@sanposhiho sanposhiho marked this pull request as draft July 11, 2024 12:09
@k8s-ci-robot k8s-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 11, 2024
@sanposhiho
Member Author

I found out I have to fix some tests. Just converted to WIP for now.

@sanposhiho sanposhiho force-pushed the backoff-preenqueue branch from e4454a6 to 9da01d0 Compare July 12, 2024 03:11
@k8s-ci-robot k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Jul 12, 2024
@sanposhiho sanposhiho marked this pull request as ready for review July 12, 2024 03:12
@k8s-ci-robot
Contributor

LGTM label has been added.

Git tree hash: 9f6ecea95badeb34063e338f2136bb6a905b4cd0

@sanposhiho
Member Author

/assign @alculquicondor
for approval

@sanposhiho
Member Author

Looks like Aldo is on vacation.
/assign @kerthcet

@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 14, 2024
@k8s-ci-robot k8s-ci-robot removed lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Aug 14, 2024
@sanposhiho
Member Author

/cc @macsko @alculquicondor

Just fixed the conflict.

@macsko
Member

macsko commented Aug 14, 2024

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 14, 2024
@k8s-ci-robot
Contributor

LGTM label has been added.

Git tree hash: 6aa2aea7fee9c6b4a12019a9b4be80d5b8432268

pkg/scheduler/internal/queue/scheduling_queue_test.go (Outdated)
@@ -3089,6 +3106,7 @@ scheduler_plugin_execution_duration_seconds_count{extension_point="PreEnqueue",p
for _, test := range tests {
t.Run(test.name, func(t *testing.T) {
resetMetrics()
resetPodInfos()
Member

what is increasing attempts in this case?

Could we recreate podInfos for every case, instead?

Member Author

Some of test.operations (addPodUnschedulablePods) do.

Member Author

Could we recreate podInfos for every case, instead?

Given that all test.operands refer to pInfos / pInfosWithDelay like the following, that'd require a big change in the test implementation, which I want to avoid (at least in this PR):

			operands: [][]*framework.QueuedPodInfo{
				pInfos[:30], // Every test case refers to the same pInfos.
				pInfos[30:],
			},
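To make the trade-off discussed above concrete, here is a minimal Go sketch (an assumed shape, not the real test fixture) of why a shared, package-level fixture needs a reset helper between table-driven cases: one case's mutation of Attempts would otherwise leak into the next case.

```go
package main

import "fmt"

// QueuedPodInfo is a minimal stand-in for framework.QueuedPodInfo.
type QueuedPodInfo struct{ Attempts int }

// resetPodInfos mirrors the role of the helper added in the diff above:
// rather than rebuilding the shared fixtures for every case (a larger
// refactor), it zeroes the state earlier cases may have mutated.
func resetPodInfos(pInfos []*QueuedPodInfo) {
	for _, p := range pInfos {
		p.Attempts = 0
	}
}

func main() {
	pInfos := []*QueuedPodInfo{{}, {}}

	// A previous test case's operation (e.g. re-adding an unschedulable
	// Pod) bumped the shared Pod's attempt counter.
	pInfos[0].Attempts = 3

	resetPodInfos(pInfos) // restore the fixture before the next case
	fmt.Println(pInfos[0].Attempts, pInfos[1].Attempts)
}
```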

@@ -1461,6 +1458,12 @@ func (p *PriorityQueue) getBackoffTime(podInfo *framework.QueuedPodInfo) time.Ti
// calculateBackoffDuration is a helper function for calculating the backoffDuration
// based on the number of attempts the pod has made.
func (p *PriorityQueue) calculateBackoffDuration(podInfo *framework.QueuedPodInfo) time.Duration {
if podInfo.Attempts == 0 {
Member

what's the name of the test for this?

Member Author

The test case "QueueHintFunction is called when Pod is gated by a plugin other than SchedulingGate" in TestPriorityQueue_MoveAllToActiveOrBackoffQueueWithQueueingHint ensures that a Pod with zero attempts doesn't get a backoff.

Member

what if the hints are disabled?

Member Author

The test isn't related to the feature gate.
It's just that when the feature gate is disabled we don't accept the queueing hint from the plugin, but still use the default queueing hint.

@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 20, 2024
@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 27, 2024
@sanposhiho
Member Author

@alculquicondor Updated based on your point.

@sanposhiho
Member Author

/retest

@k8s-ci-robot k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 27, 2024
Member

@alculquicondor alculquicondor left a comment

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 28, 2024
@k8s-ci-robot
Contributor

LGTM label has been added.

Git tree hash: 5189157ac5ac5fbfac5a470f15e8e97efd6353a2

@alculquicondor
Member

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 28, 2024
@k8s-triage-robot

The Kubernetes project has merge-blocking tests that are currently too flaky to consistently pass.

This bot retests PRs for certain kubernetes repos according to the following rules:

  • The PR does not have any do-not-merge/* labels
  • The PR does not have the needs-ok-to-test label
  • The PR is mergeable (does not have a needs-rebase label)
  • The PR is approved (has cncf-cla: yes, lgtm, approved labels)
  • The PR is failing tests required for merge

You can:

/retest

@k8s-ci-robot k8s-ci-robot merged commit 59051eb into kubernetes:master Aug 28, 2024
14 checks passed
@k8s-ci-robot k8s-ci-robot added this to the v1.32 milestone Aug 28, 2024
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/feature Categorizes issue or PR as related to a new feature. lgtm "Looks good to me", indicates that a PR is ready to be merged. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Pods gated by custom PreEnqueue plugins don't go through backoffQ even in case they ought to
6 participants