BookCapacity for ProvisioningRequest pods #6880

yaroslava-serdiuk · 2024-05-29T14:18:58Z

What type of PR is this?

/kind feature

This is needed to prevent ScaleDown for ProvisioningRequest during booking time.
Fixes #6517
Complete implementation for #6815

cluster-autoscaler/processors/provreq/provisioning_request_processor.go

kisieland · 2024-06-13T10:05:21Z

/lgtm

k8s-ci-robot · 2024-06-13T10:05:25Z

@kisieland: changing LGTM is restricted to collaborators

In response to this:

/lgtm

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

MaciekPytel · 2024-06-20T10:03:01Z

cluster-autoscaler/processors/provreq/processor.go

+
+// BookCapacity schedule fake pods for ProvisioningRequest that should have reserved capacity
+// in the cluster.
+func (p *provReqProcessor) BookCapacity(ctx *context.AutoscalingContext) error {


I think this should be implemented as PodListProcessor.Process() rather than introducing a new call to StaticAutoscaler:

It's literally doing what PLP is meant to do - changing the list of pods to be processed by CA (by injecting new ones).

Scheduling pods on existing nodes is also generally done in PLP - FilterOutSchedulable is the main place we do it.

Injecting pods in PLP in a similar way is a pretty well established pattern in CA forks - I know you have access to GKE fork, you can see how CapacityRequests do pretty much the same thing in a PLP.

PodListProcessor is processing the list of unschedulable pods and applying changes to it. Since booking capacity does nothing to unschedulable pods list and just modifying cluster snapshot.
Sure, I can implement booking capacity as a part of PodListProcessor and just do nothing with unschedulable list, is it what you asking? However I don't see the advantage of this approach.

MaciekPytel · 2024-06-20T10:06:45Z

cluster-autoscaler/processors/capacityreservation/capacityreservation.go

+)
+
+// CapacityReservation is interface to reserve capacity in the cluster.
+type CapacityReservation interface {


See my other comment - this is already generally done by PodListProcessors and I think PLP are better suited to the job: in most use-cases where you book capacity you also want to add any pods that don't fit to the list of pending pods in order to trigger scale-up.

in most use-cases where you book capacity you also want to add any pods that don't fit to the list of pending pods in order to trigger scale-up

I explicitly don't want to add any pods that don't fit to the list of pending pods, because the scale-up for ProvReq was already triggered, we just reserve capacity in the simple way by creating fake pods for ProvReq. In fact real pods could be already created, so fake pods won't fit in the cluster and this is fine.

Right, what I mean is that injecting in-memory pods is a common pattern already. This includes both pods that end up in the list of unschedulable pods and pods added directly to snapshot.
In most cases when you inject pods you expect whatever fits in snapshot to go there and what is left to trigger scale-up. Your use-case only involves modifying the snapshot and not the list of pending pods, which makes it slightly unusual. But I think it's better to still do it in PLP, even if it doesn't modify the lists of pods:

It is consistent with other implementations

What would be the expectation for future pod-injecting features that follow the pattern of "scheduling" as much as possible in cluster snapshot and adding leftover to list of unschedulable pods? Should those be implemented as PLP or the new processor? They both modify the list of pods and book capacity - and arguably so does FilterOutSchedulable which is a well established part of core CA logic.

Implemented PLP

yaroslava-serdiuk · 2024-07-11T10:19:42Z

/pony

k8s-ci-robot · 2024-07-11T10:19:50Z

@yaroslava-serdiuk:

In response to this:

/pony

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

MaciekPytel · 2024-07-12T17:09:39Z

/lgtm
/approve
/hold

k8s-ci-robot · 2024-07-12T17:09:55Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: MaciekPytel, yaroslava-serdiuk

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~cluster-autoscaler/OWNERS~~ [MaciekPytel]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

yaroslava-serdiuk · 2024-07-12T18:17:50Z

/unhold

yaroslava-serdiuk · 2024-07-16T08:28:57Z

/cherry-pick cluster-autoscaler-release-1.30

k8s-infra-cherrypick-robot · 2024-07-16T08:29:40Z

@yaroslava-serdiuk: new pull request created: #7057

In response to this:

/cherry-pick cluster-autoscaler-release-1.30

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot requested review from BigDarkClown and x13n May 29, 2024 14:19

k8s-ci-robot added the area/cluster-autoscaler label May 29, 2024

kisieland reviewed May 31, 2024

View reviewed changes

cluster-autoscaler/processors/provreq/provisioning_request_processor.go Outdated Show resolved Hide resolved

cluster-autoscaler/processors/provreq/provisioning_request_processor.go Outdated Show resolved Hide resolved

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 7, 2024

yaroslava-serdiuk force-pushed the provreq-scale-down branch from 0b8fdab to c2c676e Compare June 12, 2024 13:03

yaroslava-serdiuk changed the title ~~WIP: BookCapacity for ProvisioningRequest pods~~ BookCapacity for ProvisioningRequest pods Jun 12, 2024

k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jun 12, 2024

yaroslava-serdiuk force-pushed the provreq-scale-down branch 3 times, most recently from 7ac217c to 09ba963 Compare June 12, 2024 14:15

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 13, 2024

MaciekPytel reviewed Jun 20, 2024

View reviewed changes

yaroslava-serdiuk mentioned this pull request Jun 20, 2024

Add documentation for check-capacity Provisioning class #6904

Merged

yaroslava-serdiuk force-pushed the provreq-scale-down branch from 09ba963 to 69b8c9f Compare June 21, 2024 11:51

BookCapacity for ProvisioningRequest pods

830bbb2

yaroslava-serdiuk force-pushed the provreq-scale-down branch from 69b8c9f to 830bbb2 Compare June 21, 2024 11:54

yaroslava-serdiuk mentioned this pull request Jul 9, 2024

Implement AtomicScaleUp ProvisioningClass #6815

Closed

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jul 12, 2024

k8s-ci-robot assigned MaciekPytel Jul 12, 2024

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jul 12, 2024

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 12, 2024

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jul 12, 2024

k8s-ci-robot merged commit 68a757c into kubernetes:master Jul 12, 2024
6 checks passed

k8s-infra-cherrypick-robot mentioned this pull request Jul 16, 2024

[cluster-autoscaler-release-1.30] BookCapacity for ProvisioningRequest pods #7057

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BookCapacity for ProvisioningRequest pods #6880

BookCapacity for ProvisioningRequest pods #6880

yaroslava-serdiuk commented May 29, 2024 •

edited

Loading

kisieland commented Jun 13, 2024

k8s-ci-robot commented Jun 13, 2024

MaciekPytel Jun 20, 2024

yaroslava-serdiuk Jun 20, 2024

MaciekPytel Jun 20, 2024

yaroslava-serdiuk Jun 20, 2024

MaciekPytel Jun 20, 2024

yaroslava-serdiuk Jun 21, 2024

yaroslava-serdiuk commented Jul 11, 2024

k8s-ci-robot commented Jul 11, 2024

MaciekPytel commented Jul 12, 2024

k8s-ci-robot commented Jul 12, 2024

yaroslava-serdiuk commented Jul 12, 2024

yaroslava-serdiuk commented Jul 16, 2024

k8s-infra-cherrypick-robot commented Jul 16, 2024

BookCapacity for ProvisioningRequest pods #6880

BookCapacity for ProvisioningRequest pods #6880

Conversation

yaroslava-serdiuk commented May 29, 2024 • edited Loading

What type of PR is this?

kisieland commented Jun 13, 2024

k8s-ci-robot commented Jun 13, 2024

MaciekPytel Jun 20, 2024

Choose a reason for hiding this comment

yaroslava-serdiuk Jun 20, 2024

Choose a reason for hiding this comment

MaciekPytel Jun 20, 2024

Choose a reason for hiding this comment

yaroslava-serdiuk Jun 20, 2024

Choose a reason for hiding this comment

MaciekPytel Jun 20, 2024

Choose a reason for hiding this comment

yaroslava-serdiuk Jun 21, 2024

Choose a reason for hiding this comment

yaroslava-serdiuk commented Jul 11, 2024

k8s-ci-robot commented Jul 11, 2024

MaciekPytel commented Jul 12, 2024

k8s-ci-robot commented Jul 12, 2024

yaroslava-serdiuk commented Jul 12, 2024

yaroslava-serdiuk commented Jul 16, 2024

k8s-infra-cherrypick-robot commented Jul 16, 2024

yaroslava-serdiuk commented May 29, 2024 •

edited

Loading