KEP: in-place update of pod resources #686
Conversation
Thanks for your pull request. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). 📝 Please follow instructions at https://git.k8s.io/community/CLA.md#the-contributor-license-agreement to sign the CLA. It may take a couple minutes for the CLA signature to be fully registered; after that, please reply here with a new comment and we'll verify. Thanks.
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.
@k8s-ci-robot fixed CLA email address to match commit email.
/check-cla
Is this moving a (partially) consensus-reached KEP to the new repo, or is this the request for deeper review? If the latter, please adjust the title to reflect that this IS the KEP.
@thockin Yes, this just moves the earlier draft KEP skeleton by @kgolab from k/community to k/enhancements in order to get the ball rolling per the new process. The only change I made is to set the owning sig to autoscaling, as I think it is the main driver for this feature. I'll append my previous comments on this once I bring it up for an initial look with sig-node and sig-scheduling and get their official buy-in. This will evolve further as we merge design ideas and fill in details with community consensus.
Thanks to the above:
* PodSpec.Container.ResourceRequirements becomes purely a declaration,
denoting **desired** state of the Pod,
* PodStatus.ContainerStatus.ResourceAllocated (new object) denotes **actual**
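For readers skimming the thread, a hedged sketch of the spec/status split described in the excerpt above; the Go types and field shapes are illustrative stand-ins, not the KEP's actual API.

```go
// Illustrative-only sketch: desired resources live in the container spec,
// while a new ResourceAllocated field in the container status records what
// the node has actually granted.
package main

// ResourceList maps a resource name to a quantity, e.g. "cpu": "500m".
type ResourceList map[string]string

type ResourceRequirements struct {
	Requests ResourceList
	Limits   ResourceList
}

// Container (PodSpec side): purely a declaration of desired state.
type Container struct {
	Name      string
	Resources ResourceRequirements
}

// ContainerStatus (PodStatus side): ResourceAllocated reflects actual state.
type ContainerStatus struct {
	Name              string
	ResourceAllocated ResourceList
}

func main() {}
```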
Is there a corresponding CRI change for review? That shouldn't block merging this KEP draft but it is going to be important for implementers as it would need to be reviewed for Linux+Windows compatibility and runtime compatibility (dockershim/kata/hyper-v)
cc @feiskyer
I'm not very confident we have reached agreement that this is the direction we will go. If so, a CRI change should be included here.
@PatrickLang In our implementation, it was sufficient to make changes in kubelet to detect a resources-only container spec update and call the UpdateContainerResources CRI API, without any changes to the CRI itself. We have tested it with docker; we have yet to try kata.
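A rough sketch of that kubelet-side flow, under the assumption stated above that no CRI change is needed; the types and runtime interface below are simplified stand-ins, not the real kubelet or CRI code.

```go
// Sketch: if only a container's resources changed, call the existing
// UpdateContainerResources CRI method instead of recreating the container.
package main

import "fmt"

// ContainerResources is a simplified stand-in for the CRI resource message.
type ContainerResources struct {
	CPUQuota    int64 // CFS quota in microseconds per period
	MemoryBytes int64
}

// RuntimeService is a stand-in for the piece of the CRI the kubelet would use.
type RuntimeService interface {
	UpdateContainerResources(containerID string, res ContainerResources) error
}

type fakeRuntime struct{}

func (fakeRuntime) UpdateContainerResources(id string, res ContainerResources) error {
	fmt.Printf("UpdateContainerResources(%s, %+v)\n", id, res)
	return nil
}

// syncContainerResources applies an in-place resize only when the desired
// resources differ from what the running container currently has.
func syncContainerResources(rt RuntimeService, containerID string, current, desired ContainerResources) error {
	if current == desired {
		return nil // the spec change did not touch resources; nothing to do
	}
	return rt.UpdateContainerResources(containerID, desired)
}

func main() {
	_ = syncContainerResources(fakeRuntime{}, "abc123",
		ContainerResources{CPUQuota: 50000, MemoryBytes: 256 << 20},
		ContainerResources{CPUQuota: 100000, MemoryBytes: 512 << 20})
}
```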
@vinaykul Kata does not update "container" resources for now :-) Also, it's related to how the CRI shim is implemented; in the containerd shimv2 work, I remember we didn't handle this update as of at least a month ago.
That said, if we decide to go with the current narrative in this KEP, the CRI does need to be updated (new field: ResourceAllocated), and CRI shim & crictl maintainers should be notified about the incompatible change in the meaning of LinuxContainerResources.
> @vinaykul Kata does not update "container" resources for now :-) Also, it's related to how the CRI shim is implemented; in the containerd shimv2 work, I remember we didn't handle this update as of at least a month ago.
Ah that's good to know. I last toyed with Kata at GA, and they were working on getting cpu/mem update working.
I tried out kata-runtime 1.4.1 earlier today, and found that the OCI path mostly works. CPU/memory increases and decreases are reflected in the cgroup inside the kata container and enforced, but VSZ/RSS isn't lowered when memory is lowered, and Get doesn't reflect the actual usage.
I'll try k8s-crio-kata tomorrow or Friday to see how well the CRI-O-to-OCI translation works and identify gaps. It probably won't work if the containerd shim doesn't handle it.
> That said, if we decide to go with the current narrative in this KEP, the CRI does need to be updated (new field: ResourceAllocated), and CRI shim & crictl maintainers should be notified about the incompatible change in the meaning of LinuxContainerResources.
@resouer kata-runtime 1.4.1 seems to handle updating cpu/memory via CRI-O (example below).
Regarding CRI, kubelet would merely switch from using PodSpec.Container.ResourceRequirements to PodStatus.ContainerStatus.ResourceAllocated to get the limits when invoking the CRI API in this function, for example: https://github.com/Huawei-PaaS/kubernetes/blob/vert-scaling-cp-review/pkg/kubelet/kuberuntime/kuberuntime_container.go#L619
Did I miss something in thinking that a CRI update is not necessary?
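For illustration only, a minimal sketch of the switch described above; the types and helper are hypothetical and not taken from the linked implementation.

```go
// Sketch: when building the limits handed to the runtime, prefer the
// allocated values recorded in the pod status over the desired values in the
// pod spec, falling back to the spec before anything has been allocated.
package main

import "fmt"

type Resources struct {
	CPUMillis   int64
	MemoryBytes int64
}

type ContainerSpec struct {
	Name   string
	Limits Resources // desired state, from PodSpec
}

type ContainerStatus struct {
	Name              string
	ResourceAllocated *Resources // actual state, from PodStatus (new in this KEP)
}

// limitsForRuntime picks the values kubelet would pass to the CRI call.
func limitsForRuntime(spec ContainerSpec, status *ContainerStatus) Resources {
	if status != nil && status.ResourceAllocated != nil {
		return *status.ResourceAllocated
	}
	return spec.Limits
}

func main() {
	spec := ContainerSpec{Name: "app", Limits: Resources{CPUMillis: 1000, MemoryBytes: 1 << 30}}
	status := &ContainerStatus{Name: "app", ResourceAllocated: &Resources{CPUMillis: 500, MemoryBytes: 512 << 20}}
	fmt.Printf("limits passed to runtime: %+v\n", limitsForRuntime(spec, status))
}
```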
Thanks, I just checked kata-runtime's API, and now every CRI shim can support container-level resource adjustment. In that case, no CRI change is required; we can simply use ResourceAllocated to generate containerResources.
Are you sure about "every CRI"? I'll try to look into that more deeply during the next few days, but that will almost surely break https://github.com/Mirantis/virtlet/
> Are you sure about "every CRI"? I'll try to look into that more deeply during the next few days, but that will almost surely break https://github.com/Mirantis/virtlet/
@PatrickLang Over the past week, I experimented with WinServer 2019 (for planning another project I'm working on), and got the chance to try a Windows cross-compiled kubelet with my implementation and take a closer look at how to set the updated limits. Windows does create the container with the specified limits (perhaps using information from the ContainerConfig.WindowsContainerConfig.WindowsContainerResources struct).
For cleanliness, I do see that we should update the CRI API to specify ContainerResources instead of LinuxContainerResources (which would have pointers to LinuxContainerResources or WindowsContainerResources, similar to ContainerConfig). Do you think containerID + WindowsContainerResources is sufficient for Windows to successfully update the limits?
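A sketch of the platform-neutral wrapper suggested above; the message shapes are hypothetical, only echoing how ContainerConfig carries per-OS config, and are not the actual CRI proto.

```go
// Sketch: UpdateContainerResources could accept a ContainerResources wrapper
// holding either Linux or Windows settings, instead of LinuxContainerResources
// directly. All field names here are illustrative.
package main

type LinuxContainerResources struct {
	CPUShares          int64
	CPUQuota           int64
	MemoryLimitInBytes int64
}

type WindowsContainerResources struct {
	CPUCount           int64
	CPUMaximum         int64
	MemoryLimitInBytes int64
}

// ContainerResources wraps the per-OS variants; exactly one is expected to be set.
type ContainerResources struct {
	Linux   *LinuxContainerResources
	Windows *WindowsContainerResources
}

// RuntimeService shows where the wrapper would plug into the update call.
type RuntimeService interface {
	UpdateContainerResources(containerID string, resources *ContainerResources) error
}

func main() {}
```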
@jellonek I've not looked at virtlet. If you have had the chance, can you please check if its CRI shim is able to use LinuxContainerResources?
/assign @mwielgus
/assign
@vinaykul, thanks a lot for driving this forward and also simplifying the initial design. Overall, the KEP already looks good to me from the autoscaling perspective.
@kgolab Thanks for the review; please see my responses above. The goal has been to keep things simple for Kubelet, so we removed proposed changes that were not critical to the base resize operation. If possible, could you please join tomorrow's 10 am PST sig-node weekly meeting to review the above concerns? In today's sig-autoscaling meeting, I discussed this; we will list @mwielgus as the final approver in the KEP, and he plans to approve it after your LGTM.
/approve @vinaykul I approved your KEP from SIG Node to unblock the ongoing API review, since we are converging on the high-level design. There are some small implementation details that can be addressed later. Thanks for bringing this project to SIG Node over the last couple of months, and for leading the design conversation at our weekly meetings. Thanks for your patience.
/assign @derekwaynecarr
/approve Approving for sig-scheduling. Our main concern is the race that could happen between the scheduler and an in-place update. We agreed that we don't have enough data points to conclude how disruptive that is going to be, so we will re-evaluate after alpha.
To provide fine-grained user control, PodSpec.Containers is extended with
ResizePolicy map (new object) for each resource type (CPU, memory):
* NoRestart - the default value; resize Container without restarting it,
* RestartContainer - restart the Container in-place to apply new resource
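As an illustration of the excerpt above, a sketch of what the per-resource policy might look like as Go API types; the names mirror the draft text but are not the final API.

```go
// Sketch: each resource name (cpu, memory) maps to the action kubelet should
// take when that resource is resized in place.
package main

import "fmt"

type ResizePolicy string

const (
	NoRestart        ResizePolicy = "NoRestart"        // default: apply without restarting
	RestartContainer ResizePolicy = "RestartContainer" // restart in place to apply new values
)

type Container struct {
	Name         string
	ResizePolicy map[string]ResizePolicy // keyed by resource name
}

func main() {
	c := Container{
		Name: "app",
		// e.g. a JVM that sizes its heap at startup may need a restart for memory changes
		ResizePolicy: map[string]ResizePolicy{"cpu": NoRestart, "memory": RestartContainer},
	}
	fmt.Printf("%+v\n", c)
}
```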
I can see a case for a third policy here, which as a strawman I will call SignalContainerWINCH. This would allow the container to attempt to adjust its language runtime to conform to the new limits - e.g. a programmer determines that calling runtime.GOMAXPROCS(math.Ceil(numCPUs) + 1) results in less scheduler thrashing.
However, such a signal would only be useful if pods are able to interrogate the system for their own resource limits. This is perhaps best left to future enhancements to in-place update and should not block 1.17 implementation.
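A strawman sketch of what a container process might do with such a signal, assuming the policy proposed above; readCPULimit is a hypothetical helper (e.g. reading the cgroup, as the next comment notes).

```go
// Sketch: on SIGWINCH, re-read the CPU limit and retune the Go runtime, per
// the GOMAXPROCS example in the comment above.
package main

import (
	"math"
	"os"
	"os/signal"
	"runtime"
	"syscall"
)

// readCPULimit is a stand-in for however the process discovers its current
// CPU limit; here it just reports the visible CPU count.
func readCPULimit() (float64, error) {
	return float64(runtime.NumCPU()), nil
}

func main() {
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGWINCH)
	for range sigs {
		if cpus, err := readCPULimit(); err == nil {
			runtime.GOMAXPROCS(int(math.Ceil(cpus)) + 1)
		}
	}
}
```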
On Linux you can always read /sys/fs/cgroup, can't you?
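For example, a small sketch of reading the CPU limit from inside a container, assuming cgroup v1 paths (cgroup v2 exposes cpu.max instead):

```go
// Sketch: effective CPUs = cfs_quota_us / cfs_period_us; a quota of -1 means
// no limit is set.
package main

import (
	"fmt"
	"os"
	"strconv"
	"strings"
)

func readInt(path string) (int64, error) {
	b, err := os.ReadFile(path)
	if err != nil {
		return 0, err
	}
	return strconv.ParseInt(strings.TrimSpace(string(b)), 10, 64)
}

func main() {
	quota, err1 := readInt("/sys/fs/cgroup/cpu/cpu.cfs_quota_us")
	period, err2 := readInt("/sys/fs/cgroup/cpu/cpu.cfs_period_us")
	if err1 != nil || err2 != nil || quota <= 0 || period <= 0 {
		fmt.Println("no CPU limit found (or not cgroup v1)")
		return
	}
	fmt.Printf("cpu limit: %.2f CPUs\n", float64(quota)/float64(period))
}
```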
/lgtm
/approve Approving for sig-autoscaling.
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: ahg-g, dashpole, dchen1107, mwielgus, vinaykul
The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
This PR moves Karol's initial proposal for in-place update of pod resources from k/community to k/enhancements as required by the new process.
This KEP intends to build upon the ideas in proposal for live and in-place vertical scaling and Vertical Resources Scaling in Kubernetes.
This PR also updates the owning-sig to sig-autoscaling, and adds an initial set of reviewers, @bsalamat, @derekwaynecarr, and @dchen1107, from sig-scheduling and sig-node, where the bulk of the anticipated code changes will happen.
The original pull request by Karol, and the associated discussion, can be found here.
CC: @kgolab @bskiba @schylek @bsalamat @dchen1107 @derekwaynecarr @karthickrajamani @YoungjaeLee @resouer