📖 Label Sync Between MachineDeployment and underlying Kubernetes Nodes #6255
Conversation
Hi @Arvinderpal. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with `/ok-to-test`. Once the patch is verified, the new status will be reflected by the `ok-to-test` label. I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@enxebre PTAL
@Arvinderpal could you please fix the EasyCLA? Overall this looks reasonable to me. PTAL @sbueringer @fabriziopandini
Force-pushed from 5f5c038 to b6b2d67.
Force-pushed from b6b2d67 to 338e17b.
@enxebre I made the suggested changes.
/ok-to-test
@Arvinderpal Overall looks good, nice work! A few comments.
@fabriziopandini @sbueringer Thank you for your feedback. Please see my comments below.
Force-pushed from 338e17b to 5fae8b4.
@enxebre @sbueringer PTAL at my comments.
I think we have two open points which would be nice to get consensus on:
I have commented on both points.
@Arvinderpal Do you have time to address the above points?
Force-pushed from ef8b61f to c15fbbe.
New changes are detected. LGTM label has been removed.
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
Force-pushed from c15fbbe to 5f96f5a.
@vincepri Done! We can start lazy consensus.
/assign @CecileRobertMichon @sbueringer Lazy consensus expires on 06/15
@Arvinderpal a few details, otherwise lgtm from my side
```
MachineSet.spec.template.metadata.labels => Machine.labels
```
As of this writing, changing the `MachineDeployment.spec.template.metadata.labels` will trigger a rollout. This is undesirable. CAPI will be updated to instead ignore updates of labels that fall within the CAPI prefix. There is precedent for this with the handling of `MachineDeploymentUniqueLabel`.
Suggested change:
As of this writing, changing the `MachineDeployment.spec.template.metadata.labels` will trigger a rollout. This is undesirable. CAPI will be updated to instead propagate updates of labels in-place without a rollout. Similar changes will be made to KCP.
I would suggest propagating all labels in place. Otherwise we will end up in a weird state where changes to some labels trigger an MD/MS rollout while others are updated in place (this should also be simpler to implement).
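For illustration only, a minimal sketch of what propagating all template labels in place could look like, assuming a controller-runtime client and the current CAPI types; the `syncTemplateLabels` helper is hypothetical and not part of the proposal:

```go
import (
	"context"

	clusterv1 "sigs.k8s.io/cluster-api/api/v1beta1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// syncTemplateLabels copies labels from a MachineSet's template onto its
// existing Machines in place, without triggering a rollout. Sketch only;
// error aggregation and ownership checks are omitted.
func syncTemplateLabels(ctx context.Context, c client.Client, ms *clusterv1.MachineSet, machines []*clusterv1.Machine) error {
	for _, m := range machines {
		patchBase := client.MergeFrom(m.DeepCopy())
		if m.Labels == nil {
			m.Labels = map[string]string{}
		}
		for k, v := range ms.Spec.Template.Labels {
			m.Labels[k] = v
		}
		if err := c.Patch(ctx, m, patchBase); err != nil {
			return err
		}
	}
	return nil
}
```

Because this is a merge patch against the existing Machine, labels outside the template are left untouched.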
+1. That said, this is a breaking behavioral change which needs proper release documentation and a new minor release; in addition, changes to the logic of MachineDeployment might be extensive.
Yeah which is why I favored a separate field ... but yeah that decision has been made now :)
Just to confirm, we're fine with breaking behavior changes of our API fields as long as we do them in new minor releases? (my impression was that this requires a new apiVersion)
P.S. re: MD changes. The real fun part is probably to stop considering labels for the hash we're using while not triggering a MD rollout when the new CAPI version with this behavior is rolled out
Just to confirm, we're fine with breaking behavior changes of our API fields as long as we do them in new minor releases? (my impression was that this requires a new apiVersion)
The API fields wouldn't need to change, but the propagation behavior of those fields would change, which is something that needs to be documented.
P.S. re: MD changes. The real fun part is probably to stop considering labels for the hash we're using while not triggering a MD rollout when the new CAPI version with this behavior is rolled out
The same problem applies if we add a new field to the Machine template, we'd need a way to make sure that the generated hash is backward compatible. If that's easier though, let's just add a new field?
The API fields wouldn't need to change, but the propagation behavior of those fields would change, which is something that needs to be documented.
Okay, good to know that we don't consider this a breaking change :)
The same problem applies if we add a new field to the Machine template, we'd need a way to make sure that the generated hash is backward compatible. If that's easier though, let's just add a new field?
I think if it's a separate field we just never have to include it in the hash and we don't have to figure out how to remove something from the hash calculation without rollouts. Probably?
Between the two solutions proposed above, the second one seems the most sane from a user perspective. If we also think that in the fullness of time we'd want to expand the in place node fields, this might be a good start.
That said, the biggest issue I see with the above is that we're relying on a hashing mechanism that's flaky and fragile. Have we considered breaking the hashing first before introducing new changes that might affect future rollouts and extensibility points?
That said, the biggest issue I see with the above is that we're relying on a hashing mechanism that's flaky and fragile. Have we considered breaking the hashing first before introducing new changes that might affect future rollouts and extensibility points?
Good point. I don't think we went that far regarding thinking about how to implement this proposal. But I think we should try to get to a consensus relatively soon; we have been going back and forth on the field discussion for a while now.
Maybe the following is a path forward?
1. Pick the option that labels will be in a separate node.labels field. Here it was basically a 50:50 decision (📖 Label Sync Between MachineDeployment and underlying Kubernetes Nodes #6255 (comment)), but it sounds like we now prefer a separate field
2. Implement the sync from Machine to Nodes
3. Figure out how to in-place upgrade node.labels in MD
4. Figure out how to in-place upgrade node.labels in KCP
I think the feature would already have a big benefit after 2., even without in-place upgrades (as it at least allows propagating labels to nodes at all, which isn't possible right now).
Then we can continue with 3., do POCs, and figure out the ideal way to implement in-place upgrades.
Damn :/ Forgot that adding the field will already lead to a rollout on CAPI upgrade if we don't change the hashing.
So we definitely have to figure out how to do it for 2. already.
But if the general consensus is that we prefer the new field anyway (and we have the hash problem in both cases), we can at least move the proposal forward and then experiment during implementation on how to actually do it.
@sbueringer Thanks for pointing out the issue!
Introducing a new field is fine by me! I don't believe we have any objections, besides maybe @fabriziopandini :)
Should we also consider replacing the DeepCopy with something that includes only specific fields (i.e. excludes our new field)?
Also, if I'm not mistaken, wouldn't introducing a new field always cause a different hash to be generated, even if that new field is nil? So, when people upgrade, there will always be a rollout? Is this behavior considered okay for a breaking change?
Should we also consider replacing the Deepcopy with something that includes specific fields (i.e. excludes our new field).
I think as long as we are using a changed struct the hash will change. I think we have a few options to solve this:
- "freeze" the struct by creating another one which produces the same output
- hack the way we write the struct into the hasher (e.g. by just dropping parts of the string)
- Replace Spew with something which writes a compatible output but allows ignoring fields
- Replace the entire hash mechanism through something else
There might be other alternatives as well. But for me personally it would be fine to experiment during implementation. I think by making it a separate field we have a few advantages, e.g.:
- the hash issue is probably simpler to solve (if we keep the hash mechanism, we don't have to figure out how to only include pre-existing labels in the hash and ignore new ones)
- and that it's not a behavioral change of the label field anymore.
So, when people upgrade, there will always be a rollout? Is this behavior considered okay for a breaking change?
Up until now we tried to avoid that at all costs (we even have an e2e test which validates it)
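To make the field-exclusion idea concrete, here is a rough sketch, assuming the hash stays fnv/spew based as in the upstream Deployment controller; `computeTemplateHash` and the commented-out node-labels field are hypothetical:

```go
import (
	"fmt"
	"hash/fnv"

	"github.com/davecgh/go-spew/spew"

	clusterv1 "sigs.k8s.io/cluster-api/api/v1beta1"
)

// computeTemplateHash hashes a MachineTemplateSpec, but first clears any
// fields that must not influence the hash, so that adding a new in-place
// propagated field does not roll out existing MachineSets. Sketch only.
func computeTemplateHash(template clusterv1.MachineTemplateSpec) string {
	// Work on a copy so callers never see the stripped fields.
	stripped := template.DeepCopy()
	// A hypothetical new node-labels field would be cleared here, e.g.:
	// stripped.Spec.NodeLabels = nil

	hasher := fnv.New32a()
	// Deterministic dump of the struct into the hasher, mirroring how the
	// upstream Deployment controller hashes pod templates.
	printer := spew.ConfigState{
		Indent:         " ",
		SortKeys:       true,
		DisableMethods: true,
		SpewKeys:       true,
	}
	printer.Fprintf(hasher, "%#v", stripped)
	return fmt.Sprintf("%d", hasher.Sum32())
}
```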
Labels that fall under the CAPI specific prefix `node.cluster.k8s.io/*` will need to be kept in sync between the Machine and Node objects. Synchronization must handle these two scenarios:
(A) Label is added/removed on a Machine and must be added/removed on the corresponding Node.
(B) Label is added/removed on the Node and must be removed/re-added on the Node to bring it in sync with the labels on Machine.
Suggested change:
Labels that fall under the CAPI specific prefix `node.cluster.k8s.io/*` will need to be kept in sync between the Machine and Node objects. Synchronization must handle these three scenarios:
(A) Label is added/removed on a Machine and must be added/removed on the corresponding Node.
(B) Label is added/removed on the Node and must be removed/re-added on the Node to bring it in sync with the labels on Machine.
(C) Label value is changed on the Machine or Node and must be brought in sync with the label value on Machine.
I think we have 3 scenarios (or something similar)
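For reference, a minimal sketch of how the Machine could act as the source of truth for the prefixed labels, covering all three scenarios; the `desiredNodeLabels` helper is hypothetical and only illustrates the merge logic:

```go
import "strings"

// labelSyncPrefix follows the prefix named in the proposal text.
const labelSyncPrefix = "node.cluster.k8s.io/"

// desiredNodeLabels computes the label set a Node should converge to:
// prefixed labels come exclusively from the Machine (scenarios A and C),
// and prefixed labels added directly on the Node are dropped (scenario B).
// All other Node labels are left untouched.
func desiredNodeLabels(machineLabels, nodeLabels map[string]string) map[string]string {
	desired := map[string]string{}
	for k, v := range nodeLabels {
		if !strings.HasPrefix(k, labelSyncPrefix) {
			desired[k] = v
		}
	}
	for k, v := range machineLabels {
		if strings.HasPrefix(k, labelSyncPrefix) {
			desired[k] = v
		}
	}
	return desired
}
```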
If you utilize inequality-based selection for workload placement, it is recommended that you specify the following taint in your KubeadmConfigTemplate to prevent unintended scheduling of pods during the initial node startup phase:
`cluster.x-k8s.io=label-sync-pending:NoSchedule`
Suggested change:
`cluster.x-k8s.io/label-sync-pending:NoSchedule`
Assuming key:effect
(same for other occurrences)
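For completeness, a hedged sketch of how such a taint could be wired into the kubeadm bootstrap configuration, assuming the bootstrap provider's `JoinConfiguration.NodeRegistration.Taints` field; the helper name is hypothetical and package paths may differ by version:

```go
import (
	corev1 "k8s.io/api/core/v1"
	bootstrapv1 "sigs.k8s.io/cluster-api/bootstrap/kubeadm/api/v1beta1"
)

// labelSyncPendingTaint keeps workloads that rely on inequality-based
// selectors off a Node until the label sync has run and the taint is
// removed. Key/effect follow the suggestion above.
var labelSyncPendingTaint = corev1.Taint{
	Key:    "cluster.x-k8s.io/label-sync-pending",
	Effect: corev1.TaintEffectNoSchedule,
}

// withLabelSyncTaint adds the taint to a KubeadmConfigSpec's join
// configuration; a KubeadmConfigTemplate would carry the same spec.
func withLabelSyncTaint(spec *bootstrapv1.KubeadmConfigSpec) {
	if spec.JoinConfiguration == nil {
		spec.JoinConfiguration = &bootstrapv1.JoinConfiguration{}
	}
	spec.JoinConfiguration.NodeRegistration.Taints = append(
		spec.JoinConfiguration.NodeRegistration.Taints,
		labelSyncPendingTaint,
	)
}
```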
The benefit being that it provides a clear indication to the user that these labels will be synced to the Kubernetes Node(s). This will, however, require API changes.
Option #2: Propogate labels in the top-level metadata
Suggested change:
Option #2: Propagate labels in the top-level metadata
nit: please search/replace across the whole doc (also "propagation")
I created a POC adding a new field to see how it feels: #7173. My main concern is around UX for consumers and introducing divergence from the current deployment rollout approach. Another possibility would be to first solve in-place rollouts generically; then node label propagation would not be an exception but, like any other input change, could be handled either by replacement (an existing MachineDeployment rollout) or in place.
@Arvinderpal Do you have time to address the above feedback?
Just to surface that here: Fabrizio and Alberto are currently working on / exploring how to implement it. AFAIK they will update the proposal accordingly soon.
Just want to make sure the proposal PR is up-to-date with current discussions.
Hey guys, sorry about the delay. Unfortunately, I don't have the bandwidth to work on this. Please feel free to update the proposal as you see fit. Much appreciated!
kk, I will take over work on this proposal. Thank you @Arvinderpal for the work done so far and to everyone who provided valuable feedback.
/close
Created #7296 to start nailing down Machine-Node label propagation.
@fabriziopandini: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
What this PR does / why we need it:
Proposal: Label Sync Between MachineDeployment and underlying Kubernetes Nodes
Google Doc: https://docs.google.com/document/d/17QA2E0GcbWNYb160qs8ArHOW0uMfL-NTYivefPGtn-c/edit?usp=sharing
Which issue(s) this PR fixes:
Fixes #493