Cluster Autoscaler does not interpret labels specified with k8s.io/cluster-autoscaler/node-template/label/* tags on an AWS ASG unless those tags are set to propagate to the instances #4490
Comments
I think this was "kind of" documented in an example before; it was later replaced with some recommendations and moved to the FAQ. I'm not sure whether that means the behavior was fixed between those commits, or whether it's just unfortunate that the example got replaced/moved.
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community.
/close
@k8s-triage-robot: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
/remove-lifecycle rotten
@EHJ-52n: You can't reopen an issue/PR unless you authored it or you are a collaborator. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
@adamnovak Is this issue solved for you?
I've been employing the workaround of always setting the tags to propagate, and I'm not likely to find time to try and reproduce this again on our live system any time soon. As for documenting that setting the tags to propagate is necessary, it looks like @acalm found where in the docs that would belong, so I think you could look there in the current mainline to see if it has been documented yet.
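For anyone wanting to apply the same workaround, here is a minimal sketch of what a propagating tag looks like as an API payload. The helper name `propagating_tag` and the ASG name `my-asg` are hypothetical; the field names follow the AWS Auto Scaling tag structure, and the boto3 call in the comment is left unexecuted because it needs credentials and a real ASG.

```python
def propagating_tag(asg_name, key, value):
    """Build an Auto Scaling tag entry that is set to propagate to new instances.

    `asg_name` is a placeholder for your own ASG; nothing here talks to AWS.
    """
    return {
        "ResourceId": asg_name,
        "ResourceType": "auto-scaling-group",
        "Key": key,
        "Value": value,
        "PropagateAtLaunch": True,  # the "Tag new instances" toggle
    }


tag = propagating_tag(
    "my-asg",  # hypothetical ASG name
    "k8s.io/cluster-autoscaler/node-template/label/eks.amazonaws.com/capacityType",
    "SPOT",
)

# With boto3 installed and credentials configured, the tag could then be
# applied along these lines (not run here):
#   boto3.client("autoscaling").create_or_update_tags(Tags=[tag])
```

This only changes the tag's propagate flag; it does not explain why the autoscaler needed it in the first place.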
Which component are you using?:
cluster-autoscaler
What version of the component are you using?:
Component version: v1.17.3
What k8s version are you using (kubectl version)?:
kubectl version output
What environment is this in?:
We're deploying nodes on Amazon AWS with Autoscaling Groups, using the cluster autoscaler's ability to automatically pick up ASGs tagged with certain tags.
What did you expect to happen?:
I expected the cluster autoscaler to read the tags of the ASG to determine what the labels on nodes that the ASG produces will be, when scaling from 0. I don't expect the value of the "Tag new instances" toggle on the tag to matter here.
In particular, I expect that if I tag an ASG with k8s.io/cluster-autoscaler/node-template/label/eks.amazonaws.com/capacityType with value SPOT, and don't set the tag to propagate to instances, then the cluster autoscaler will make a hypothetical node that will match a nodeSelector of eks.amazonaws.com/capacityType: SPOT.
(Note that I'm not using EKS here, just the label values they define, since Kubernetes itself has no standard for labeling or tainting preemptible nodes.)
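To make the expectation concrete, here is a small Python sketch of the behavior described above. It is not the autoscaler's actual (Go) implementation; the function names `labels_from_asg_tags` and `matches_node_selector` are hypothetical, and the tag dictionaries only mirror the Key/Value/PropagateAtLaunch fields that AWS reports for ASG tags.

```python
LABEL_PREFIX = "k8s.io/cluster-autoscaler/node-template/label/"


def labels_from_asg_tags(tags):
    """Derive template node labels from ASG tags.

    PropagateAtLaunch is deliberately ignored: that is the behavior
    expected in this issue, not necessarily what v1.17.3 does.
    """
    labels = {}
    for tag in tags:
        key = tag["Key"]
        if key.startswith(LABEL_PREFIX):
            # Everything after the prefix is the label name.
            labels[key[len(LABEL_PREFIX):]] = tag["Value"]
    return labels


def matches_node_selector(labels, node_selector):
    """True if every nodeSelector entry is present in the labels."""
    return all(labels.get(k) == v for k, v in node_selector.items())


asg_tags = [
    {
        "Key": LABEL_PREFIX + "eks.amazonaws.com/capacityType",
        "Value": "SPOT",
        "PropagateAtLaunch": False,  # not propagated to instances
    },
    {"Key": "Name", "Value": "workers", "PropagateAtLaunch": True},
]

template_labels = labels_from_asg_tags(asg_tags)
selector = {"eks.amazonaws.com/capacityType": "SPOT"}
schedulable = matches_node_selector(template_labels, selector)
```

Under this reading, a pod with that nodeSelector would fit the hypothetical node even though the tag is not propagated.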
What happened instead?:
When the labeling tag was set to not propagate to nodes, I got log messages like:
When I changed the tag to propagate to new instances, then I got a different error (because I'd misspelled my ephemeral storage limit tag):
When I fixed that tag, then the autoscaler started provisioning my node.
How to reproduce it (as minimally and precisely as possible):
1. Create an ASG with a tag whose key starts with k8s.io/cluster-autoscaler/node-template/label/ and specifies a unique label, but set it not to propagate to nodes. (Also, optionally configure the node to really have that label when it comes up.)
2. Create a pod with a nodeSelector to match that label.
Anything else we need to know?:
I suspect taints and other stuff inferred from tags also work this way.
This might be a possible cause of people reporting they are affected by #4010 and #3802, even though the screenshots I've seen there indicate that the tag new instances flags are set by the main reporters.