# Cluster Autoscaler on AWS

On AWS, Cluster Autoscaler utilizes Amazon EC2 Auto Scaling Groups to manage node
groups. Cluster Autoscaler typically runs as a `Deployment` in your cluster.

## Requirements

Cluster Autoscaler requires Kubernetes v1.3.0 or greater.

## Permissions

Cluster Autoscaler requires the ability to examine and modify EC2 Auto Scaling
Groups. We recommend using [IAM roles for Service
Accounts](https://docs.aws.amazon.com/eks/latest/userguide/iam-roles-for-service-accounts.html)
to associate the Service Account used by the Cluster Autoscaler `Deployment`
with an IAM role that is able to perform these functions. If you are unable to
use IAM Roles for Service Accounts, you may associate an IAM service role with
the EC2 instance on which the Cluster Autoscaler pod runs.

### IAM Policy

The following policy provides the minimum privileges necessary for Cluster Autoscaler to run:

```json
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Action": [
"autoscaling:DescribeAutoScalingGroups",
"autoscaling:DescribeAutoScalingInstances",
"autoscaling:DescribeLaunchConfigurations",
"autoscaling:SetDesiredCapacity",
"autoscaling:TerminateInstanceInAutoScalingGroup"
],
"Resource": ["*"]
}
]
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Action": [
"autoscaling:DescribeAutoScalingGroups",
"autoscaling:DescribeAutoScalingInstances",
"autoscaling:DescribeLaunchConfigurations",
"autoscaling:SetDesiredCapacity",
"autoscaling:TerminateInstanceInAutoScalingGroup"
],
"Resource": ["*"]
}
]
}
```
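
For example, a minimal sketch of creating this policy with the AWS CLI, assuming
the document above is saved as `cluster-autoscaler-policy.json` (the policy and
file names are illustrative):

```
aws iam create-policy \
  --policy-name ClusterAutoscalerPolicy \
  --policy-document file://cluster-autoscaler-policy.json
```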

More information can be found
[here](https://docs.aws.amazon.com/autoscaling/latest/userguide/control-access-using-iam.html#policy-auto-scaling-resources).

### Using OIDC Federated Authentication

OIDC federated authentication allows your service to assume an IAM role and interact with AWS services without having to store credentials as environment variables. For an example of how to use AWS IAM OIDC with the Cluster Autoscaler, please see [here](CA_with_AWS_IAM_OIDC.md).
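
As a rough sketch, on EKS the service account and IAM role pairing can be
created with `eksctl` (the cluster name, account ID and policy name are
placeholders):

```
eksctl create iamserviceaccount \
  --cluster <cluster-name> \
  --namespace kube-system \
  --name cluster-autoscaler \
  --attach-policy-arn arn:aws:iam::<account-id>:policy/ClusterAutoscalerPolicy \
  --approve
```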

### Using AWS Credentials
Alternatively, you can provide AWS credentials to the Cluster Autoscaler via a
`Secret`, for example:

```yaml
apiVersion: v1
kind: Secret
metadata:
  name: aws-secret
type: Opaque
data:
  aws_access_key_id: BASE64_OF_YOUR_AWS_ACCESS_KEY_ID
  aws_secret_access_key: BASE64_OF_YOUR_AWS_SECRET_ACCESS_KEY
```

Please refer to the [relevant Kubernetes
documentation](https://kubernetes.io/docs/concepts/configuration/secret/#creating-a-secret-manually)
for creating a secret manually.
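
For example, a quick sketch of creating this secret with `kubectl`, assuming
Cluster Autoscaler runs in the `kube-system` namespace (`kubectl` base64-encodes
the literal values for you):

```
kubectl -n kube-system create secret generic aws-secret \
  --from-literal=aws_access_key_id=<YOUR_AWS_ACCESS_KEY_ID> \
  --from-literal=aws_secret_access_key=<YOUR_AWS_SECRET_ACCESS_KEY>
```

The secret's values can then be exposed to the Cluster Autoscaler container as
environment variables: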
```yaml
env:
  - name: AWS_ACCESS_KEY_ID
    valueFrom:
      secretKeyRef:
        name: aws-secret
        key: aws_access_key_id
  - name: AWS_SECRET_ACCESS_KEY
    valueFrom:
      secretKeyRef:
        name: aws-secret
        key: aws_secret_access_key
  - name: AWS_REGION
    value: YOUR_AWS_REGION
```

## Auto-Discovery Setup

To enable auto-discovery, start Cluster Autoscaler with the
`--node-group-auto-discovery` flag, e.g.
`--node-group-auto-discovery=asg:tag=k8s.io/cluster-autoscaler/enabled,k8s.io/cluster-autoscaler/<cluster-name>`.
Cluster Autoscaler will discover and manage any ASGs that carry all of the
listed tag keys.

Cluster Autoscaler will attempt to determine the CPU, memory, and GPU resources
provided by an Auto Scaling Group based on the instance type specified in its
Launch Configuration or Launch Template. It will also examine any overrides
provided in an ASG's Mixed Instances Policy. If any such overrides are found,
only the first instance type found will be used. See [Using Mixed Instances
Policies and Spot Instances](#Using-Mixed-Instances-Policies-and-Spot-Instances)
for details.
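
For reference, a minimal sketch of the Cluster Autoscaler container command for
auto-discovery (the entrypoint and flag values follow the conventions described
in this section; substitute your own cluster name):

```yaml
command:
  - ./cluster-autoscaler
  - --cloud-provider=aws
  - --node-group-auto-discovery=asg:tag=k8s.io/cluster-autoscaler/enabled,k8s.io/cluster-autoscaler/<cluster-name>
```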

Cluster Autoscaler supports hints, supplied as ASG tags, about the labels nodes
will carry when they join the cluster. The tag key has the format
`k8s.io/cluster-autoscaler/node-template/label/<label-name>`, where
`<label-name>` is the name of the label; the value of each tag specifies the
label value.

Example tags:

- `k8s.io/cluster-autoscaler/node-template/label/foo`: `bar`

Cluster Autoscaler likewise supports hints, supplied as ASG tags, about the
taints nodes will carry when they join the cluster. The tag key has the format
`k8s.io/cluster-autoscaler/node-template/taint/<taint-name>`, where
`<taint-name>` is the name of the taint; the value of each tag specifies the
taint value and effect in the format `<taint-value>:<taint-effect>`.

Example tags:

- `k8s.io/cluster-autoscaler/node-template/taint/dedicated`: `true:NoSchedule`

From version 1.14, Cluster Autoscaler can also determine the resources provided
by each Auto Scaling Group via tags. The tag is of the format
`k8s.io/cluster-autoscaler/node-template/resources/<resource-name>`.
The value of each tag specifies the amount of the resource provided; the units
are identical to the units used in the `resources` field of a Pod specification.

Example tags:

- `k8s.io/cluster-autoscaler/node-template/resources/ephemeral-storage`: `100G`
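
Such tags can be applied to an ASG with the AWS CLI; a sketch, assuming an ASG
named `my-asg` (`PropagateAtLaunch=false` because these tags are hints for the
autoscaler rather than tags for the EC2 instances themselves):

```
aws autoscaling create-or-update-tags --tags \
  "ResourceId=my-asg,ResourceType=auto-scaling-group,Key=k8s.io/cluster-autoscaler/node-template/label/foo,Value=bar,PropagateAtLaunch=false" \
  "ResourceId=my-asg,ResourceType=auto-scaling-group,Key=k8s.io/cluster-autoscaler/node-template/taint/dedicated,Value=true:NoSchedule,PropagateAtLaunch=false"
```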

**NOTE:** It is your responsibility to ensure such labels and/or taints are
applied via the node's kubelet configuration at startup. Cluster Autoscaler will not set the node taints for you.
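
A sketch of the matching kubelet registration flags for the example tags above:

```
kubelet ... \
  --node-labels=foo=bar \
  --register-with-taints=dedicated=true:NoSchedule
```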

Recommendations:

- It is recommended to use a second tag like
`k8s.io/cluster-autoscaler/<cluster-name>` when
`k8s.io/cluster-autoscaler/enabled` is used across many clusters, to prevent
ASGs from different clusters from being recognized as node groups.
- To prevent conflicts, do not provide a `--nodes` argument if
`--node-group-auto-discovery` is specified.
- Be sure to add `autoscaling:DescribeLaunchConfigurations` or
`ec2:DescribeLaunchTemplateVersions` to the `Action` list of the IAM Policy
used by Cluster Autoscaler, depending on whether your ASG utilizes Launch
Configurations or Launch Templates.
- If Cluster Autoscaler adds a node to the cluster, and the node has taints
applied when it joins the cluster that Cluster Autoscaler was unaware of
(because the tag wasn't supplied), this can lead to significant confusion and
misbehaviour.

### Special note on GPU instances

GPU resources can take some time after node startup to be advertised to the
cluster, which may cause Cluster Autoscaler to scale out more nodes than
necessary. To avoid this, you can configure `kubelet` on your GPU nodes to label the node
before it joins the cluster by passing it the `--node-labels` flag. The label
format is as follows:

- Cluster Autoscaler < 1.15: `cloud.google.com/gke-accelerator=<gpu-type>`
- Cluster Autoscaler >= 1.15: `k8s.amazonaws.com/accelerator=<gpu-type>`

`<gpu-type>` varies by instance type. On P2 instances, for example, the
value is `nvidia-tesla-k80`.
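
For example, with Cluster Autoscaler >= 1.15 on P2 instances, the kubelet might
be started with the following flag (a sketch; how kubelet flags are passed
depends on your AMI and bootstrap tooling):

```
kubelet ... --node-labels=k8s.amazonaws.com/accelerator=nvidia-tesla-k80
```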

## Manual Setup

Cluster Autoscaler can also be configured manually, with one `--nodes` argument
per node group in the format `--nodes=<min>:<max>:<asg-name>`. You can pass
multiple `--nodes` arguments if you have multiple Auto Scaling Groups you want
Cluster Autoscaler to use.
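
A sketch of the resulting container arguments (ASG names and bounds are
illustrative, matching the examples below):

```yaml
command:
  - ./cluster-autoscaler
  - --cloud-provider=aws
  - --nodes=1:10:k8s-worker-asg-1
  - --nodes=1:3:k8s-worker-asg-2
```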

**NOTES**:

- Both `<min>` and `<max>` must be within the range of the minimum and maximum
instance counts specified by the Auto Scaling group.
- When manual configuration is used, all Auto Scaling groups must use EC2
instance types that provide equal CPU and memory capacity.

Examples:

### One ASG Setup (min: 1, max: 10, ASG Name: k8s-worker-asg-1)

```
kubectl apply -f examples/cluster-autoscaler-one-asg.yaml
```

### Multiple ASG Setup

```
kubectl apply -f examples/cluster-autoscaler-multi-asg.yaml
```
<!--TODO: Remove "previously referred to as master" references from this doc once this terminology is fully removed from k8s-->
## Control Plane (previously referred to as master) Node Setup

**NOTE**: This setup is not compatible with Amazon EKS.

To run a CA pod on a control plane node, the CA deployment should tolerate the
`master` taint, and `nodeSelector` should be used to schedule the pods on a
control plane node. Please replace `{{ node_asg_min }}`, `{{ node_asg_max }}`
and `{{ name }}` with your ASG settings in the yaml file.

```
kubectl apply -f examples/cluster-autoscaler-run-on-control-plane.yaml
```
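
A sketch of the scheduling stanza such a deployment might carry, assuming the
control plane nodes use the `node-role.kubernetes.io/master` label and taint
(your distribution may differ):

```yaml
spec:
  nodeSelector:
    node-role.kubernetes.io/master: ""
  tolerations:
    - key: node-role.kubernetes.io/master
      effect: NoSchedule
```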

## Using Mixed Instances Policies and Spot Instances

Pods can be restricted to the instance types configured in an ASG's Mixed
Instances Policy via node affinity, e.g.:

```yaml
spec:
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
          - matchExpressions:
              - key: beta.kubernetes.io/instance-type
                operator: In
                values:
                  - r5.2xlarge
                  - r5d.2xlarge
                  - r5a.2xlarge
                  - r5ad.2xlarge
                  - r5n.2xlarge
                  - r5dn.2xlarge
                  - r4.2xlarge
                  - i3.2xlarge
```

### Example usage:

- Create a [Launch
Template](https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/aws-properties-autoscaling-autoscalinggroup-launchtemplate.html)
(LT) with an instance type, for example, r5.2xlarge. Consider this the 'base'
instance type. Do not define any spot purchase options here.
- Create an ASG with a MixedInstancesPolicy that refers to the newly-created LT.
- Set LaunchTemplateOverrides to include the 'base' instance type r5.2xlarge and
suitable alternatives, e.g. r5d.2xlarge, i3.2xlarge, r5a.2xlarge and
r5ad.2xlarge. Differing processor types and speeds should be evaluated
depending on your use-case(s).
- Set
[InstancesDistribution](https://docs.aws.amazon.com/autoscaling/ec2/APIReference/API_InstancesDistribution.html)
according to your needs.
- See [Allocation
Strategies](https://docs.aws.amazon.com/autoscaling/ec2/userguide/asg-purchase-options.html#asg-allocation-strategies)
for information about how the ASG fulfils capacity from the specified instance
types. It is recommended to use the capacity-optimized allocation strategy,
which will automatically launch Spot Instances into the most available pools
by looking at real-time capacity data.
- For the same workload or for the generic capacity in your cluster, you can
also create more node groups with a vCPU/Mem ratio that is a good fit for your
workloads, but from different instance sizes. For example: Node group 1:
m5.xlarge, m5a.xlarge, m5d.xlarge, m5ad.xlarge, m4.xlarge. Node group 2:
m5.2xlarge, m5a.2xlarge, m5d.2xlarge, m5ad.2xlarge, m4.2xlarge.
See CloudFormation example [here](MixedInstancePolicy.md).
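
As a rough AWS CLI sketch of creating such an ASG (all names, sizes and the
policy file are illustrative; see the referenced example for a full
MixedInstancesPolicy definition):

```
aws autoscaling create-auto-scaling-group \
  --auto-scaling-group-name my-mixed-asg \
  --min-size 1 --max-size 10 \
  --vpc-zone-identifier "subnet-0123abcd" \
  --mixed-instances-policy file://mixed-instances-policy.json \
  --tags "Key=k8s.io/cluster-autoscaler/enabled,Value=true,PropagateAtLaunch=false"
```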

## Use Static Instance List

The set of the latest supported EC2 instance types will be fetched by the CA at
run time. You can find all the available instance types in the CA logs. If your
network access is restricted such that fetching this set is infeasible, you can
specify the command-line flag `--aws-use-static-instance-list=true` to switch
the CA back to its static instance list.

To refresh the static list, run `go run ec2_instance_types/gen.go` under
`cluster-autoscaler/cloudprovider/aws/` and update `staticListLastUpdateTime` in
`aws_util.go`.

## Common Notes and Gotchas:

- The `/etc/ssl/certs/ca-bundle.crt` should exist by default on EC2 instances in
your EKS cluster. If you use other cluster provisioning tools like
[kops](https://github.com/kubernetes/kops) with operating systems other than
Amazon Linux 2, please use `/etc/ssl/certs/ca-certificates.crt` or the correct
path on your host instead for the volume `hostPath` in your cluster autoscaler
manifest.
- If you’re using Persistent Volumes, your deployment needs to run in the same
AZ as where the EBS volume is, otherwise the pod scheduling could fail if it is
scheduled in a different AZ and cannot find the EBS volume. To overcome this,
either use a single AZ ASG for this use case, or an ASG-per-AZ while enabling
`--balance-similar-node-groups`. Alternatively, and depending on the
application, you can switch from using EBS to using shared storage that is
available across AZs (for each pod in its respective AZ). Consider AWS services
like Amazon EFS or Amazon FSx for Lustre.
- At creation time, the ASG will have the [AZRebalance
process](https://docs.aws.amazon.com/autoscaling/ec2/userguide/auto-scaling-benefits.html#AutoScalingBehavior.InstanceUsage)
enabled, which means it will actively work to balance the number of instances
between AZs, and possibly terminate instances. If your applications could be
impacted by sudden termination, consider suspending the AZRebalance process. The
[AWS Node Termination
Handler](https://github.com/aws/aws-node-termination-handler/issues/95) will
also support this use-case in the future.
- By default, cluster autoscaler will not terminate nodes running pods in the
kube-system namespace. You can override this default behaviour by passing in
the `--skip-nodes-with-system-pods=false` flag.
- By default, cluster autoscaler will wait 10 minutes between scale-down
operations; you can adjust this using the `--scale-down-delay-after-add`,
`--scale-down-delay-after-delete`, and `--scale-down-delay-after-failure`
flags. E.g. `--scale-down-delay-after-add=5m` decreases the scale-down delay to
5 minutes after a node has been added.
- If you're running multiple ASGs, the `--expander` flag supports three options:
`random`, `most-pods` and `least-waste`. `random` will expand a random ASG on
scale up. `most-pods` will scale up the ASG that will schedule the greatest
number of pods. `least-waste` will expand the ASG that will waste the least amount of
CPU/MEM resources. In the event of a tie, cluster autoscaler will fall back to
`random`.
- If you're managing your own kubelets, they need to be started with the
`--provider-id` flag. The provider id has the format
`aws:///<availability-zone>/<instance-id>`, e.g.
`aws:///us-east-1a/i-01234abcdef`.
- If you want to use regional STS endpoints (e.g. when using VPC endpoint for
STS) the env `AWS_STS_REGIONAL_ENDPOINTS=regional` should be set.
- If you want to run it on instances with IMDSv1 disabled, make sure your EC2
launch configuration has the setting `Metadata response hop limit` set to `2`.
Otherwise, the `/latest/api/token` call will time out and result in an error.
See [AWS docs here](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/configuring-instance-metadata-service.html#configuring-instance-metadata-options)
for further information.
