
feat: Create launch template for Managed Node Groups #1138

Merged Apr 19, 2021 (19 commits)

Conversation

ArchiFleKs
Contributor

@ArchiFleKs ArchiFleKs commented Dec 7, 2020

Signed-off-by: Kevin Lefevre [email protected]

PR o'clock

Description

Enables the creation of a default launch template, if needed, for use with managed node groups. This enables the use of kubelet_extra_args and makes it possible to add taints quickly, without having to manage a separate launch template Terraform config.

It implements this logic: aws/containers-roadmap#864

I think the launch template defaults might need some trimming; tell me what you think.
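For illustration, a minimal sketch of the intended usage (the values here are examples, not module defaults):

  node_groups = {
    example = {
      create_launch_template = true
      kubelet_extra_args     = "--register-with-taints=dedicated=example:NoSchedule"
      desired_capacity       = 1
      max_capacity           = 3
      min_capacity           = 1
      instance_type          = "m5.large"
    }
  }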

Checklist

@cabrinha
Contributor

cabrinha commented Dec 8, 2020

Did you try creating a cluster using this TF code? I'm getting the following error when using one or more node_groups:

Error: Invalid for_each argument

  on ../../../modules/terraform-aws-eks/modules/node_groups/launchtemplate.tf line 2, in data "template_file" "workers_userdata":
   2:   for_each = { for k, v in local.node_groups_expanded : k => v if v["create_launch_template"] }

The "for_each" value depends on resource attributes that cannot be determined
until apply, so Terraform cannot predict how many instances will be created.
To work around this, use the -target argument to first apply only the
resources that the for_each depends on.
  node_groups = {
    nginx = {
      create_launch_template = true
      desired_capacity = 4
      max_capacity     = 10
      min_capacity     = 3

      instance_type      = "m5.large"
      kubelet_extra_args = "--node-labels=role=nginx,group=nginx"

      additional_tags = {
        group = "nginx"
      }
    }
  }

I was able to get a cluster up by doing the following:

change both for_each = { for k, v in local.node_groups_expanded : k => v if v["create_launch_template"] } statements to:

for_each = local.node_groups_expanded

My next issue was with "disk_size", which didn't get a default value, so I set it to 50.
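Putting the workaround together, roughly (a sketch only; the exact lines in launchtemplate.tf may differ):

  # iterate over all expanded node groups unconditionally, so the for_each
  # keys never depend on values that are only known at apply time
  for_each = local.node_groups_expanded

  # give disk_size an explicit fallback instead of leaving it unset
  disk_size = lookup(each.value, "disk_size", 50)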

EC2 instances are also coming up without the "Name" tag being set. I think you need to add name_prefix to your launch template, like: https://github.com/terraform-aws-modules/terraform-aws-eks/blob/master/examples/launch_templates_with_managed_node_groups/launchtemplate.tf#L21

https://github.com/terraform-aws-modules/terraform-aws-eks/blob/master/workers_launch_template.tf#L5

Another name_prefix spot: https://github.com/terraform-aws-modules/terraform-aws-eks/blob/master/workers_launch_template.tf#L235
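For the Name tag itself, something like this sketch in the generated launch template might do it (the resource and variable names here are assumptions, not the module's actual code):

  resource "aws_launch_template" "workers" {
    name_prefix = "${var.cluster_name}-"

    # tag the instances themselves so they show up with a Name in the EC2 console
    tag_specifications {
      resource_type = "instance"

      tags = {
        Name = "${var.cluster_name}-managed-node"
      }
    }
  }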

@ArchiFleKs
Contributor Author

ArchiFleKs commented Dec 8, 2020

@cabrinha thanks for the review. I actually used Terragrunt + Terraform, but I was able to get a cluster running. I haven't tested without disk_size (I tested with 50 also), so I think we need to set a default, because there is none with a launch template.

If we set the for_each to for_each = local.node_groups_expanded, it will create a launch template for every node group.

About the name: I don't think managed node groups set the Name tag on instances, as even my "classic" managed node pools don't have one.

@ArchiFleKs
Contributor Author

@cabrinha also, the plan is passing with examples/managed_node_groups and:

  node_groups = {
    example = {
      desired_capacity       = 1
      max_capacity           = 10
      min_capacity           = 1
      create_launch_template = true
      kubelet_extra_args     = "--node-labels=role=nginx,group=nginx"

      instance_type = "m5.large"
      k8s_labels = {
        Environment = "test"
        GithubRepo  = "terraform-aws-eks"
        GithubOrg   = "terraform-aws-modules"
      }
      additional_tags = {
        ExtraTag = "example"
      }
    }
  }

@cabrinha
Contributor

cabrinha commented Dec 8, 2020

@cabrinha thanks for the review. I actually used Terragrunt + Terraform, but I was able to get a cluster running. I haven't tested without disk_size (I tested with 50 also), so I think we need to set a default, because there is none with a launch template.

If we set the for_each to for_each = local.node_groups_expanded, it will create a launch template for every node group.

Could we add a simple create = true/false on each group? I guess that's what the create_launch_template flag would do.

About the name: I don't think managed node groups set the Name tag on instances, as even my "classic" managed node pools don't have one.

I wish there was a way to add the Name tag to the instances spun up. The EC2 instances list is horrible without any Names 😅

Also, it'd be nice to add capacity_type too, since AWS now supports Managed Node Groups with Spot Instances.

What version of Terraform are you using?

$ terraform version
Terraform v0.12.29
+ provider.aws v3.20.0
+ provider.kubernetes v1.13.3
+ provider.local v2.0.0
+ provider.null v3.0.0
+ provider.random v3.0.0
+ provider.template v2.2.0

I'm on 0.12.29, using the same example code block you are and I'm still getting the error:

Error: Invalid for_each argument

  on ../../../modules/terraform-aws-eks/modules/node_groups/launchtemplate.tf line 2, in data "template_file" "workers_userdata":
   2:   for_each = { for k, v in local.node_groups_expanded : k => v if v["create_launch_template"] }

The "for_each" value depends on resource attributes that cannot be determined
until apply, so Terraform cannot predict how many instances will be created.
To work around this, use the -target argument to first apply only the
resources that the for_each depends on.

Before:

  disk_size      = lookup(each.value, "disk_size", null)
  instance_types = each.value["launch_template_id"] != null ? [] : [each.value["instance_type"]]

After:

  disk_size      = each.value["launch_template_id"] != null || each.value["create_launch_template"] ? null : lookup(each.value, "disk_size", null)
  instance_types = each.value["launch_template_id"] != null || each.value["create_launch_template"] ? [] : [each.value["instance_type"]]
Contributor

It'd be cool to add capacity_type = lookup(each.value, "capacity_type", "ON_DEMAND") right above this, so users could choose to use "SPOT"
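For illustration, in the node group resource that could look like this (a sketch; the resource name and surrounding code are assumed):

  resource "aws_eks_node_group" "workers" {
    # ... existing arguments ...

    # default to on-demand capacity; set capacity_type = "SPOT" per node group to use Spot
    capacity_type = lookup(each.value, "capacity_type", "ON_DEMAND")
  }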

Contributor Author

I think this is implemented by #1129

@ArchiFleKs
Contributor Author

@cabrinha Alright for the spot and the name.

But I have to admit I really do not understand the part about the create = true/false, because that is what I'm trying to do with for_each = { for k, v in local.node_groups_expanded : k => v if v["create_launch_template"] }, which is the same syntax as here.

I'm using the latest Terraform 0.13.5; I have not tried with 0.12. I will test on my end.

@cabrinha
Contributor

cabrinha commented Dec 8, 2020

@cabrinha Alright for the spot and the name.

But I have to admit I really do not understand the part about the create = true/false, because that is what I'm trying to do with for_each = { for k, v in local.node_groups_expanded : k => v if v["create_launch_template"] }, which is the same syntax as here.

I'm using the latest Terraform 0.13.5; I have not tried with 0.12. I will test on my end.

Supposedly this issue is more likely to happen on a blank tfstate.

@barryib barryib self-assigned this Dec 22, 2020
@barryib barryib changed the title feat: enable default launch template feat: Create launch template for Managed Node Groups Dec 23, 2020
@cabrinha
Contributor

cabrinha commented Dec 29, 2020

Seems these three PRs are all targeting the same goal: this one, #1161, and #1129.

@binnythomas-1989

Guys, I have a question related to remote access: you won't be able to specify remote access on node groups, as per the AWS docs.

Per our documentation[1] When using a launch template, if any of the following parameters are specified in the node group configuration, your create or update request will fail. Specify these in your launch template:

- Instance type
- Disk size
- Remote access configuration
- EC2 SSH key

I don't see this handled in your launch_template.tf code. Would be great if you can clarify.

@ArchiFleKs
Contributor Author

Guys, I have a question related to remote access: you won't be able to specify remote access on node groups, as per the AWS docs.

Per our documentation[1] When using a launch template, if any of the following parameters are specified in the node group configuration, your create or update request will fail. Specify these in your launch template:

- Instance type
- Disk size
- Remote access configuration
- EC2 SSH key

I don't see this handled in your launch_template.tf code. Would be great if you can clarify.

You mean to perform validation on the input?

@binnythomas-1989

binnythomas-1989 commented Jan 11, 2021

Guys, I have a question related to remote access: you won't be able to specify remote access on node groups, as per the AWS docs.

Per our documentation[1] When using a launch template, if any of the following parameters are specified in the node group configuration, your create or update request will fail. Specify these in your launch template:

- Instance type
- Disk size
- Remote access configuration
- EC2 SSH key

I don't see this handled in your launch_template.tf code. Would be great if you can clarify.

You mean to perform validation on the input?

What I mean is: if you add remote access as an input to node groups with an associated launch template, you end up with the error below.

Error: error creating EKS Node Group (nodegroup-test:x86_64-driving-louse): InvalidParameterException: Remote access configuration cannot be specified with a launch template.

You would need to handle remote_access in launch_template.tf. If I'm wrong, do correct me.
Sample link:
https://github.com/cloudposse/terraform-aws-eks-node-group/blob/master/launch-template.tf

@ArchiFleKs
Contributor Author

@binnythomas-1989 I think I understand: we need to prevent setting remote_access here: https://github.com/terraform-aws-modules/terraform-aws-eks/blob/master/modules/node_groups/node_groups.tf#L21 if a launch template is used.
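Roughly something like this sketch (attribute names are assumptions, not final code):

  dynamic "remote_access" {
    # skip the block entirely whenever a launch template is in play
    for_each = each.value["launch_template_id"] == null && !each.value["create_launch_template"] && lookup(each.value, "key_name", "") != "" ? ["enabled"] : []

    content {
      ec2_ssh_key = each.value["key_name"]
    }
  }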

@binnythomas-1989

binnythomas-1989 commented Jan 11, 2021

You are correct @ArchiFleKs. You would need to add the key on the launch template too.

@binnythomas-1989

binnythomas-1989 commented Jan 12, 2021

Guys, adding the kubelet arguments is great. I just tested it. I'm kinda sorry, but Amazon's EKS is really annoying.

So let me explain what I have figured out.
You can add node taints with something like the below.

kubelet_extra_args = "--node-labels=eks.amazonaws.com/nodegroup=company-net --register-with-taints=network=company:NoSchedule"

The problem is, once you add taints with NoSchedule for example, the node won't join the cluster. You would have issues with CoreDNS pod scheduling, since it's a Deployment; it won't be able to schedule.

So letting people use this for taints is a bad idea on EKS.

@ArchiFleKs
Contributor Author

@binnythomas-1989 it is true for CoreDNS; aws-node tolerates every taint. You should always have a "default" pool, or a "CriticalAddonsOnly" pool if you want.

I agree this is kind of a power-user feature; node join also fails if you are using a forbidden kubernetes.io label. But giving users the ability to have "reserved" node pools, for GPU etc., is a must-have feature in my opinion, while it is not natively supported by the EKS API, whereas labels are.
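For example, a sketch of that pattern (illustrative values: an untainted default pool for add-ons like CoreDNS next to a tainted, reserved pool):

  node_groups = {
    default = {
      desired_capacity = 2
      max_capacity     = 3
      min_capacity     = 2
      instance_types   = ["m5.large"]
    }

    gpu = {
      create_launch_template = true
      kubelet_extra_args     = "--node-labels=role=gpu --register-with-taints=dedicated=gpu:NoSchedule"
      desired_capacity       = 1
      max_capacity           = 3
      min_capacity           = 1
      instance_types         = ["g4dn.xlarge"]
    }
  }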

@ArchiFleKs
Contributor Author

There is an example right now of how to use a launch template, but it is not straightforward. This PR's goal is to enable a simple configuration while still allowing a custom launch template if needed, or a basic node pool.

eksctl also uses a default launch template to enable taints on node groups.

@binnythomas-1989

@binnythomas-1989 it is true for CoreDNS; aws-node tolerates every taint. You should always have a "default" pool, or a "CriticalAddonsOnly" pool if you want.

I agree this is kind of a power-user feature; node join also fails if you are using a forbidden kubernetes.io label. But giving users the ability to have "reserved" node pools, for GPU etc., is a must-have feature in my opinion, while it is not natively supported by the EKS API, whereas labels are.

kube-proxy and aws-node are DaemonSets, so that's okay; it's just that CoreDNS is an issue since it's a Deployment. Anyway, it could be useful to add a caveat in the README. I was trying to adopt your solution in the local Terraform module I use. I'm planning to handle this with Terraform using the kubectl provider for now. :-) I don't use eksctl.

@ArchiFleKs
Contributor Author

@binnythomas-1989 this: https://github.com/terraform-aws-modules/terraform-aws-eks/pull/1138/files#diff-7a3fc6c7df17fda0c341e61255461bf1f149256a9ddf14d4a18ab6f020d08136R22 should take care of remote access, could you try? If not I'll try to test today.

@binnythomas-1989

@binnythomas-1989 this: https://github.com/terraform-aws-modules/terraform-aws-eks/pull/1138/files#diff-7a3fc6c7df17fda0c341e61255461bf1f149256a9ddf14d4a18ab6f020d08136R22 should take care of remote access, could you try? If not I'll try to test today.

This still won't work, because now you need to handle the remote access on the launch template, and it would go like this:

  key_name = each.value.ec2_ssh_key_pair

So that bit needs to be handled in launch_template.tf.

@ArchiFleKs
Contributor Author

@binnythomas-1989 this: https://github.com/terraform-aws-modules/terraform-aws-eks/pull/1138/files#diff-7a3fc6c7df17fda0c341e61255461bf1f149256a9ddf14d4a18ab6f020d08136R22 should take care of remote access, could you try? If not I'll try to test today.

This still won't work, because now you need to handle the remote access on the launch template, and it would go like this:

  key_name = each.value.ec2_ssh_key_pair

So that bit needs to be handled in launch_template.tf.

I added the key_name lookup

@bobbywatson3

This is exactly what we're looking for. Thank you for the work done on this so far!

@barryib barryib left a comment (Member)

I just realized that I didn't submit my review. It was in pending state.

}

# if you want to use a custom AMI
# image_id = var.ami_id
Member

Why can't we allow this?

Contributor Author

I think because, if using a custom image, this does not apply: https://github.com/terraform-aws-modules/terraform-aws-eks/blob/master/examples/launch_templates_with_managed_node_groups/launchtemplate.tf#L18. I think this leads to different behavior between the EKS AMI and other AMIs, but I have not tested with a custom AMI.

Contributor Author

I think we should stay simple, because we still allow users to use a custom launch template if needed. Or maybe this can be added in another PR, as I'm not really sure how to handle cloud-init with a custom AMI.

Contributor Author

This is not implemented for now.

Contributor

As you partially copied the LT from the examples that were added in my #997, I might be able to help here:

I am using an LT with a custom AMI and it works just fine. However, there are indeed subtle but important differences between using an LT with or without a custom AMI. In the old PR, someone else described it quite well; see #997 (comment).

He also mentions a then-required fix to the MIME boundary when using cloud-init.

Contributor Author

@philicious do you mean that this part should be added manually when using a custom AMI:

set -ex
B64_CLUSTER_CA=LS0tLS1CRUdJTiBDRVJUSUZJQ0FURS0tLS0tCk1JSUN5RENDQWJDZ0F3SUJBZ0lCQURBTkJna3Foa2lHOXcwQkFRc0ZBREFWTVJNd0VRWURWUVFERXdwcmRXSmwKY201bGRHVnpNQjRYRFRJeE1ESXdNakUyTXpJeU0xb1hEVE14TURFek1URTJNekl5TTFvd0ZURVRNQkVHQTFVRQpBeE1LYTNWaVpYSnVaWFJsY3pDQ0FTSXdEUVlKS29aSWh2Y05BUUVCQlFBRGdnRVBBRENDQVFvQ2dnRUJBTkhXCk5vZjgxekorcGIxdEswMXRWVExSNEd0NDBDbkw5TU5vV0hSWGc3WndNVFkzcHVQMm05TlkvSXJ2bEZ2dDNNUVcKejUrb0FRdU8rcHA2RUFQOEZFK0JGaUVSVXpMZTYvbXFscGg2S2hmOEsyQU45QUN2RUYvMWlYNlQvWFlDdlRrRQp5MmhYSk1CUnVGSVF6dGVSaDEwRTFBZG5UWDdxNUY5RlhIY2VzR285TGlPbmRNMVpQRGpPS2lnZ0hMK2xheG4wCnN0bDlxeGZrYWZpMHNzb0ZCcUM3eGU1SGt2OVowYTYvRmxWeVNXazFQQXFCWDZOTlUvc0RjNTA3bXN0OEVMc0oKSU9naWFTcGZLaXVnekZNaTlTS3NQbjRQcm94UDEwRlErOGpSdTZZdm9tQmswMHFnU2NFTGxadng0bG1CVGloSgpCdDdFTlUxMzdvSXdhY1pCUUNFQ0F3RUFBYU1qTUNFd0RnWURWUjBQQVFIL0JBUURBZ0trTUE4R0ExVWRFd0VCCi93UUZNQU1CQWY4d0RRWUpLb1pJaHZjTkFRRUxCUUFEZ2dFQkFJam1ZWmthV3NuQ1lSNkpQUGw1WmVpcGkzYkYKREpBUzgvM2E4UFVnL3BsWTFVYlhCalU3b0FDb21UUzd2Y2hPUFU5aFNXdC9jNit5RnF5a0FwakMyRjFuSHg4WQpaQUg5NDFWYUNzRyt3VmE3MTJlcFRPTSt1TWxNSENFYVlMVTRKOXEvaUd1aVZtM2NPOGhmMTFoNjVGd3NuekE0CmdqQ0YxUC9Sdi9acnFSSk9XZmJaRE00MzlwajVqQzNYRVAyK1FXVlIzR2tzbW1NcDVISm9NZW5JaDBSTFhnK1oKTVRVNXFsdW0xTWZDdXRNVjkzNGJFQ21BRERJSm4rZVdHSERwRi9QOThnR1RyRU1QclhiUXZMblpwZHBNYldjNQp5LzZldkNtYXozMzllSlUwWkRaM1M0R2YvbEpBUTBZcFZoQkRlS2hXVHEwSXJYb2NWWHU5MDN0OXU5TT0KLS0tLS1FTkQgQ0VSVElGSUNBVEUtLS0tLQo=
API_SERVER_URL=https://A62655F81AE9347A761BB172E28A633F.sk1.eu-west-1.eks.amazonaws.com
K8S_CLUSTER_DNS_IP=172.20.0.10
/etc/eks/bootstrap.sh pio-thanos --kubelet-extra-args '--node-labels=eks.amazonaws.com/sourceLaunchTemplateVersion=1,GithubRepo=terraform-aws-eks,eks.amazonaws.com/nodegroup-image=ami-066fad1ae541d1cf9,Environment=test,eks.amazonaws.com/capacityType=ON_DEMAND,eks.amazonaws.com/nodegroup=pio-thanos-default-eu-west-1a-expert-bass,eks.amazonaws.com/sourceLaunchTemplateId=lt-079cbc5cf74ace131,GithubOrg=terraform-aws-modules' --b64-cluster-ca $B64_CLUSTER_CA --apiserver-endpoint $API_SERVER_URL --dns-cluster-ip $K8S_CLUSTER_DNS_IP

About the boundaries, I think this is fixed and I implemented it here.

I'm just not sure how to handle the custom AMI; how do you do it on your end? From what I understand, you need to pass the bootstrap command on your own when using a custom AMI, because it does not get merged.

Contributor

You could be right about the boundaries. I remember that someone wanted to update the cloud-init to support custom boundaries. So yeah, probably; and if this PR produces a running EKS, then it's proven.

And yeah, with a custom AMI you have to supply the entire userdata yourself, as no merging happens with the default one from EKS. So in the end I use it like I added in the examples.

I would have to have a closer look and do tests on how to add custom AMI support and satisfy these differences.

It would for sure be great if the module could also handle custom AMIs, given it already has LT generation added.

@stevehipwell
Contributor

@ArchiFleKs do you have a rough timeline for this work?

Also, would it be possible to add support for pre_userdata and additional_userdata? We need this for custom certificates, docker credentials and custom labels from AWS metadata.

@cabrinha
Contributor

@ArchiFleKs looks like we got some conflicts here. Can you fix these up please?

@cabrinha I"m not sure I'll have the time to retest after the merge this week end but I'll try when I can. It would be great if you could or someone else

Sure, I can retest this at any time and post my configs here.

@ArchiFleKs
Contributor Author

Alright, conflicts should be fixed now.

@cabrinha
Contributor

@ArchiFleKs looks like we got some conflicts here. Can you fix these up please?

@cabrinha I"m not sure I'll have the time to retest after the merge this week end but I'll try when I can. It would be great if you could or someone else

Just tested this config:

  node_groups = {
    managed = {
      desired_capacity = 1
      max_capacity     = 5
      min_capacity     = 1

      instance_types = [
        "c3.2xlarge",
        "c4.xlarge",
        "c4.2xlarge",
      ]
      capacity_type  = "SPOT"
      root_volume_type = "gp2"
      root_volume_size = 10
      kubelet_extra_args = "--node-labels=node.kubernetes.io/lifecycle=spot,role=worker,node.kubernetes.io/exclude-from-external-load-balancers --register-with-taints=dedicated=managed:NoSchedule"
      k8s_labels = {
        Environment = "test"
        GithubRepo  = "terraform-aws-eks"
        GithubOrg   = "terraform-aws-modules"
      }

      additional_tags = {
        CustomTag = "EKS example"
      }
    }
  }

Seems to be working well

@devy294

devy294 commented Apr 19, 2021

When will this be merged?

@cabrinha
Contributor

cabrinha commented Apr 19, 2021

When will this be merged?

Great question. @barryib time to review and merge?

@stevehipwell
Contributor

@barryib it looks like the changes in this PR didn't make it into v15.0.0 or v15.1.0, do you have a plan for when they are going to be released?

@cabrinha
Contributor

cabrinha commented Apr 20, 2021

@barryib it looks like the changes in this PR didn't make it into v15.0.0 or v15.1.0, do you have a plan for when they are going to be released?

Aren't the changes here? v15.1.0...master 2e1651d

@archoversight

@barryib it looks like the changes in this PR didn't make it into v15.0.0 or v15.1.0, do you have a plan for when they are going to be released?

Aren't the changes here? v15.1.0...master 2e1651d

That first link is showing the diff between v15.1.0 and master. Your second link shows that it is only on master and not yet in any tags.

@martin308

I'm not seeing this work with kubelet_extra_args. When this is used, the userdata provided to the launch template only includes the additions from kubelet_extra_args and none of the other parameters required to bootstrap.

Example:

  retriever = {
    instance_types         = ["c5.xlarge"]
    desired_capacity       = 1
    max_capacity           = 1
    min_capacity           = 1
    disk_size              = 20
    kubelet_extra_args     = "--node-labels=test=extra_args"
    create_launch_template = true
  }

Produces the following userdata for the launch configuration:

Content-Type: multipart/mixed; boundary="//"
MIME-Version: 1.0

--//
Content-Transfer-Encoding: 7bit
Content-Type: text/x-shellscript
Mime-Version: 1.0

#!/bin/bash -e

# Allow user supplied pre userdata code


sed -i '/^KUBELET_EXTRA_ARGS=/a KUBELET_EXTRA_ARGS+=" --node-labels=test=extra_args"' /etc/eks/bootstrap.sh

--//--

This is missing all of the other required parameters provided by the EKS AMI, which are vital for registering the nodes with the cluster.

Am I missing something or does this not work?

@stevehipwell
Contributor

@martin308 this hasn't been released yet, so unless you're using ref=master it won't do anything.

@jfoechsler


This is missing all of the other required parameters provided by the EKS AMI, which are vital for registering the nodes with the cluster.

Am I missing something or does this not work?

@martin308 I'm pretty sure what you are missing/forgetting is this: https://docs.aws.amazon.com/eks/latest/userguide/launch-templates.html#launch-template-user-data, about using custom user data snippets while still using the official AMI.

You would need full user data in the case of a custom AMI, but in that case you would also have created the full regular launch template.

@martin308

@martin308 this hasn't been released yet, so unless you're using ref=master it won't do anything.

yup, pulling down the merge commit by ref 👍

source = "github.com/terraform-aws-modules/terraform-aws-eks?ref=2e1651df86bd315000738cf901a4cc0586be1af3"


This is missing all of the other required parameters provided by the EKS AMI, which are vital for registering the nodes with the cluster.

Am I missing something or does this not work?

@martin308 I'm pretty sure what you are missing/forgetting, is this: https://docs.aws.amazon.com/eks/latest/userguide/launch-templates.html#launch-template-user-data about using custom user data snippets while still using official AMI.

You would need full user data in case of custom AMI, but in that case you would also have created the full regular launch template.

I'm not using a custom AMI, as per my example. I guess I'm just confused as to how to use the kubelet_extra_args feature added in this PR.

Are you saying that it is expected that my example above would not work? If so, is there an example of how to make use of the kubelet_extra_args feature with the official AMI?

@jfoechsler

Are you saying that it is expected that my example above would not work? If so, is there an example of how to make use of the kubelet_extra_args feature with the official AMI?

No, I'm saying the opposite :) My understanding is that your example, with the resulting LT, should work and be able to join the cluster (due to the merging of user data being done outside the control of this Terraform module). I'm interested to hear if your testing confirms that.

@ArchiFleKs
Contributor Author

I just tested with master and the following configuration:

  node_groups = {
    "default-${local.aws_region}a" = {
      ami_type         = "AL2_ARM_64"
      desired_capacity = 1
      max_capacity     = 3
      min_capacity     = 1
      instance_types   = ["t4g.large"]
      subnets          = [dependency.vpc.outputs.private_subnets[0]]
      disk_size        = 20
    }

    "default-${local.aws_region}b" = {
      ami_type         = "AL2_ARM_64"
      desired_capacity = 1
      max_capacity     = 3
      min_capacity     = 1
      instance_types   = ["t4g.large"]
      subnets          = [dependency.vpc.outputs.private_subnets[1]]
      disk_size        = 20
    }

    "default-${local.aws_region}c" = {
      ami_type               = "AL2_ARM_64"
      create_launch_template = true
      desired_capacity       = 1
      max_capacity           = 3
      min_capacity           = 1
      instance_types         = ["t4g.large"]
      subnets                = [dependency.vpc.outputs.private_subnets[2]]
      kubelet_extra_args     = "--node-labels=role=private --register-with-taints=dedicated=private:NoSchedule"
      disk_size              = 20
    }
  }

It is working as expected.

@ArchiFleKs
Contributor Author

I can confirm it works with x86 also:

  node_groups = {
    "default-${local.aws_region}a" = {
      ami_type         = "AL2_ARM_64"
      desired_capacity = 1
      max_capacity     = 3
      min_capacity     = 1
      instance_types   = ["t4g.large"]
      subnets          = [dependency.vpc.outputs.private_subnets[0]]
      disk_size        = 20
    }

    "default-${local.aws_region}b" = {
      ami_type         = "AL2_ARM_64"
      desired_capacity = 1
      max_capacity     = 3
      min_capacity     = 1
      instance_types   = ["t4g.large"]
      subnets          = [dependency.vpc.outputs.private_subnets[1]]
      disk_size        = 20
    }

    "default-${local.aws_region}c" = {
      ami_type               = "AL2_ARM_64"
      create_launch_template = true
      desired_capacity       = 1
      max_capacity           = 3
      min_capacity           = 1
      instance_types         = ["t4g.large"]
      subnets                = [dependency.vpc.outputs.private_subnets[2]]
      kubelet_extra_args     = "--node-labels=role=private --register-with-taints=dedicated=private:NoSchedule"
      disk_size              = 20
    }

    "taint-${local.aws_region}c" = {
      create_launch_template = true
      desired_capacity       = 1
      max_capacity           = 3
      min_capacity           = 1
      instance_types         = ["t3a.large"]
      subnets                = [dependency.vpc.outputs.private_subnets[2]]
      kubelet_extra_args     = "--node-labels=role=private --register-with-taints=dedicated=private:NoSchedule"
      disk_size              = 20
    }
  }

@ArchiFleKs
Contributor Author

You have to use the create_launch_template flag, which is not set by default, or else kubelet_extra_args is not passed to anything.

@stevehipwell
Contributor

@ArchiFleKs I've been looking at further customizing the managed node group bootstrap process, and I'd be interested to know if you tried setting the KUBELET_EXTRA_ARGS environment variable instead of using sed?

@ipleten

ipleten commented Jul 15, 2021

@ArchiFleKs I've been looking at further customizing the managed node group bootstrap process, and I'd be interested to know if you tried setting the KUBELET_EXTRA_ARGS environment variable instead of using sed?

I tried, and it seems cloud-init doesn't preserve exported variables between its parts (without a custom AMI, user data gets merged with the one provided by AWS). One solution might be to write vars to some file like /etc/eks/bootstrap-vars and modify bootstrap.sh to read them later.

@stevehipwell
Contributor

@ipleten it was a leading question; I've done this exact thing for some of the other env variables by persisting the export. I'll probably open a PR to change this, as it's more resilient to AMI changes than the sed solution.

@stevehipwell
Contributor

@ipleten it looks like I already did, #1433.

@github-actions

I'm going to lock this pull request because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Nov 14, 2022