Newly refactored module #774

Closed
js-timbirkett opened this issue Mar 10, 2020 · 21 comments

Comments

@js-timbirkett

Hello 👋

Firstly, this module is great, and it has a lot to deal with: standing up an EKS cluster and its nodes / workers is not a trivial task.

There have been issues regarding orchestrating the creation of the control plane and the workers / nodes [#519], issues questioning complexity or coupling within the module [#635], questions regarding LC / LT worker_groups and whether they should be merged or simplified [#563], and issues around numeric indexes and modifying arrays in general.

I had a go at creating a new, refactored version of this module and my attempt is here:
https://github.com/devopsmakers/terraform-aws-eks

It breaks things out into sub-modules that can be used both by the parent module (to preserve the simple "single module gets you a working cluster" experience) and directly by the user of the module for more complex situations (e.g. a custom CNI).

It also allows a map of maps for worker groups, and the code has been simplified a bit (reducing the death-by-lookup() in the current worker-related code).
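
For illustration, a map-of-maps input could look roughly like this (the local name and attribute keys are assumptions for the sketch, not the refactored module's actual interface):

locals {
  # Illustrative only: each worker group is keyed by name rather than by a
  # numeric list index, so adding or removing a group doesn't shift the others.
  worker_groups = {
    general = {
      instance_type        = "m5.large"
      asg_min_size         = 1
      asg_desired_capacity = 2
      asg_max_size         = 5
    }
    spot = {
      instance_type        = "m5.xlarge"
      spot_price           = "0.10"
      asg_desired_capacity = 1
      asg_max_size         = 10
    }
  }
}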

I made a few tough decisions that might stimulate some debate and possibly some outrage. These are highlighted in the repo's README.

I'm not saying it's perfect (yet), but I think it would make a good base for a fairly major refactor of this module and I'd be glad to assist in any way I can. Feel free to use any ideas or the codebase itself to further improve this module.

Thanks!

@barryib
Member

barryib commented Mar 10, 2020

Hello @js-timbirkett thanks for opening this discussion and for your work. It's a good start.

Can you please open a PR so we can iterate and give feedback?

For the design decisions:

  • To me, we can move worker groups into a separate submodule, as we already did for managed node groups.
  • As for dropping LC support, I don't have a strong opinion on this. I personally don't use LC at all, but I don't know if there are a lot of people out there who still rely on it. See Is the complexity of this module getting too high? #635 (comment)

@max-rocket-internet @dpiddockcmp @antonbabenko please advise (mostly for the LC removal)

@antonbabenko
Member

+1 for dropping LC. I think we all agreed in some previous issues that dropping support for LC is the way to go.

I wonder, are there any downsides to using LT at all compared to LC?

@max-rocket-internet
Contributor

I wonder, are there any downsides to using LT at all compared to LC?

No downsides to LT that I know of; LC is just older and has fewer features.

@max-rocket-internet
Contributor

I had a go at creating a new, refactored version of this module

Sounds great, but we need a PR to review and a path forward for current users.

I look forward to seeing some PRs 😃

@js-timbirkett
Author

Hey 👋

The refactored module is still very much in a beta state. When I've finished shaving the Terraform yak and double-checking outputs, inputs, etc., I'll open a big ole PR for further discussion and improvements.

It'll definitely be a major release and not something that's easily migratable to without a lot of Terraform state moving or importing (could probably write up a guide on that).

@chancez

chancez commented Mar 18, 2020

@js-timbirkett I'll be giving your worker group module a try. I'm vendoring it, though, since I need to change ignore_changes for the ASG so that it doesn't ignore desired capacity: we're using TF, not the cluster autoscaler, to manage workers.
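
For context, the change being described is roughly the following (a sketch only, not the module's actual resource definition; several required ASG arguments are simplified or omitted):

resource "aws_autoscaling_group" "workers" {
  name_prefix         = "eks-workers-"
  min_size            = 1
  max_size            = 5
  desired_capacity    = 2
  vpc_zone_identifier = var.subnet_ids

  launch_template {
    id      = aws_launch_template.workers.id
    version = "$Latest"
  }

  lifecycle {
    # With desired_capacity listed here, Terraform leaves scaling decisions to
    # the cluster autoscaler; removing it from ignore_changes lets Terraform
    # reconcile desired capacity itself.
    ignore_changes = [desired_capacity]
  }
}

Because ignore_changes has to be a static list (it can't be driven by a variable), flipping this behaviour means editing the module source, hence the vendoring.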

@js-timbirkett
Author

Oh nice @chancez - Thanks for the 🐛 fix!

I'll be doing some final editing of my version of the module early next week when I have some time off work. Then I'll open what will be a rather large PR against this module and see how it goes...

@barryib
Member

barryib commented Apr 22, 2020

@js-timbirkett any update here?

@barryib
Member

barryib commented May 8, 2020

@js-timbirkett are you still working on this?

@ptphuc

ptphuc commented Jul 23, 2020

I'm really hoping these changes can happen. Any updates on this, @js-timbirkett? Many thanks for your contribution so far.

@barryib
Member

barryib commented Jul 23, 2020

I was wondering if we shouldn't wait for Terraform 0.13 with its loop support for modules.

With that, we can split the submodule to deploy one worker group and let users loop over the submodule to create their worker groups.
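
A rough sketch of that pattern, assuming a hypothetical ./modules/worker-group submodule and illustrative input names:

module "worker_groups" {
  source   = "./modules/worker-group" # hypothetical submodule path
  for_each = local.worker_groups      # map of maps, keyed by group name

  cluster_name         = module.eks.cluster_id
  name                 = each.key
  instance_type        = each.value.instance_type
  asg_desired_capacity = each.value.asg_desired_capacity
  asg_max_size         = each.value.asg_max_size
}

for_each (and count) on module blocks requires Terraform >= 0.13, hence the suggestion to wait.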

@max-rocket-internet @dpiddockcmp @antonbabenko please advise.

@antonbabenko
Member

antonbabenko commented Jul 24, 2020 via email

@dpiddockcmp
Contributor

0.13 doesn't appear to require any massive breaking changes to upgrade; it looks like most 0.12 code should run fine under it, although I haven't experimented very heavily. I wonder if that will speed or slow adoption?

Either way, we should wait a good period of time before adding 0.13-only features that probably have bugs and edge cases. 6 months?

There's also the MNG submodule to consider. It already uses the pattern of an internal for_each, and it would be good for submodules to be consistent in behaviour. So it would need migrating, with breaking changes for users, if we did go down the for_each-over-modules path.

My vote, right now, would be to stick with for_each inside the submodules if we want to implement these changes soon.
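
For contrast, a minimal sketch of the 0.12-compatible pattern being voted for here, with the for_each inside the submodule (resource and variable names are illustrative):

variable "worker_groups" {
  description = "Map of worker group definitions, keyed by group name"
  type        = map(any)
}

variable "subnet_ids" {
  type = list(string)
}

resource "aws_autoscaling_group" "this" {
  for_each = var.worker_groups

  name_prefix         = "${each.key}-"
  min_size            = each.value["asg_min_size"]
  max_size            = each.value["asg_max_size"]
  desired_capacity    = each.value["asg_desired_capacity"]
  vpc_zone_identifier = var.subnet_ids
  # launch template, tags, IAM instance profile, etc. omitted for brevity
}

for_each on resources only needs Terraform >= 0.12.6, so this works without waiting for 0.13.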

@max-rocket-internet
Contributor

wait for Terraform 0.13 with its loop support for modules.

Sounds good to me 🚀

@stale

stale bot commented Oct 25, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.


@stale stale bot added the stale label Jan 23, 2021
@barryib barryib removed the stale label Jan 29, 2021

@stale stale bot added the stale label Apr 29, 2021

@stale stale bot added the stale label Aug 2, 2021
@stale

stale bot commented Sep 2, 2021

This issue has been automatically closed because it has not had recent activity since being marked as stale.

@stale stale bot closed this as completed Sep 2, 2021
@willquill

If you're like me, you may use this module to create a cluster with no nodes, delete the aws-node DaemonSet to disable AWS VPC networking for pods, apply a different CNI such as Calico (as described here), and then come back to the module to add worker nodes. For that workflow, you can absolutely configure conditional worker_group creation via a boolean variable.

I put example code for this in Issue #861 in my post here: #861 (comment)

And I actually took it a step further, creating a repository called terraform-eks to act as a personal module with the terraform-aws-eks module nested inside of it. Now I can tag specific commits of my personal repository with the AMI version used in the worker nodes and invoke an entire cluster and all of its dependencies with only 15 lines of code:

module "eks" {
  source                   = "git::ssh://[email protected]:1337/my-project/terraform-eks.git?ref=v1.21.0"
  environment              = var.environment
  env                      = var.env
  tags                     = local.tags
  source_ami_region        = "us-east-2"
  vpc_id                   = local.vpc_id
  eks_subnets              = data.aws_subnet_ids.eks_subnets.ids
  eks_tags                 = var.eks_tags
  eks_worker_instance_type = "m5.4xlarge"
  asg_max_size             = 3
  asg_min_size             = 1
  asg_desired_capacity     = 1
  create_workers           = true
}

With this method, you first apply when create_workers is false, then go do the CNI stuff, and finally come back and set create_workers to true.
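
For reference, a minimal sketch of how a create_workers flag can gate the worker group inside a wrapper module like the one above (not the exact code from #861; variable names follow the example and other required inputs are omitted):

variable "create_workers" {
  type    = bool
  default = false
}

module "eks" {
  source = "terraform-aws-modules/eks/aws"

  cluster_name = var.cluster_name
  vpc_id       = var.vpc_id
  subnets      = var.eks_subnets

  # The first apply, with create_workers = false, builds the control plane
  # only; after swapping the CNI, set it to true and apply again to add workers.
  worker_groups = var.create_workers ? [
    {
      instance_type        = var.eks_worker_instance_type
      asg_min_size         = var.asg_min_size
      asg_max_size         = var.asg_max_size
      asg_desired_capacity = var.asg_desired_capacity
    }
  ] : []
}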

Doing something like this is a game changer in terms of avoiding code repetition if you have multiple EKS clusters. The main.tf in my use of the official, public module is 104 lines; with four clusters, that's 416 lines of code. With my method, it can be 104 lines for your personal version of the public module plus 15 + 15 + 15 + 15, for a total of 164 lines of code (give or take, as you still have to account for variables, locals, and data calls).

@antonbabenko antonbabenko unpinned this issue Apr 7, 2022
@github-actions

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Nov 17, 2022