feat: Enable support for ignore_failed_scaling_activities #3104

Merged: 4 commits merged into terraform-aws-modules:master on Jul 19, 2024

Contributor

@ivankatliarchuk ivankatliarchuk commented Jul 19, 2024

Description

Adds support for ignore_failed_scaling_activities. This is not a breaking change for existing clusters, because the default value is false.
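A minimal sketch of how the new option might be set from the root module. It assumes the option is read from each self_managed_node_groups entry in the same way as the other Auto Scaling group settings. The cluster name, VPC ID, subnet IDs, node group name, and sizes are placeholders, not values from this PR.

```hcl
module "eks" {
  source  = "terraform-aws-modules/eks/aws"
  version = "~> 20.20" # 20.20.0 is the release that includes this PR

  cluster_name    = "example" # placeholder
  cluster_version = "1.27"

  vpc_id     = "vpc-0123456789abcdef0" # placeholder
  subnet_ids = [                       # placeholders
    "subnet-0123456789abcdef0",
    "subnet-0fedcba9876543210",
  ]

  self_managed_node_groups = {
    # "blue" is a hypothetical node group name
    blue = {
      min_size     = 1
      max_size     = 50
      desired_size = 10

      # Opt in per node group; when omitted, the default (false) applies
      ignore_failed_scaling_activities = true
    }
  }
}
```

Because the default is false, existing configurations that do not set the key should plan with no changes to their Auto Scaling groups.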

Motivation and Context

Our setup

  • AWS EKS clusters, ~500 nodes each
  • Multiple ASGs with self-managed node groups
  • Cluster Autoscaler scales nodes up and down
  • 99% of instances are Spot
  • Multiple scaling activities occur every minute
  • We run a set of Blue and Green ASGs

So what is happening?

  • We change an ASG's min or max capacity
  • At the same time, Cluster Autoscaler scales a few nodes up or down
  • The following error appears: Error: waiting for Auto Scaling Group (eks-group1-1.27-asfsdfasdf) capacity satisfied: timeout while waiting for state to become 'ok' (last state: 'want exactly 44 healthy instance(s) in Auto Scaling Group, have 45', timeout: 10m0s

Breaking Changes

This should not break anything; the new option defaults to false.

How Has This Been Tested?

  • I have updated at least one of the examples/* to demonstrate and validate my change(s)
  • I have tested and validated these changes using one or more of the provided examples/* projects
  • I have executed pre-commit run -a on my pull request
  • I have tested and validated the change on our current cluster

@ivankatliarchuk ivankatliarchuk changed the title feat(issue-3102): added support for ignore_failed_scaling_activities feat: added support for ignore_failed_scaling_activities Jul 19, 2024
@ivankatliarchuk ivankatliarchuk changed the title feat: added support for ignore_failed_scaling_activities feat: Enable support for ignore_failed_scaling_activities Jul 19, 2024
Review comment on modules/self-managed-node-group/variables.tf (outdated, resolved)
Review comment on node_groups.tf (outdated, resolved)
@bryantbiggs bryantbiggs merged commit 532226e into terraform-aws-modules:master Jul 19, 2024
19 checks passed
antonbabenko pushed a commit that referenced this pull request Jul 19, 2024
## [20.20.0](v20.19.0...v20.20.0) (2024-07-19)

### Features

* Enable support for ignore_failed_scaling_activities ([#3104](#3104)) ([532226e](532226e))
@antonbabenko
Member

This PR is included in version 20.20.0 🎉

@ivankatliarchuk ivankatliarchuk deleted the feat-3102 branch July 19, 2024 14:46
Argote-Gustavo-bah pushed a commit to navwar-seastar/terraform-aws-eks that referenced this pull request Jul 23, 2024
…aws-modules#3104)

* feat(issue-3102): added support for ignore_failed_scaling_activities

Signed-off-by: ivan katliarchuk <[email protected]>

* Update node_groups.tf

Co-authored-by: Bryant Biggs <[email protected]>

* Update modules/self-managed-node-group/variables.tf

Co-authored-by: Bryant Biggs <[email protected]>

* feat(issue-3102): pre-commit checks sorted

Signed-off-by: ivan katliarchuk <[email protected]>

---------

Signed-off-by: ivan katliarchuk <[email protected]>
Co-authored-by: Bryant Biggs <[email protected]>

I'm going to lock this pull request because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 19, 2024
Development

Successfully merging this pull request may close these issues.

Add support for ignore_failed_scaling_activities