[Bug]: aws_emr_instance_group configured with instance_count = 0 spins up instance at apply then destroys #38837

mentasm · 2024-08-13T01:04:22Z

Terraform Core Version

1.5.4

AWS Provider Version

4.67.0

Affected Resource(s)

aws_emr_instance_group

Expected Behavior

aws_emr_instance_group configured with instance_count = 0 should create the instance group with no instances actually being created

Actual Behavior

aws_emr_instance_group creates an instance group, but spins up an instance. As soon as the instance finishes creation an autoscaling event destroys it to obey the instance_count=0 configuration. This also causes apply stage to run for much longer than required as it waits for the instance creation and subsequent destroy before achieving its desired state and completing the apply operation.

Relevant Error/Panic Output Snippet

No response

Terraform Configuration Files

resource "aws_emr_cluster" "cluster" {
  name          = "emr-test-cluster"
  ...
}

resource "aws_emr_instance_group" "task_1" {
  cluster_id     = aws_emr_cluster.cluster.id
  instance_count = 0
  instance_type  = "m5.xlarge"
  bid_price = 0.5
  name           = "config_1"
}

resource "aws_emr_instance_group" "task_2" {
  cluster_id     = aws_emr_cluster.cluster.id
  instance_count = 0
  instance_type  = "m5.x2large"
  bid_price = 1.0
  name           = "config_2"
}

Steps to Reproduce

configure emr cluster with a number of aws_emr_instance_group resources with various instance types and instance_count=0
terraform init
terraform apply

Debug Output

No response

Panic Output

No response

Important Factoids

We run emr for structured streaming and clusters are long running. For a number of reasons in this situation we cannot run instance_fleets, and instead run with multiple instance groups of different instance types using spot instances. For nonprod environments we deploy the clusters with just master and core nodes and configure these task instance groups to able to scale on deployment of jobs. This works fine in most instances, but we do see these task groups spinning up task instances at apply (and we pay have to pay for those, minimally admittedly) and occasionally the resize operation takes so long our deployment pipelines timeout.

I have raised this with AWS support and the can see that API calls are being executed as requested

References

No response

Would you like to implement a fix?

No

The text was updated successfully, but these errors were encountered:

github-actions · 2024-08-13T01:04:34Z

Community Note

Voting for Prioritization

Please vote on this issue by adding a 👍 reaction to the original post to help the community and maintainers prioritize this request.
Please see our prioritization guide for information on how we prioritize.
Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request.

Volunteering to Work on This Issue

If you are interested in working on this issue, please leave a comment.
If this would be your first contribution, please review the contribution guide.

mentasm · 2024-08-19T03:47:15Z

Have subsequently tested supplying NULL to the instance group config for instance count and had the same result

mentasm · 2024-08-19T03:50:46Z

Believe that it may be related to the code here:

terraform-provider-aws/internal/service/emr/instance_group.go

Line 178 in 3724def

if v, ok := d.GetOk(names.AttrInstanceCount); ok {

but my go coding is not great

jrutroff2 · 2024-08-23T11:31:29Z

This is a major issue for us as it doubles the emr cluster build time since we have to wait for a task node to build. We do this very often with a high quantity of clusters and it is very painful during maintenance windows. Please escalate this. We can't move past this provider version until this is fixed.

Thanks

ewbankkit · 2024-08-27T14:07:50Z

Relates #26154.

github-actions · 2024-08-27T20:53:34Z

Warning

This issue has been closed, meaning that any additional comments are hard for our team to see. Please assume that the maintainers will not see them.

Ongoing conversations amongst community members are welcome, however, the issue will be locked after 30 days. Moving conversations to another venue, such as the AWS Provider forum, is recommended. If you have additional concerns, please open a new issue, referencing this one where needed.

github-actions · 2024-08-29T22:33:27Z

This functionality has been released in v5.65.0 of the Terraform AWS Provider. Please see the Terraform documentation on provider versioning or reach out if you need any assistance upgrading.

For further feature requests or bug reports with this functionality, please create a new GitHub issue following the template. Thank you!

github-actions · 2024-09-29T02:20:48Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

mentasm added the bug Addresses a defect in current functionality. label Aug 13, 2024

github-actions bot added the service/emr Issues and PRs that pertain to the emr service. label Aug 13, 2024

terraform-aws-provider bot added the needs-triage Waiting for first response or review from a maintainer. label Aug 13, 2024

justinretzolk removed the needs-triage Waiting for first response or review from a maintainer. label Aug 13, 2024

ewbankkit self-assigned this Aug 27, 2024

terraform-aws-provider bot added the prioritized Part of the maintainer teams immediate focus. To be addressed within the current quarter. label Aug 27, 2024

ewbankkit mentioned this issue Aug 27, 2024

Add "io2" as a valid EBS volume type to EMR validation routine #37740

Merged

ewbankkit closed this as completed in #37740 Aug 27, 2024

github-actions bot added this to the v5.65.0 milestone Aug 27, 2024

github-actions bot removed the prioritized Part of the maintainer teams immediate focus. To be addressed within the current quarter. label Aug 29, 2024

github-actions bot locked as resolved and limited conversation to collaborators Sep 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: aws_emr_instance_group configured with instance_count = 0 spins up instance at apply then destroys #38837

[Bug]: aws_emr_instance_group configured with instance_count = 0 spins up instance at apply then destroys #38837

mentasm commented Aug 13, 2024 •

edited

Loading

github-actions bot commented Aug 13, 2024

mentasm commented Aug 19, 2024

mentasm commented Aug 19, 2024

jrutroff2 commented Aug 23, 2024

ewbankkit commented Aug 27, 2024

github-actions bot commented Aug 27, 2024

github-actions bot commented Aug 29, 2024

github-actions bot commented Sep 29, 2024

[Bug]: aws_emr_instance_group configured with instance_count = 0 spins up instance at apply then destroys #38837

[Bug]: aws_emr_instance_group configured with instance_count = 0 spins up instance at apply then destroys #38837

Comments

mentasm commented Aug 13, 2024 • edited Loading

Terraform Core Version

AWS Provider Version

Affected Resource(s)

Expected Behavior

Actual Behavior

Relevant Error/Panic Output Snippet

Terraform Configuration Files

Steps to Reproduce

Debug Output

Panic Output

Important Factoids

References

Would you like to implement a fix?

github-actions bot commented Aug 13, 2024

Community Note

mentasm commented Aug 19, 2024

mentasm commented Aug 19, 2024

jrutroff2 commented Aug 23, 2024

ewbankkit commented Aug 27, 2024

github-actions bot commented Aug 27, 2024

github-actions bot commented Aug 29, 2024

github-actions bot commented Sep 29, 2024

mentasm commented Aug 13, 2024 •

edited

Loading