
aws: aws_elasticache_cluster doesn't wait till completed #2732

Closed
mzupan opened this issue Jul 15, 2015 · 11 comments · Fixed by #2842

@mzupan (Contributor) commented Jul 15, 2015

I haven't looked at the AWS library to see whether this is possible, but aws_elasticache_cluster creation (at least for Redis) doesn't wait until a cache node is created, so anything referencing the cache_nodes attribute will fail.

For example:

resource "aws_elasticache_cluster" "logstash-redis" {
  cluster_id = "web-logstash-redis"
  engine = "redis"
  node_type = "cache.m3.medium"
  num_cache_nodes = 1
  parameter_group_name = "default.redis2.8"
  port = 6379

  subnet_group_name = "${aws_elasticache_subnet_group.logstash-redis.name}"
  security_group_ids = ["${aws_security_group.internal-redis.id}"]
}

resource "aws_route53_record" "logstash-redis" {
   zone_id = "${aws_route53_zone.private-zone.id}"
   name = "redis-logstash.${var.env}.urthecast.com"
   type = "CNAME"
   ttl = "30"
   records = ["${aws_elasticache_cluster.logstash-redis.cache_nodes.0.address}"]
}

It fails on the Route 53 record creation:

* Resource 'aws_elasticache_cluster.logstash-redis' does not have attribute 'cache_nodes.0.address' for variable 'aws_elasticache_cluster.logstash-redis.cache_nodes.0.address'

Once the cache node is created, it works fine.

@catsby (Contributor) commented Jul 15, 2015

Hey @mzupan – I'll check this out; however, we've been down this road in #2051 and neither @phinze nor I have been able to reproduce it after several attempts. There is code in the resource that waits for the nodes to become available, which should populate the attribute before you get to this point.
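
Roughly, that wait looks like the sketch below. This is a simplified illustration built on Terraform's helper/resource state waiter and the AWS SDK for Go; the names, timeouts, and package layout are illustrative, not the exact provider source.

package main

import (
    "fmt"
    "log"
    "time"

    "github.com/aws/aws-sdk-go/aws"
    "github.com/aws/aws-sdk-go/aws/session"
    "github.com/aws/aws-sdk-go/service/elasticache"
    "github.com/hashicorp/terraform/helper/resource"
)

// cacheClusterStateRefreshFunc polls DescribeCacheClusters and reports the
// cluster's current status string back to the state waiter.
func cacheClusterStateRefreshFunc(conn *elasticache.ElastiCache, clusterID string) resource.StateRefreshFunc {
    return func() (interface{}, string, error) {
        resp, err := conn.DescribeCacheClusters(&elasticache.DescribeCacheClustersInput{
            CacheClusterId:    aws.String(clusterID),
            ShowCacheNodeInfo: aws.Bool(true), // needed so the cache node addresses get populated
        })
        if err != nil {
            return nil, "", err
        }
        if len(resp.CacheClusters) == 0 {
            return nil, "", fmt.Errorf("cache cluster %q not found", clusterID)
        }
        c := resp.CacheClusters[0]
        log.Printf("[DEBUG] ElastiCache Cluster status: %s", aws.StringValue(c.CacheClusterStatus))
        return c, aws.StringValue(c.CacheClusterStatus), nil
    }
}

// waitForClusterAvailable blocks until the cluster leaves "creating" and
// reaches "available", or the timeout expires.
func waitForClusterAvailable(conn *elasticache.ElastiCache, clusterID string) error {
    stateConf := &resource.StateChangeConf{
        Pending: []string{"creating"},
        Target:  []string{"available"}, // older helper/resource versions took a plain string here
        Refresh: cacheClusterStateRefreshFunc(conn, clusterID),
        Timeout: 15 * time.Minute,
        Delay:   30 * time.Second,
    }
    _, err := stateConf.WaitForState()
    return err
}

func main() {
    conn := elasticache.New(session.Must(session.NewSession()))
    if err := waitForClusterAvailable(conn, "web-logstash-redis"); err != nil {
        log.Fatal(err)
    }
}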

Maybe you can spot something in the logic that we're missing?

@catsby added the bug, waiting-response, and provider/aws labels on Jul 15, 2015
@munhitsu

+1 - I'm having exactly the same issue with Terraform 0.6.0.

@catsby (Contributor) commented Jul 22, 2015

@munhitsu do you have a configuration that reproduces this reliably? We've been unable to reproduce this so far (see #2051)

@munhitsu

Yes, I'm getting it most of the time (just tested on 0.6.1). @catsby, ping me directly for the stack/logs.
Stack cleanup just hides the issue.

@catsby (Contributor) commented Jul 23, 2015

I'm looking for a configuration that reproduces this. Can you reproduce this with the configuration above, or the one I shared in this gist?

I believe this is happening... I just can't track down where; I'm not able to reproduce it even a single time 😦

@catsby (Contributor) commented Jul 24, 2015

> Stack cleanup just hides issue

I'm not sure I follow your meaning here.

I've tried both the config mentioned above and the config I shared in this gist; has anyone managed to reproduce this issue with either?

I've tried several regions as well. I've seen logs that demonstrate the error(s), so I believe this to be happening.

@munhitsu – thanks for the info you shared privately. I tried paring your example back to just the cluster-related things, but still had no success reproducing it. There is an important observation from your logs, though...

[DEBUG] status: available
[DEBUG] status: available

The string [DEBUG] status: only appears in one place in the code, in the ElastiCache Cluster resource, here:

In your log, those are the only two lines of that status output. When I create the cluster, I get this output:

That is about 5 minutes of [DEBUG] status: creating before I reach available. Has anyone noticed that when this fails for them? (This requires running Terraform with TF_LOG=1 to get the verbose output.)

That seems to be a mistake on AWS's side... I'm not sure what else could be happening here; we're reading that value directly from what the API returns.

I pushed a branch, https://github.com/hashicorp/terraform/tree/aws-elasticache-debug, that has extra debugging for the cluster creation and the checking of the nodes. If anyone can reproduce this, please try with that branch if possible and examine the logs. I've changed the output to [DEBUG] ElastiCache Cluster status: to make it clearer what we're checking for.

I'm trying to re-run this with that branch.

@catsby (Contributor) commented Jul 24, 2015

Specific debug additions: 6469c32

@catsby (Contributor) commented Jul 24, 2015

At long (long) last, I think I have some insight here. I'm working on testing my theory and a fix... Thank you all for your help and patience here.

@catsby removed the waiting-response label on Jul 24, 2015
@catsby (Contributor) commented Jul 24, 2015

I think #2842 fixes this. The only way I could reproduce it was when I was creating (or already had) another cluster in the same region. There were bugs in the code: it wasn't searching correctly and wasn't comparing the right cluster information.

Sorry for dragging this on; I really appreciate all the help and patience here. Please let me know if you can check out #2842.
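
For context, the shape of the fix: since DescribeCacheClusters can return more than one cluster in a region, the resource has to select the cluster whose ID actually matches instead of assuming the first result. Below is a minimal illustrative sketch, not the actual #2842 diff; it assumes the same aws-sdk-go imports as the earlier sketch.

// findCluster picks the cluster whose CacheClusterId matches, rather than
// assuming the first element of the DescribeCacheClusters response.
// Illustrative only; see #2842 for the real change.
func findCluster(resp *elasticache.DescribeCacheClustersOutput, clusterID string) (*elasticache.CacheCluster, error) {
    for _, c := range resp.CacheClusters {
        if aws.StringValue(c.CacheClusterId) == clusterID {
            return c, nil
        }
    }
    return nil, fmt.Errorf("cache cluster %q not found in response", clusterID)
}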

@ozbillwang commented Apr 15, 2017

I hit this issue today with Terraform 0.9.1:

Error running plan: 1 error(s) occurred:

* module.redis.aws_route53_record.redis: 1 error(s) occurred:

* module.redis.aws_route53_record.redis: Resource 'aws_elasticache_cluster.redis' not found for variable 'aws_elasticache_cluster.redis.cache_nodes.0.address'

Update:

I found another issue: my cluster_id was more than 20 characters. After I fixed that, the problem above went away. I can reproduce the error above by increasing the length of cluster_id. That's interesting.
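
A simplified, hypothetical sketch of the kind of up-front check that would surface this clearly; the 20-character limit is taken from the observation above, the function name is illustrative, and the current ElastiCache documentation is the authoritative source for the real constraints.

// validateClusterID (assumes "fmt" is imported) rejects an over-long
// cluster_id at plan time instead of letting it fail later with the
// confusing "not found for variable" error.
func validateClusterID(id string) error {
    if len(id) > 20 {
        return fmt.Errorf("cluster_id %q is %d characters; ElastiCache cluster IDs are limited to 20", id, len(id))
    }
    return nil
}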

@ghost commented Apr 13, 2020

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@ghost locked and limited conversation to collaborators on Apr 13, 2020