
node_pools don't support regional clusters #1300

Closed
Stono opened this issue Apr 6, 2018 · 6 comments


Stono commented Apr 6, 2018

Hey,
We're trying to rebuild our clusters using the new regional clusters added in 1.9.0; however, we use `google_container_node_pool` to add custom pools to our cluster.

However, node pools do not work against regional clusters:

```
Error: Error applying plan:

1 error(s) occurred:

* module.kubernetes.google_container_node_pool.custom-pool: 1 error(s) occurred:

* google_container_node_pool.custom-pool: Error creating NodePool: googleapi: Error 400: v1 API cannot be used to access GKE regional clusters. See https://goo.gl/uHKp3k for more information., badRequest
```

Proposal
I think the `google_container_node_pool` resource should be changed as follows:

  • Add a `region` parameter with the description "The location of the Kubernetes regional master, for use with regional clusters".
  • Change the `zone` parameter description to "The location of the Kubernetes zonal master, for use with zonal clusters".
  • Make the two parameters mutually exclusive, with `zone` using the existing code path and `region` using the newer v1beta1 API, which handles regional clusters (https://cloud.google.com/kubernetes-engine/docs/reference/api-organization#beta).
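If the proposal were adopted, usage might look like the sketch below. The `region` argument on both resources and their exact interaction are assumptions illustrating the proposal, not the shipped schema:

```hcl
# Hypothetical usage: `region` replaces `zone` when the target
# cluster is regional. The two would be mutually exclusive.
resource "google_container_cluster" "regional" {
  name               = "my-regional-cluster"
  region             = "europe-west1"
  initial_node_count = 1
}

resource "google_container_node_pool" "custom-pool" {
  name       = "custom-pool"
  region     = "europe-west1" # mutually exclusive with `zone`
  cluster    = "${google_container_cluster.regional.name}"
  node_count = 3
}
```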

Associated issue: #829
Associated PR for master regional clusters: #1181


Stono commented Apr 6, 2018

@danawillow @ashish-amarnath I tried!

@darrenhaken (Contributor) commented:

I'll take this up @Stono


Stono commented Apr 6, 2018

Thank you oh random stranger @darrenhaken who definitely doesn't work in my office :)

@morgante commented:

I'm also encountering this bug.

nat-henderson pushed a commit that referenced this issue Apr 25, 2018
This PR also switched us to using the beta API in all cases, which had a side effect worth noting; the note is included here for posterity.

=====
The problem is, we add a GPU, and as per the docs, GKE adds a taint to
the node pool saying "don't schedule here unless you tolerate GPUs",
which is pretty sensible.

Terraform doesn't know about that, because it didn't ask for the taint
to be added. So after apply, on refresh, it sees the state of the world
(1 taint) and the state of the config (0 taints) and wants to set the
world equal to the config. This introduces a diff, which makes the test
fail - tests fail if there's a diff after they run.

Taints are a beta feature, though. :) And since the config doesn't
contain any taints, terraform didn't see any beta features in that node
pool ... so it used to send the request to the v1 API. And since the v1
API didn't return anything about taints (since they're a beta feature),
terraform happily checked the state of the world (0 taints I know about)
vs the config (0 taints), and all was well.

This PR makes every node pool refresh request hit the beta API. So now
terraform finds out about the taints (which were always there) and the
test fails (which it always should have done).

The solution is probably to write a little bit of code which suppresses
the report of the diff of any taint with value 'nvidia.com/gpu', but
only if GPUs are enabled. I think that's something that can be done.
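The suppression idea in the last paragraph could be sketched as a small predicate in the provider's language (Go). The function name and signature here are illustrative, not the provider's actual `DiffSuppressFunc` hook, which additionally receives the resource data:

```go
package main

import "fmt"

// suppressGPUTaintDiff reports whether a refresh-time taint diff should
// be hidden from the plan. Illustrative sketch: ignore the GKE-added
// "nvidia.com/gpu" taint, but only when the node pool has GPUs enabled,
// so genuine out-of-band taint changes are still surfaced.
func suppressGPUTaintDiff(taintKey string, gpusEnabled bool) bool {
	return gpusEnabled && taintKey == "nvidia.com/gpu"
}

func main() {
	fmt.Println(suppressGPUTaintDiff("nvidia.com/gpu", true))  // suppress
	fmt.Println(suppressGPUTaintDiff("nvidia.com/gpu", false)) // report
	fmt.Println(suppressGPUTaintDiff("dedicated", true))       // report
}
```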

Stono commented May 9, 2018

This works now, so closing the issue :-) ta

@Stono Stono closed this as completed May 9, 2018
chrisst pushed a commit to chrisst/terraform-provider-google that referenced this issue Nov 9, 2018
…#1320)


ghost commented Nov 18, 2018

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. If you feel I made an error 🤖 🙉 , please reach out to my human friends 👉 [email protected]. Thanks!

@ghost ghost locked and limited conversation to collaborators Nov 18, 2018