-
Notifications
You must be signed in to change notification settings - Fork 5.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Change In API behavior? #7207
Comments
I, too am getting the same error message but with
This was working fine for me about a month ago. Did some API behavior change? Feels like a race condition. Also only happens to me on destroy. |
Ok cool, then I'm not crazy.. good to know. We deploy pretty much 5x a day, and the first day this seemed to hit us was within the last 72 hours. |
I think this should be raised on terraform or pulumi SDK. Is it this github https://github.com/terraform-providers/terraform-provider-azurerm ? This API spec is for everything from Azure portal to CLI/powershell/go/java/dotnet client. If this got problem everyone is in trouble hence it is unlikely the cause of your issue (personally I just deleted a few resources on portal and see no issue). |
Alrighty, I'll raise it Pulumi first. Honestly I think it's something on API side, just because the portal doesn't throw an error doesn't mean there was a chance on the API spec, but I'll go ahead and do that too and let the finger pointing begin 👇 👈 👉 |
@visokoo are you using terraform or pulumi ? |
This spec basically just dictate the endpoint, and that request JSON should look like this and response JSON would look like that. It really does not care how's backend implementation. The spec is more like Interface in programming language. If you cannot call something it is spec problem. If you call something and sometime it fails at runtime it is more likely implementation problem. In your case (and @visokoo 's) it seems the "startTenantUpdate" is kind of get involved (though no one seems calling it explicitly). So looks to me either SDK calls it unwittingly, or backend is trying to do this to your resource without you calling it. Either is runtime problem. |
Is there a better Microsoft github where I should post this? Ok so if it is a backend runtime problem, where is the appropriate place to report to Microsoft a backend runtime problem? |
Calling Azure customer support is usually the easiest way (if it does not cost anything :-)). If you are using tool provided by Azure team, raise an issue there would also help (e.g. https://github.com/Azure/azure-cli if you are using Azure CLI). Team maintaining the SDK would try to figure out which service to contact. I will try to involve people I know that works on AME terraform project but I am not sure whether they are related or not. @mybayern1974 |
…is terminated It looks like an internal Azure race is causing a cryptic message like ``` Error: Error waiting for removal of Backend Address Pool Association for NIC \"ci-op-8qv3w054-282fe-2222c-bootstrap-nic\" (Resource Group \"ci-op-8qv3w054-282fe-2222c-rg\"): Code=\"OperationNotAllowed\" Message=\"Operation 'startTenantUpdate' is not allowed on VM 'ci-op-8qv3w054-282fe-2222c-bootstrap' since the VM is marked for deletion. You can only retry the Delete operation (or wait for an ongoing one to complete).\" Details=[] ``` when we update the NIC and the machine attached to it. `azurerm_network_interface_backend_address_pool_association.` depends on NIC but is not related to the machine NIC is attached to, the VM might be shutting down while this update is happeninig. This depends_on makes sure that VM is destroyed before we try to delete this association, preventing the race. Similar error seen by people like Azure/azure-rest-api-specs#7207
Calling customer support.. seriously? That's the best you can do? Have you ever tried to call customer support? Edit: sorry, sounded a bit harsh - my point is if you have 3+ people here reporting the issue, cant you escalate on your side, I'm assuming you are employed by Microsoft? |
Pulumi. It honestly feels like an intermittent failure. The destroy doesn't fail every time which leads me to think in the direction of flaky API. @dbilleci-lightstream, if you post on Pulumi's Github issues, can you link back to here? |
Yes, I did post there: pulumi/pulumi-azure#365 He asked for code to reproduce it, but I'll have to write special code to do so as it's in the middle of a big chunk of "other stuff" in our automation pipeline. If you have a simpler example in the meantime, please feel free to post it there! |
@dbilleci-lightstream , @ArcturusZhang can help take a look at the two terraform issues: 2491 and 4330 which are motivated by the issue filed at pulumi issue 365. I cannot tell right now whether it's rest api related before investigating the terraform behavior, so i suggest close the issue here, instead, use the terraform issue to track. We can reopen this issue once we find the root cause is at here. Do you agree with this? If yes we may close this issue here in 3 days. |
Sure that's fine.
…On Wed, Sep 18, 2019, 4:51 AM mybayern1974 ***@***.***> wrote:
@dbilleci-lightstream <https://github.com/dbilleci-lightstream> ,
@ArcturusZhang <https://github.com/ArcturusZhang> can help take a look at
the two terraform issues: 2491
<hashicorp/terraform-provider-azurerm#2491
4330
<hashicorp/terraform-provider-azurerm#4330>
which are motivated by the issue filed at pulumi issue 365
<pulumi/pulumi-azure#365>. I cannot tell right
now whether it's rest api related before investigating the terraform
behavior, so i suggest close the issue here, instead, use the terraform
issue to track. We can reopen this issue once we find the root cause is at
here. Do you agree with this? If yes we may close this issue here in 3 days.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#7207?email_source=notifications&email_token=AKPPZZ7KJIYZOGHMVB77V6TQKII25A5CNFSM4IWKUYV2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD67ZLSA#issuecomment-532649416>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AKPPZZ3NUCGVKJCXRKGNIPTQKII25ANCNFSM4IWKUYVQ>
.
|
Close per the author agrees. I would reopen this once finding the root cause is at rest api here. |
…is terminated It looks like an internal Azure race is causing a cryptic message like ``` Error: Error waiting for removal of Backend Address Pool Association for NIC \"ci-op-8qv3w054-282fe-2222c-bootstrap-nic\" (Resource Group \"ci-op-8qv3w054-282fe-2222c-rg\"): Code=\"OperationNotAllowed\" Message=\"Operation 'startTenantUpdate' is not allowed on VM 'ci-op-8qv3w054-282fe-2222c-bootstrap' since the VM is marked for deletion. You can only retry the Delete operation (or wait for an ongoing one to complete).\" Details=[] ``` when we update the NIC and the machine attached to it. `azurerm_network_interface_backend_address_pool_association.` depends on NIC but is not related to the machine NIC is attached to, the VM might be shutting down while this update is happeninig. This depends_on makes sure that VM is destroyed before we try to delete this association, preventing the race. Similar error seen by people like Azure/azure-rest-api-specs#7207
I'm using PULUMI which is based on Terraform providers for azure. I've checked their bugs page and don't see anything related.
I'm running into an issue where if I create a simple VM which gets linked to an existing load balancer, and then try to destroy it, I get a new error message which I believe is from the API:
Created resources:
Then, on teardown of resources, I get this failure:
I've not changed my infra code, and I got this on 2 different provider versions, one from about 2 months ago (that has been working) and now on the newest provider version today, same error.
Any ideas?
The text was updated successfully, but these errors were encountered: