Unable to cleanly delete an AKS managed cluster by capz #1395

LochanRn · 2021-05-27T21:40:48Z

/kind bug

What steps did you take and what happened:
Created an AKS Cluster using clusterctl.

The cluster was successfully created and was running fine.

When I tried to delete the cluster using the command

kubectl delete cluster

The kubernetes service in azure was deleted and all the resources created by it also got deleted successfully, but Azuremanagedcontrolplane, MachinePool, AzuremanagedCluster and Cluster Objects were stuck and not deleted.

What did you expect to happen:
All the cluster objects to be cleaned up successfully on deletion of the cluster

Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]
PFB logs of capz pod and capi controller manager
capz-del.log
capi-cm-del.log

Environment:

cluster-api-provider-azure version: v0.4.15
Kubernetes version: (use kubectl version): 1.20.1
OS (e.g. from /etc/os-release):

The text was updated successfully, but these errors were encountered:

alexeldeib · 2021-05-27T22:28:15Z

There’s a circular dependency with deletion and finalizers.

If you delete only the CAPI cluster object, it will try to delete all the descendant machines/machinepools before deleting the control plane object. With AKS/AzureManagedControlPlane, the last MachinePool will fail to be deleted because an AKS cluster requires at least one agent pool.

The workaround is to manually delete the AzureManagedControlPlane. This will delete the entire AKS cluster, allowing the lingering MachinePool to hit a 404 on deletion, and the finalizer will be removed.

The probable fix is to check the cluster deletion timestamp. If it’s set, then remove the finalizers from all corresponding machine pools and don’t bother trying to delete them. Let AKS clean everything up on cluster deletion.

alexeldeib · 2021-05-28T00:58:36Z

we're also not deleting unmanaged vnets in a user-managed RG since #1009

alexeldeib · 2021-05-28T01:51:25Z

/assign

alexeldeib · 2021-05-28T03:23:16Z

there's also a circular watch dependency from AMCP -> Cluster -> AMCP being initialized and ready. although that one self-resolves, it just is slow.

fixes incoming :)

k8s-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label May 27, 2021

CecileRobertMichon added the area/managedclusters Issues related to managed AKS clusters created through the CAPZ ManagedCluster Type label May 27, 2021

k8s-ci-robot assigned alexeldeib May 28, 2021

alexeldeib mentioned this issue May 28, 2021

🐛 fix deletion, speed up creation for aks clusters #1397

Merged

3 tasks

k8s-ci-robot closed this as completed in #1397 Jun 1, 2021

alexeldeib mentioned this issue Jun 16, 2021

e2e deletion validation for managedclusters #1454

Closed

LochanRn mentioned this issue Aug 5, 2021

REQUEST: New membership for LochanRn kubernetes/org#2875

Closed

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to cleanly delete an AKS managed cluster by capz #1395

Unable to cleanly delete an AKS managed cluster by capz #1395

LochanRn commented May 27, 2021

alexeldeib commented May 27, 2021 •

edited

Loading

alexeldeib commented May 28, 2021

alexeldeib commented May 28, 2021

alexeldeib commented May 28, 2021 •

edited

Loading

Unable to cleanly delete an AKS managed cluster by capz #1395

Unable to cleanly delete an AKS managed cluster by capz #1395

Comments

LochanRn commented May 27, 2021

alexeldeib commented May 27, 2021 • edited Loading

alexeldeib commented May 28, 2021

alexeldeib commented May 28, 2021

alexeldeib commented May 28, 2021 • edited Loading

alexeldeib commented May 27, 2021 •

edited

Loading

alexeldeib commented May 28, 2021 •

edited

Loading