-
Notifications
You must be signed in to change notification settings - Fork 112
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TKG Cluster upgrade from v1.26.8 to v1.27.5 fails #1248
Comments
Hi @returntrip, I implemented a fix in this PR: #1247 You can clone my fork: https://github.com/adambarreiro/terraform-provider-vcd.git, then checkout Then you could try creating a cluster with a specific TKG version, it should now display the OVAs that weren't displayed before. Let me know, thanks in advance! |
Hi @adambarreiro! Sure, I have tried this:
However, I would expect to have same as in GUI: So it seems missing TKG 2.4.0 v1.25.13. I also guess that the (GUI) upgrades logic seems as follow: Supported Upgrade:
Unsupported Upgrade:
Above is considering that I have these templates available in my CSE catalog:
However,
While both GUI and
Some notes from above: TKG 2.4.0 v1.27.5 is not available perhaps because TKG version is 2.4.0 which is below the current TKG version (TKG 2.5.0 v1.26.11) |
Thanks @returntrip for the feedback, really appreciated. |
It started well:
a) WN node was upgraded OK
This is a snippet from the CAPIYAML. check
This is the current version running on the CPN. And here is the documentation about this being the correct version for TKG 2.4.0 While here is the documentation for TKG 2.5.0
See that also the BTW, how are the k8s components version (DNS, etcd) fetched and pupulated? I am just curious. |
Hi @returntrip, It seems to be another bug, the CoreDNS version for TKG 2.4.0 with v1.25.13 K8s is wrong, should be I have reported the same to the Container Service Extension team as I believe it should happen in UI as well, so thanks a lot for that. |
I am a bit confused tho cause the doc for TKG 2.4.0 https://docs.vmware.com/en/VMware-Tanzu-Kubernetes-Grid/2.4/tkg-deploy-mc/mgmt-release-notes.html states: coredns v1.10.1_vmware.7 etcd v3.5.7_vmware.6 So on paper the version is correct. |
Hi @returntrip, you're correct indeed. I have now changed the v2.5.0 to use CoreDNS |
I think the problem is here. In the sense that v1.26.11 has incorrect coredns and etc version there They should (according to this) be: Instead of: There is somethign odd also here So either the documentation is incorrect or both https://github.com/vmware/go-vcloud-director/blob/main/govcd/cse/tkg_versions.json and https://github.com/vmware/cluster-api-provider-cloud-director/blob/main/templates/cluster-template-v1.26.11-tkgv2.5.0-crs.yaml are P.S.: if the documentation is incorrect, then the upgraded path is wrong in some way... |
…cluster (#1247) * Fix Issue #1248 that prevents CSE Kubernetes clusters from being upgraded to an OVA with higher Kubernetes version but same TKG version, and to an OVA with a higher patch version of Kubernetes. * Fix Issue #1248 that prevents CSE Kubernetes clusters from being upgraded to TKG v2.5.0 with Kubernetes v1.26.11 as it performed an invalid upgrade of CoreDNS. * Fix Issue #1252 that prevents reading the SSH Public Key from provisioned CSE Kubernetes clusters. Signed-off-by: abarreiro <[email protected]>
Merged #1248, will release a patch soon. PS: A bit more of detail about the fix, it seems this issue uncovered more wrong things than expected. It seems both the documentation and CSE UI extension have some mistakes. As a workaround, TKG v2.4.0 versions will use a lower CoreDNS version, until things get clarified. CSE team will update the known issues on their docs at some point, with some recommended upgrade paths. Thanks for reporting! |
Terraform Version
Terraform v1.7.5
on linux_amd64
provider registry.terraform.io/vmware/vcd v3.12.0
Affected Resource(s)
Expected Behavior
Upgrade from 'Ubuntu 20.04 and Kubernetes v1.26.8+vmware.1' to 'Ubuntu 20.04 and Kubernetes v1.27.5+vmware.1' succeeds
Actual Behavior
Upgrade fails with error:
Error: Kubernetes cluster update failed: cannot perform an OVA change as the new one 'Ubuntu 20.04 and Kubernetes v1.27.5+vmware.1' has an older TKG/Kubernetes version (v2.4.0/v1.27.5+vmware.1)
│
│ with vcd_cse_kubernetes_cluster.my_cluster,
│ on main.tf line 73, in resource "vcd_cse_kubernetes_cluster" "my_cluster":
│ 73: resource "vcd_cse_kubernetes_cluster" "my_cluster" {
Steps to Reproduce
Please list the steps required to reproduce the issue, for example:
terraform apply
References
Are there any other GitHub issues (open or closed) or Pull Requests that should be linked here? For example:
The text was updated successfully, but these errors were encountered: