-
Notifications
You must be signed in to change notification settings - Fork 312
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Konnectivity implementation timeline #2452
Comments
Hi MattJeanes, AKS bot here 👋 I might be just a bot, but I'm told my suggestions are normally quite good, as such:
|
Hi there @MattJeanes, we are slightly delayed on getting this out. We will update this issue when this is on its way |
That's perfect, thank you very much. Do you have an estimated timeline for this? |
I wish I could be very accurate on this, but at the moment the best I can say is in the short vs medium term for this to be rolled out. We'll update this issue once we know it on the release track. |
This issue has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs within 15 days of this comment. |
Hi, just posting an update on this in case it times out. I'm seeing tunnelfront being used in clusters I'm creating in the West Europe region, v1.21.2. This is causing issues as some reference architectures are now assuming that all clusters have moved to konnectivity. Is there anywhere I can read the official stance on tunnelfront vs aks-link vs konnectivity from the AKS team? |
Err konnectivity just got rolled out to us in canadacentral just doing a I don't even know what to say about this feature being rolled out to us unexpectedly as I don't see canadacentral in the list nor while also breaking our production cluster. konnectivity-agent-6944bbbc-khp7l 1/1 Running 0 9m58s
konnectivity-agent-6944bbbc-q5lxk 1/1 Running 0 9m58s
❯ klo konnectivity-agent-6944bbbc-khp7l
Error from server: Get "https://10.129.0.65:10250/containerLogs/kube-system/konnectivity-agent-6944bbbc-khp7l/konnectivity-agent?follow=true": write unix @->/tunnel-uds/socket: write: broken pipe |
+1 I just enabled --uptime-sla for one of our prod cluster in northeurope, it looks now Konnectivity-Agent is used instead of aks-link. Can we have a documentation on what's the impact? |
Still no fix for this on our end even though we filed a Sev A and affects 300 production users. Even with premiere support and the backend team saying they would try to revert back to aks-link. So far since yesterday nothing has been actioned, and Microsoft even downgraded our Severity without our consent.
Edit -> They have rolled back to aks-link on our cluster and almost instantly everything started to work again. Additionally we now can see the old konnectivity logs and they say:
|
I just go hit by this today. In one of our clusters konnectivity was rolled out. So far the effect I can see is that webhooks from API server to worker nodes fails occasionally.
Scaling up the number of pods behind the webhook service seems to mitigate the issue a little bit. The konnectivity logs are full of
konnectivity version:
|
This issue has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs within 15 days of this comment. |
This issue will now be closed because it hasn't had any activity for 7 days after stale. MattJeanes feel free to comment again on the next 7 days to reopen or open a new issue after that time if you still have a question/issue or suggestion. |
I'd like to reopen this as I'm not sure on the state of rollout still especially after it has been causing issues as above. @justindavies is there any update on the rollout? |
The rollout seems to continue in February: |
Hi all, we stopped the rollout due to discovering some memory leaks in the upstream project. We've worked closely with the upstream team to fix these leaks and are ready to rollout again. You may have already seen a message in your Portal regarding this news. Re-opening this thread to track any issues that arise. |
Thank you for the update, really appreciated! 🙂 |
Here was the communication that was sent to all clusters: In Azure Kubernetes Service (AKS), the tunnel is a secure communication channel that allows the managed control plane to communicate with individual nodes. AKS will start rolling out a new version of our tunnel, Konnectivity, to all clusters over the next 3 weeks and clusters will automatically switch to using Konnectivity for tunnel communication. Check and confirm that your networking rules allow the required ports, FQDNs, and IPs and that your networking firewall doesn’t block or rewrite the Application-Layer Protocol Negotiation (ALPN) TLS extension. If your cluster doesn’t meet these networking rules, then it’s in an unsupported state and won’t be able to receive this update. If your cluster is in an unsupported state, AKS will roll the tunnel back to the previous version to prevent disruptions of service. Please correct the networking rules as soon as possible to bring your cluster to a supported state. |
Hi @alvinli222, do you know if the Konnectivity Service was rolled out to UK West? We deployed an AKS cluster in UK south, and it automatically set up Konnectivity-Agent Pods. But, when we deployed the same service to UK west, it did not deploy any Konnectivity-Agents and instead deployed a tunnefront pod. Many thanks for any information you can provide. Edit: We have also noted, that the deployments we now make to UK South are not setting up Konnectivity-Agent Pods either, so we are not too sure what is going on here. There does not seem to be an obvious Terraform or Azure option to change what is done. |
@jackmcc08 @MattJeanes Any luck with Konnectivity? I have various clusters with different workloads and in various data centers and i have yet to see Konnectivity in any of the clusters, despite AKS team reporting that the rollout is complete. @alvinli222 I haven't modifying anything network wise on these clusters, they use kubenet and have nginx ingress deployed in them. Is there any guidance from Microsoft on why this might not be rolling out to clusters? |
@jackmcc08 @arsnyder16 @MattJeanes Konnectivity is now available in ALL Azure Public Regions. So simply doing an upgrade or create option will instantiate Konnectivity. |
@miwithro Thanks! I am seeing it now, i don't think i have seen this mentioned anywhere that we need to roll the cluster like this |
@arsnyder16 yes we will update the Release Notes to reflect this. |
Hi folks, additionally, you can submit an |
@arsnyder16 sorry for not responding sooner, got sidetracked :) yes, we are now seeing konnectivity agents in our clusters. I assume as per @miwithro comments this has now been implemented. |
@alvinli222 If I run that command in Azure Cloud Shell I get this error:
If I change
Are you testing with an unreleased version of az or something? I was thinking you meant running any az aks update command would update Konnectivity, but this did not say anything about Konnectivity in the output from this command. |
Hello guys, Please can i know any advices on the resrarting tunnelfront pods? |
Hi @BrianGuo987 , Upgrade AKS from 1.5 to 1.22 is not supported. Users should not upgrade AKS cross 2+ major version. 1.5 is too old and too differences compared with 1.22. |
This issue has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs within 15 days of this comment. |
I think we can close this now that Konnectivity is widely available, thank you for keeping us posted on the rollout, it has been very much appreciated 🙂 |
What happened:
Release 2021-06-17 states that starting on next weeks release Konnectivity will replace aks-link and tunnelfront for connecting the control plane and nodes, however this does not seem to have happened.
What you expected to happen:
I would have expected this rollout to have completed by now as per the timeline given on the release.
How to reproduce it (as minimally and precisely as possible):
Create a cluster in AKS - West Europe or East US region with Uptime SLA on or off. There should be konnectivity pods but no such pods exist and instead aks-link / tunnelfront exist in kube-system.
Anything else we need to know?:
I have a ticket open with Azure support who tell me that the konnectivity implementation should resolve issues we are seeing with the aks-link / tunnelfront tunnels dropping occasionally resulting in AggregatedAPIDown/AggregatedAPIErrors alerts firing coming from our Prometheus instances.
I am told this deployment has been delayed so I am raising this issue for myself and others to publicly track the implementation progress as I for example will be reducing our alerting thresholds to compensate for these errors but would like to increase it back up when this deployment is complete.
I would also like to know if this upgrade would require upgrading the Kubernetes cluster or if it will done automatically in the background and if it does need an upgrade which versions have the new konnectivity components.
Environment:
kubectl version
):The text was updated successfully, but these errors were encountered: