K8s HA installation timed out on task "Join master to ControlPlane" #1075
Comments
I would recommend checking once again whether the problem still appears, since the ticket was created a few months ago. It may also be worth adding a pause (60 sec) or a conditional somewhere in the Ansible code, because some tasks may be running too fast or in the wrong order.
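A rough sketch of what the pause/retry idea could look like in Ansible. This is not taken from the epicli code base: the task name comes from this issue, but `kubeadm_join_command` is a hypothetical placeholder variable, and the retry count is an arbitrary choice.

```yaml
# Option 1: fixed pause before the join (60 seconds, as suggested above).
- name: Wait before joining the control plane
  pause:
    seconds: 60

# Option 2: retry the join until it succeeds instead of failing on the first attempt.
- name: Join master to ControlPlane
  command: "{{ kubeadm_join_command }}"  # hypothetical placeholder, not the real epicli variable
  register: join_result
  until: join_result.rc == 0             # stop retrying once the command exits with 0
  retries: 5                             # arbitrary number of attempts
  delay: 60                              # seconds to wait between attempts
```

The retry variant is usually preferable to a fixed pause when the failure is timing-related, since it only waits when the join actually fails. Whether either one addresses the root cause here is not confirmed.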
Now this has been tested with the latest develop code with Kubernetes 1.18.6 and I haven't been able to reproduce the problem yet.
It seems that after upgrading to the newer Kubernetes 1.18 we don't really see that problem anymore. But it's still there in 0.6, which is impacting upgrade testing. I'll take a look at whether it's possible to fix it in 0.6. 👍
The problem still exists, with no real progress here. It's reproducible on AWS / Azure when using Azure DevOps, though it takes a couple of retries to reproduce. Since deploying a cluster takes 1+ hour, it's really annoying to work on.
Is there a workaround? For example, if I run apply again, would it work?
Add information to the changelog's known issues section.
Handled in this PR
Describe the bug
K8s HA installation fails randomly on the task "Join master to ControlPlane" on Azure environments.
To Reproduce
Steps to reproduce the behavior:
epicli apply -f test.yml
Expected behavior
The HA cluster is deployed successfully.
Config files
Configuration that should be included in the yaml file:
Task where the problem appears:
OS (please complete the following information):
Cloud Environment (please complete the following information):
Additional context
Log:
It happens randomly.
On average, one in two HA deployments on Azure fails because of this issue.