-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot join 3rd server managed etcd cluster due to unhealthy cluster
error from etcd
#2533
Comments
Same issue here , it seems to be random as sometime it does'nt happen ( reproducing it using same ansible script ) |
@adi90x can you share any logs from when you've had this occur? In particular if you had k3s service logs from both the first server and the server that failed to join, that would be immensely helpful. |
Hello, Below are log from node trying to join ( using -v 10 ) :
Hope it may help , but there is nothing that seems interesting... And from Working Server it may be more interesting :
So I guess it is because one of my server is publishing is internal ip as the etcd ip ? Regards, |
At this point it is expected that all etcd servers can reach each other at their private IP addresses. |
Any update on this issue ? Did anyone find a way to for etcd to listen on public ip ? |
@adi90x this is not currently possible. It only advertises the private address. More specific to this issue in particular, the member maintenance function should probably check for unhealthy (unreachable) fully-promoted etcd cluster members, and remove them from the cluster if there is not a corresponding Kubernetes cluster member. |
Closing due to age - can reopen if the issue re-emerges |
This comment was marked as resolved.
This comment was marked as resolved.
@dberardo-com open a new issue and fill out the issue template. |
Environmental Info:
K3s Version:
k3s v1.19.3+k3s3 (0e4fbfe)
Node(s) CPU architecture, OS, and Version:
Linux rnd-cloud1-master3 5.4.0-53-generic #59-Ubuntu SMP Wed Oct 21 09:38:44 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
Cluster Configuration:
3 managed etcd servers with agents
Describe the bug:
Cannot join 3rd server to cluster due to
unhealthy cluster
errorSteps To Reproduce:
Unknown - somehow failed to install on the 3rd node; etcd says its in the cluster but kubernetes does not.
Expected behavior:
All servers join the cluster
Actual behavior:
first and second masters have been created successfully but third NO, and I try create it on several VM, but this fall down with the same error:
Additional context / logs:
This is taken from https://rancher-users.slack.com/archives/CGGQEHPPW/p1605341375146500?thread_ts=1605302190.133700&cid=CGGQEHPPW
It looks like etcd currently thinks it has a 3-node cluster with an offline member. Adding a 4th node would break the quorum math, so another member cannot be added until all three nodes are online, or the offline member has been deleted.
The text was updated successfully, but these errors were encountered: