-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
etcdserver: request timed out, possibly due to connection lost #1059
Comments
Looks like 3.4 with pre-voting is still off in the future? So that leaves "don't flake out the masters to cause so many elections"? |
I'm going to close this issue as it seems to likely be etcd, not installer. If we start running into trouble with this again lets open a BZ against the etcd component. |
If you are still having this issue. Check out this video. I was able to resolve the issue with these instructions. |
This is a stopgap solution until openshift is able to merge the API PR openshift#1059 openshift/api#1059.
This is a stopgap solution until openshift is able to merge the API PR openshift#1059 openshift/api#1059.
Version
Platform (aws|libvirt|openstack):
All.
What happened?
In an e2e-aws run mentioned here:
What you expected to happen?
No errors due to etcd delays.
How to reproduce it (as minimally and precisely as possible)?
There have been a lot of these in CI recently, although I'm not sure what would have changed. AWS has had a number of performance issues for us today though, including slow resource generation. Maybe our CI disks are just running slower than usual or something?
Anything else we need to know?
Details or a similar issue in the etcd logs:
This seems similar to etcd-io/etcd#9464, which talks about ticks for election and pre-voting as potential fixes, and about bumping to 3.4 to get them. Are their plans for bumping the elderly 3.1.14 we use for bootstrap health checks? Or the more respectable 3.3.10 the machine-config operator suggests for the masters? I guess we'd have to bump to 3.4 for pre-voting, since 3.3.10 already contains the backported-to-3.3.x etcd-io/etcd@3282d9070 (which landed in 3.3.3). Or maybe the problem is something else entirely :p.
As a minor pivot, it seems safe enough for us to move up to 3.3.10 to catch up with openshift/machine-config-operator@59f809676.
/kind bug
The text was updated successfully, but these errors were encountered: