
Improvements for etcd liveness probes #2567

Closed
randomvariable opened this issue Sep 10, 2021 · 25 comments · Fixed by kubernetes/kubernetes#110744
Labels: area/etcd, kind/bug, kind/feature, lifecycle/active, priority/important-longterm

Comments

@randomvariable
Member

randomvariable commented Sep 10, 2021

What keywords did you search in kubeadm issues before filing this one?

This is related to kubernetes/kubernetes#96886
and etcd-io/etcd#13340

Is this a BUG REPORT or FEATURE REQUEST?

FEATURE REQUEST

Versions

kubeadm version (use kubeadm version): v1.22, etcd v3.5.0

What happened?

Under certain cluster conditions, such as an entire cluster being powered off and on again, you may not want etcd pods restarted while no raft leader is present, since leader election is still taking place. There are more details in etcd-io/etcd#13340, where we have requested either that a lightweight /ready endpoint be added to etcd or that /health accept additional query parameters allowing us to relax the constraints.

Once that's in place, downstream consumers can use the JSON patch method to patch the etcd static pod (e.g. kubernetes-sigs/cluster-api#4874).
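
As a rough illustration (not a confirmed implementation), such a patch could look like the sketch below. The file name etcd+json.yaml, the container index, and the /health?serializable=true value are assumptions for the example; the query parameter is the form this thread later settles on:

    # etcd+json.yaml -- hypothetical patch file for kubeadm's --patches directory
    # (an RFC 6902 JSON patch written as YAML). It assumes the etcd container is
    # the first container in the static pod and that the relaxed health check is
    # exposed as /health?serializable=true.
    - op: replace
      path: /spec/containers/0/livenessProbe/httpGet/path
      value: /health?serializable=true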

However, we may also want to change the defaults for kubeadm, but we should first do some modelling of which state transitions and cluster conditions we care about.

Additionally, I think we will also want to relax the consistency constraints as part of learner mode adoption.

This is mostly a tracking issue for possible changes to etcd that we can consume.

What you expected to happen?

Turned-off clusters can be restarted

How to reproduce it (as minimally and precisely as possible)?

I have a feeling this is also related to kubernetes-sigs/kind#1689, so improvements may be testable there.


TODOs

1.25:

@neolit123 neolit123 added area/etcd kind/feature Categorizes issue or PR as related to a new feature. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. labels Sep 10, 2021
@neolit123 neolit123 added this to the v1.23 milestone Sep 10, 2021
@pacoxu
Member

pacoxu commented Sep 13, 2021

Adding /health?exclude=NORAFTLEADER&consistency=s seems to be the current workaround for kubeadm.

@neolit123
Member

neolit123 commented Sep 13, 2021 via email

@pacoxu
Member

pacoxu commented Sep 13, 2021

How to easily reproduce it with a simple script?

@randomvariable
Member Author

I think starting an HA kind cluster, and then shutting down the containers and restarting them, might be sufficient. I have a feeling it's related to kubernetes-sigs/kind#1689
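
A rough sketch of that reproduction path (the file name and docker filters are assumptions, not a verified recipe):

    # kind-ha.yaml -- hypothetical config for a 3 control-plane kind cluster
    kind: Cluster
    apiVersion: kind.x-k8s.io/v1alpha4
    nodes:
    - role: control-plane
    - role: control-plane
    - role: control-plane
    - role: worker
    # create the cluster:        kind create cluster --config kind-ha.yaml
    # stop all node containers:  docker stop $(docker ps -q --filter "name=control-plane")
    # start them again:          docker start $(docker ps -aq --filter "name=control-plane")
    # then watch whether the etcd static pods get restarted by their liveness probes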

@neolit123
Member

looks like etcd 3.6 will have the missing feature that we wanted.
etcd-io/etcd#13340

we are in code freeze for 1.23, but this seems like something that can be added once 1.24 starts and backported to older releases (assuming it's not a big diff in kubeadm).

@neolit123 neolit123 modified the milestones: v1.23, v1.24 Nov 22, 2021
@palnabarun
Member

palnabarun commented Nov 23, 2021

It also depends on when etcd 3.6 would be released. Does etcd follow a defined release timeline? Unless I am missing something, I couldn't find much by searching the web.

PS: If etcd 3.6 gets released before Code Freeze for Kubernetes 1.24, or whenever the next version of etcd ships the above feature, I would like to work on this.

@neolit123
Member

Etcd does not have a fixed cadence for minors AFAIK.
It took two years between 3.4 and 3.5 apparently.
But prior releases were only one year apart.

Judging from that I don't think this will align with 1.24 k8s.

@palnabarun
Member

Can we ask the etcd team if they would like to cut a patch release with the said feature?

@neolit123
Member

I don't think they will agree to that, but it's worth a try if someone wants to do it.

@ahrtr
Member

ahrtr commented Feb 17, 2022

FYI: backporting the PR to etcd 3.5: etcd-io/etcd#13706

@ahrtr
Member

ahrtr commented Feb 17, 2022

cc @serathius

@neolit123
Member

@ahrtr great. If the etcd backport is accepted we can try backporting a kubeadm etcd bump.

@pacoxu
Member

pacoxu commented Feb 22, 2022

The backport PR to v3.5, etcd-io/etcd#13706, was merged, so now we can wait for etcd v3.5.3.

@ahrtr
Member

ahrtr commented Feb 22, 2022

Yes, the PR was merged, and I just submitted another PR, etcd-io/etcd#13725, to update the 3.5 changelog.

@neolit123
Member

etcd bump to 3.5.3 merged in 1.24 /master.
kubernetes/kubernetes#109471

here are backports for 1.23 and 1.22:
kubernetes/kubernetes#109533
kubernetes/kubernetes#109532

After/if these merge, we would want to backport and enable the new probes conditionally, for kubeadm versions that use 3.5.3.

@neolit123
Member

neolit123 commented Apr 18, 2022

I think what we have to do to enable the new check is the following change in the kubeadm etcd manifest:

    livenessProbe:
      failureThreshold: 8
      httpGet:
        host: 127.0.0.1
-        path: /health
+        path: /health/serializable=true
        port: 2381
        scheme: HTTP
      initialDelaySeconds: 10
      periodSeconds: 10
      timeoutSeconds: 15

but it must be done only for k8s control plane version >= 1.22 (if the above cherry picks merge, that is)

@neolit123
Member

neolit123 commented May 11, 2022

once we add etcd 3.5.3 to the support skew, would anyone like to work on this?

3.5.3 backports:
kubernetes/kubernetes#109471
kubernetes/kubernetes#109532
kubernetes/kubernetes#109533

@neolit123 neolit123 self-assigned this May 16, 2022
@neolit123 neolit123 added the lifecycle/active Indicates that an issue or PR is actively being worked on by a contributor. label May 16, 2022
@neolit123
Member

1.25 PR is here:
kubernetes/kubernetes#110072

@neolit123 neolit123 added the kind/bug Categorizes issue or PR as related to a bug. label May 16, 2022
@neolit123
Member

neolit123 commented May 16, 2022

@ahrtr

I've tested the new HTTP endpoint with etcd 3.5.3 and it seems to be failing.

EDIT: NVM, as @VirrageS pointed out on the PR, it should be /health?serializable=true, i.e. serializable is a query parameter.
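
For reference, a sketch of the corrected probe stanza with the query-parameter form (port and thresholds copied from the kubeadm manifest shown above):

    livenessProbe:
      failureThreshold: 8
      httpGet:
        host: 127.0.0.1
        # serializable is a query parameter, not a path segment
        path: /health?serializable=true
        port: 2381
        scheme: HTTP
      initialDelaySeconds: 10
      periodSeconds: 10
      timeoutSeconds: 15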

@ahrtr
Member

ahrtr commented May 16, 2022

The reason is that the PR etcd-io/etcd#13525 isn't backported to 3.5.

It's a little subtle. Let's work through an example, assuming an etcd cluster with 3 members. There are two cases here:

  1. In the beginning, all 3 members (or at least 2 members) are working well. Somehow 1 or 2 members go down for whatever reason. The remaining member(s) can still serve serializable requests.
  2. In the beginning, all 3 members (or at least 2 members) are working well. You stop all 3 members and only restart 1 of them afterwards. Since the quorum isn't satisfied, the member will not serve client requests, not even serializable requests, until the quorum is restored. I believe this is your case. Please double-confirm.

Probably we need to backport the PR to 3.5 as well. But it's an enhancement, so I need to discuss it with the other etcd maintainers. Please also let me know whether you really need it, or whether you can adjust/update the K8s test to adapt to case 1.

@neolit123
Member

@ahrtr

The reason is that the PR etcd-io/etcd#13525 isn't backported to 3.5.

The issue that I saw earlier was due to a mistake on my end: using /health/serializable=true instead of /health?serializable=true. After switching to /health?serializable=true, the startup / liveness probes work as expected, at least for a new cluster with healthy members.

In the beginning, all 3 members ( or at least 2 members) are working well. If you stop all the 3 members, and only restart 1 of them afterwards. Since the quorum isn't satisfied, so the member will not serve client requests, even serializable requests, until the quorum is satisfied. I believe this should be your case. Please double confirm.

that sounds like the scenario the OP describes here.

Probably we need to backport the PR to 3.5 as well. But it's an enhancement, I need to discuss with other etcd maintainers. Please also let me know whether you really need it, or probably you adjust/update K8s test to adapt to case 1?

it's unclear to me if these additional changes are needed or not.

@ahrtr
Member

ahrtr commented May 16, 2022

The key point is etcd can't finish the bootstrap/startup process if the quorum isn't satisfied. So it can't serve any client requests, even serializable requests. It is exactly what the PR 13525 fixed.

But once etcd finishes the bootstrap/startup process, it can continue to serve serializable requests even if the quorum isn't satisfied.

@neolit123
Member

neolit123 commented May 16, 2022

The key point is etcd can't finish the bootstrap/startup process if the quorum isn't satisfied. So it can't serve any client requests, even serializable requests. It is exactly what the PR etcd-io/etcd#13525 fixed.

this sounds like a good argument for the backport of etcd-io/etcd#13525

@ahrtr
Member

ahrtr commented May 16, 2022

Let me submit a PR for the backport and get feedback from the other maintainers.

@neolit123
Member

neolit123 commented Jun 23, 2022

@ahrtr
PTAL at this for LGTM:
kubernetes/kubernetes#110744

NOTE: it closes this issue because there isn't much else we can do here.

it's following your recommendation here:
etcd-io/etcd#14048 (comment)
