
Improve the failover of galera service #289

Merged

Conversation

@dciabrin (Contributor) commented Nov 20, 2024

When a galera node is in the process of shutting down (e.g. during a rolling restart caused by a minor update), the node is unable to serve SQL queries, yet it is still connected to clients. This confuses clients, who get an unexpected SQL status [1], and prevents them from retrying their queries, causing unexpected errors down the road.

Improve the pod stop pre-hook to fail over the active endpoint to another pod before shutting down the galera server, and kill connected clients to force them to reconnect to the new active endpoint. At that stage, the galera server can be safely shut down, as no client will see its WSREP state update.

Also update the failover script: 1) when no endpoint is available, ensure no traffic is going through any pod; 2) do not trigger an endpoint failover as long as the current endpoint targets a galera node that is still part of the primary partition (i.e. it is still able to serve traffic).

[1] 'WSREP has not yet prepared node for application use'

Jira: OSPRH-11488
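The stop ordering described above can be sketched as follows. This is an illustrative outline only, not the operator's actual hook: function names and log text are made up, and the real database and service calls are stubbed out.

```shell
#!/bin/bash
# Illustrative sketch of the pre-stop ordering (not the actual hook code):
# 1) move the active endpoint to a surviving pod,
# 2) kill the remaining client connections so they reconnect to it,
# 3) only then allow the galera server to shut down.

failover_endpoint() {
    # stub: in the real hook this retargets the service to another pod
    echo "step 1: active endpoint moved to another pod"
}

kill_clients() {
    # stub: in the real hook this runs mysqladmin kill on client ids
    echo "step 2: client connections killed, clients reconnect"
}

pre_stop() {
    failover_endpoint || return 1
    kill_clients
    echo "step 3: galera server may now shut down safely"
}

pre_stop
```

The key point is the ordering: clients are disconnected only after the endpoint already targets another pod, so their automatic reconnection lands on a healthy node.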

@openshift-ci openshift-ci bot requested review from dprince and viroel November 20, 2024 11:55
@dciabrin dciabrin requested review from abays, stuggi and gibizer and removed request for dprince and viroel November 20, 2024 14:14
@bogdando bogdando self-requested a review November 20, 2024 15:12
@stuggi (Contributor) commented Nov 21, 2024

looks good to me

@gibizer (Contributor) commented Nov 21, 2024

“and kill connected clients to force them to reconnect to the new active endpoint.”

Can we make this graceful? Is there a way to handle in-progress queries: kill only idle client connections and delay the pod stop until all in-progress queries have finished, so that every client is disconnected gracefully?

# filter out system and localhost connections, only consider clients with a port in the host field
# from that point, clients will automatically reconnect to another node
CLIENTS=$(mysql -uroot -p${DB_ROOT_PASSWORD} -nN -e "select id from information_schema.processlist where host like '%:%';")
echo -n "$CLIENTS" | tr '\n' ',' | xargs mysqladmin -uroot -p${DB_ROOT_PASSWORD} kill
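The id-joining in that snippet can be exercised without a live server by simulating the output of `mysql -nN` (one connection id per line; the ids below are made up for illustration):

```shell
# Simulated `mysql -nN` output of the processlist query:
# one connection id per line (ids are made up for illustration).
CLIENTS="101
102
103"
# Join the ids with commas, the argument format `mysqladmin ... kill` expects.
ID_LIST=$(echo -n "$CLIENTS" | tr '\n' ',')
echo "$ID_LIST"   # 101,102,103
```

`echo -n` avoids a trailing newline, so `tr` does not leave a trailing comma on the list.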
Contributor:

Can we make this graceful by killing each client only after it has finished its in-progress query, if any?

Contributor Author:

Sadly no, there is no option in MySQL to mark a connection for closing once it has finished its processing.

Contributor:

I ran a couple of rollouts back and forth while polling the keystone API, and I was not able to hit a case where an in-progress query was interrupted. In fact, I could not reproduce any API outage during a galera rollout after this fix. So I'm OK to merge this. Thanks @dciabrin for fixing this.

@dciabrin (Contributor Author) commented:

Rebasing this PR now to fix the kuttl test failure:

pkg/openstack/galera.go:172:23: cannot use &instance.Spec.NodeSelector (value of type *map[string]string) as map[string]string value in assignment
make: *** [Makefile:147: vet] Error 1
{"component":"entrypoint","error":"wrapped process failed: exit status 2","file":"sigs.k8s.io/prow/pkg/entrypoint/run.go:84","func":"sigs.k8s.io/prow/pkg/entrypoint.Options.internalRun","level":"error","msg":"Error executing test process","severity":"error","time":"2024-11-21T14:06:53Z"} 

@openshift-ci openshift-ci bot added the lgtm label Nov 21, 2024

openshift-ci bot commented Nov 21, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dciabrin, gibizer

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@stuggi (Contributor) commented Nov 21, 2024

/cherry-pick 18.0-fr1

@openshift-cherrypick-robot

@stuggi: once the present PR merges, I will cherry-pick it on top of 18.0-fr1 in a new PR and assign it to you.

In response to this:

/cherry-pick 18.0-fr1

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@openshift-merge-bot openshift-merge-bot bot merged commit c6bef3c into openstack-k8s-operators:main Nov 21, 2024
6 checks passed
@openshift-cherrypick-robot

@stuggi: new pull request created: #290


# select the first available node in the primary partition to be the failover endpoint
NEW_ENDPOINT=$(echo "$MEMBERS" | grep -v "${PODNAME}" | head -1)
if [ -z "${NEW_ENDPOINT}" ]; then
log "No other available node to become the active endpoint."
@bogdando commented Nov 22, 2024

We should probably return right there with that fact? Unless the intention is to call service_endpoint() with an empty endpoint meaning "block incoming traffic" (in that case, please add a log message stating so).
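A minimal sketch of the clarification being asked for here (the wording and variable use are hypothetical, not the script's actual code): make the empty-endpoint branch state its intent explicitly before falling through.

```shell
# Hypothetical wording: make the "no endpoint" branch explicit about its
# intent before falling through to service_endpoint with an empty value.
NEW_ENDPOINT=""
if [ -z "${NEW_ENDPOINT}" ]; then
    echo "No other available node to become the active endpoint."
    echo "Blocking incoming traffic by configuring an empty endpoint."
fi
```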

NEW_SVC=$(echo "$CURRENT_SVC" | service_endpoint "$NEW_ENDPOINT")
[ $? == 0 ] || return 1

log "Configuring a new active endpoint for service ${SERVICE}: '${CURRENT_ENDPOINT}' -> '${NEW_ENDPOINT}'"


... or this log message would come out looking like "-> ''" (empty new endpoint).

if echo "${STATUS}" | grep -i -q -e 'failover'; then
mysql_probe_state
if [ $? != 0 ]; then
log_error "Could not probe missing mysql information. Aborting"


"... Aborting during failover" would make this message look different from the one logged when mysql was started.
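For illustration, the suggested disambiguation could look like this (the phase variable and exact wording are hypothetical):

```shell
# Hypothetical wording only: include the phase in the abort message so a
# failure during failover reads differently from one at mysql start-up.
phase="failover"
msg="Could not probe missing mysql information. Aborting during ${phase}"
echo "$msg"
```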

@bogdando left a comment

Sorry for being late to the party.
