Enforce higher priority for RepositoriesService ClusterStateApplier #58808
Conversation
This avoids shard allocation failures when the repository instance comes in the same ClusterState update as the shard allocation.
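For illustration, a minimal sketch of the idea behind the change, assuming the ClusterService applier-registration methods (addHighPriorityApplier vs. addStateApplier); the actual wiring in RepositoriesService may differ:

```java
import org.elasticsearch.cluster.service.ClusterService;
import org.elasticsearch.repositories.RepositoriesService;

class ApplierWiringSketch {
    // Sketch only: register RepositoriesService ahead of the normal-priority appliers so the
    // repository instance is already available when shard allocation from the same
    // ClusterState update is applied. Assumes RepositoriesService implements ClusterStateApplier.
    static void register(ClusterService clusterService, RepositoriesService repositoriesService) {
        clusterService.addHighPriorityApplier(repositoriesService); // instead of addStateApplier(...)
    }
}
```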
I've left some minor comments, otherwise looking good.
import static org.hamcrest.Matchers.greaterThan;
import static org.hamcrest.Matchers.is;

@ESIntegTestCase.ClusterScope(scope = TEST, numDataNodes = 0, autoManageMasterNodes = false)
Why autoManageMasterNodes = false?
That was the only scenario where I was able to reproduce the failure consistently. I was missing the gateway.recover_after_data_nodes piece.
.../test/java/org/elasticsearch/xpack/searchablesnapshots/ClusterStateApplierOrderingTests.java (review comments resolved; one marked outdated)
internalCluster().fullRestart();

List<UnassignedInfo.Reason> unassignedReasons = new ArrayList<>();
internalCluster().clusterService().addListener(event -> {
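The listener body is cut off in the hunk above; a hedged guess at what it might collect, using routing-table accessors that are assumptions on my part, could look like this:

```java
// Assumed sketch of the listener body: record why shards in the applied state are unassigned.
internalCluster().clusterService().addListener(event -> {
    for (ShardRouting shardRouting : event.state().getRoutingNodes().unassigned()) {
        unassignedReasons.add(shardRouting.unassignedInfo().getReason());
    }
});
```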
Isn't this racing against state recovery and the actual allocation taking place? We might be adding the listener too late. Perhaps we should delay state recovery (for example by setting gateway.recover_after_data_nodes to 3 on restart, and starting up a third data node after the listener is registered (on the master)).
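A hedged sketch of that suggestion, assuming the usual ESIntegTestCase / InternalTestCluster test-framework calls (restart callback, gateway setting, master-side listener registration); the real test may be structured differently:

```java
// Inside an ESIntegTestCase-derived test method.
// Keep state recovery on hold across the restart until a third data node joins.
internalCluster().fullRestart(new InternalTestCluster.RestartCallback() {
    @Override
    public Settings onNodeStopped(String nodeName) {
        return Settings.builder()
            .put(GatewayService.RECOVER_AFTER_DATA_NODES_SETTING.getKey(), 3)
            .build();
    }
});

// Register the listener on the master while allocation is still blocked.
internalCluster().clusterService(internalCluster().getMasterName()).addListener(event -> {
    // ... collect unassigned reasons ...
});

// Only now let recovery (and allocation) proceed by starting the third data node.
internalCluster().startDataOnlyNode();
```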
I wasn't aware of that setting; I'll use that approach.
LGTM
@ywelsch should we backport this to 7.9?
yes
This avoids shard allocation failures when the repository instance comes in the same ClusterState update as the shard allocation. Backport of elastic#58808