Propagate last node to reinitialized routing tables #91549

Conversation

DaveCTurner
Contributor

When closing or opening an index, or restoring a snapshot over a closed index, we reinitialize its routing table from scratch and expect the gateway allocators to select the appropriate node for each shard copy. With this commit we also keep track of the last-allocated node ID for each copy, which makes it more likely that the desired balance of these shards remains unchanged too.

Closes #91472
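
For illustration, a minimal sketch of the idea using hypothetical simplified types (not the real IndexRoutingTable/ShardRouting classes): when the routing table is rebuilt from scratch, the node each copy was last assigned to is carried into the fresh unassigned entries so the allocator can prefer that node again.

import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the actual Elasticsearch implementation.
class LastAllocatedNodeSketch {

    // a copy in the old routing table, currently assigned to a node
    record ShardCopy(int shardId, boolean primary, String currentNodeId) {}

    // a copy in the reinitialized table: unassigned, but remembering its previous node
    record UnassignedCopy(int shardId, boolean primary, String lastAllocatedNodeId) {}

    static List<UnassignedCopy> reinitialize(List<ShardCopy> previousTable) {
        final var fresh = new ArrayList<UnassignedCopy>(previousTable.size());
        for (ShardCopy copy : previousTable) {
            // record where this copy last lived so the gateway allocator can prefer that node
            fresh.add(new UnassignedCopy(copy.shardId(), copy.primary(), copy.currentNodeId()));
        }
        return fresh;
    }
}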

@DaveCTurner DaveCTurner added >non-issue :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v8.6.0 labels Nov 14, 2022
@elasticsearchmachine elasticsearchmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Nov 14, 2022
@elasticsearchmachine
Collaborator

Pinging @elastic/es-distributed (Team:Distributed)

Contributor

@henningandersen henningandersen left a comment

One smaller concern, otherwise this looks good.

}
final var previousNodes = new ArrayList<String>(previousShardRoutingTable.size());
previousNodes.add(primaryNode);
for (final var assignedShard : previousShardRoutingTable.assignedShards()) {
Contributor

This also includes the target of relocations. I wonder if we should only look at active shards, since anything less will not be considered good enough by the gateway allocator anyway?

The problem I see with this is that if a relocation is ongoing, we risk a copy having a last-allocated node ID that is much worse than it could be (i.e., a node that has only just started the recovery)?

Contributor Author

Good point, thanks - see bd12ab9.
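
A rough sketch of that restriction in the same hypothetical style (this is not the code from bd12ab9): only fully active copies contribute a previous node, so an in-flight relocation target is not remembered as a copy's last-allocated node.

import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the code from commit bd12ab9.
class ActiveCopiesOnlySketch {

    record ShardCopy(String currentNodeId, boolean active) {}

    static List<String> previousNodes(String primaryNode, List<ShardCopy> assignedShards) {
        final var previousNodes = new ArrayList<String>(assignedShards.size() + 1);
        previousNodes.add(primaryNode);
        for (ShardCopy copy : assignedShards) {
            if (copy.active()) { // skip relocation targets that have only just started recovering
                previousNodes.add(copy.currentNodeId());
            }
        }
        return previousNodes;
    }
}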

assertThat(shard.unassignedInfo().getReason(), equalTo(expectedUnassignedReason));
final var lastAllocatedNodeId = shard.unassignedInfo().getLastAllocatedNodeId();
if (lastAllocatedNodeId == null) {
// restoring an index may change the number of shards/replicas so no guarantee that lastAllocatedNodeId is populated
Contributor

I think only the number of replicas, not the number of shards, can be changed? That's probably what you meant with shards/replicas, but I think removing "shards/" would be better.

Suggested change
// restoring an index may change the number of shards/replicas so no guarantee that lastAllocatedNodeId is populated
// restoring an index may change the number of replicas so no guarantee that lastAllocatedNodeId is populated

Contributor Author

On the contrary, I don't think there's anything requiring the snapshot to have the same number of shards as the index on top of which it's being restored.

Contributor

Ahh, right, thanks.

Comment on lines +271 to +272
// both original and restored index must have at least one shard tho
assertTrue(foundAnyNodeIds);
Contributor

Can this not go one line up, i.e., we can check this for every shard id?

Contributor Author

Not if the shard count can change in a restore (which AFAIK it can).
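
A hypothetical illustration of the pattern being discussed (not the actual test code): because a restore may change the shard count, an individual copy may legitimately have no last-allocated node ID, so the flag is checked once per index rather than per copy.

import java.util.List;

// Hypothetical illustration, not the actual test.
class FoundAnyNodeIdsPattern {

    record UnassignedCopy(String lastAllocatedNodeId) {}

    static boolean foundAnyNodeIds(List<UnassignedCopy> copies) {
        boolean found = false;
        for (UnassignedCopy copy : copies) {
            if (copy.lastAllocatedNodeId() != null) {
                found = true; // cannot assert per copy: a restore may have added copies
            }
        }
        return found; // asserted once by the caller, since every index has at least one shard
    }
}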

@DaveCTurner DaveCTurner added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Nov 14, 2022
Contributor

@henningandersen henningandersen left a comment

LGTM.

@elasticsearchmachine elasticsearchmachine merged commit 1f72f2e into elastic:main Nov 14, 2022
@DaveCTurner DaveCTurner deleted the 2022-11-14-propagate-last-allocated-id-to-reinitialized-routing-tables branch November 14, 2022 14:40
weizijun added a commit to weizijun/elasticsearch that referenced this pull request Nov 15, 2022
* main: (163 commits)
  [DOCS] Edits frequent items aggregation (elastic#91564)
  Handle providers of optional services in ubermodule classloader (elastic#91217)
  Add `exportDockerImages` lifecycle task for exporting docker tarballs (elastic#91571)
  Fix CSV dependency report output file location in DRA CI job
  Fix variable placeholder for Strings.format calls (elastic#91531)
  Fix output dir creation in ConcatFileTask (elastic#91568)
  Fix declaration of dependencies in DRA snapshots CI job (elastic#91569)
  Upgrade Gradle Enterprise plugin to 3.11.4 (elastic#91435)
  Ingest DateProcessor (small) speedup, optimize collections code in DateFormatter.forPattern (elastic#91521)
  Fix inter project handling of generateDependenciesReport (elastic#91555)
  [Synthetics] Add synthetics-* read to fleet-server (elastic#91391)
  [ML] Copy more settings when creating DF analytics destination index (elastic#91546)
  Reduce CartesianCentroidIT flakiness (elastic#91553)
  Propagate last node to reinitialized routing tables (elastic#91549)
  Forecast write load during rollovers (elastic#91425)
  [DOCS] Warn about potential overhead of named queries (elastic#91512)
  Datastream unavailable exception metadata (elastic#91461)
  Generate docker images and dependency report in DRA ci job (elastic#91545)
  Support cartesian_bounds aggregation on point and shape (elastic#91298)
  Add support for EQL samples queries (elastic#91312)
  ...

# Conflicts:
#	x-pack/plugin/rollup/src/main/java/org/elasticsearch/xpack/downsample/RollupShardIndexer.java
Labels
auto-merge-without-approval  Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!)
:Distributed Coordination/Allocation  All issues relating to the decision making around placing a shard (both master logic & on the nodes)
>non-issue
Team:Distributed (Obsolete)  Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination.
v8.6.0
Development

Successfully merging this pull request may close these issues.

[CI] AwarenessAllocationIT testAwarenessZones failing
4 participants