Simplify Snapshot Initialization #51256
Conversation
We were loading `RepositoryData` twice during snapshot initialization, redundantly checking whether a snapshot with the same name already existed. The first existence check is somewhat redundant because a snapshot could still be created between loading `RepositoryData` and updating the cluster state with the `INIT`-state snapshot entry. It is also much safer to do the subsequent checks for index existence in the repository and for the presence of old-version snapshots once the `INIT`-state entry prevents further snapshots from being created concurrently. While the current state of things will never lead to corruption on a concurrent snapshot creation, it could result in an (unlikely) situation where all the snapshot's work is done on the data nodes, only to find out during snapshot finalization that the repository generation was off, failing there and leaving a bunch of dead data in the repository that won't be reused by a subsequent snapshot (because the shard generation was never referenced due to the failed finalization). Note: this is a step on the way to parallel repository operations, by making snapshot-related and repository-related cluster state changes more tightly correlated.
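To make the intended ordering concrete, here is a minimal sketch of the flow described above. The types (`SnapshotInitSketch`, `SnapshotEntry`, `RepositoryView`) are hypothetical stand-ins for `SnapshotsInProgress.Entry` and `RepositoryData`, not the actual Elasticsearch APIs; the point is only that the concurrent-snapshot check runs atomically as part of the cluster state update, while the repository-level checks run once the `INIT` entry is already in place.

import java.util.List;

final class SnapshotInitSketch {

    // Stand-in for the in-progress snapshot entry kept in the cluster state.
    record SnapshotEntry(String repository, String snapshot, String state) {}

    // Stand-in for the repository metadata normally loaded via RepositoryData.
    interface RepositoryView {
        boolean hasSnapshot(String snapshotName);
        boolean hasOldVersionSnapshots();
    }

    // Step 1: executed inside the cluster state update task, so it is atomic with
    // respect to any other snapshot creation.
    static SnapshotEntry addInitEntry(List<SnapshotEntry> inProgress, String repository, String snapshot) {
        if (inProgress.isEmpty() == false) {
            throw new IllegalStateException("a snapshot is already running");
        }
        return new SnapshotEntry(repository, snapshot, "INIT");
    }

    // Step 2: only after the INIT entry exists is the repository data loaded (once)
    // and the remaining checks performed; no other snapshot can start in between.
    static void validateAgainstRepository(SnapshotEntry entry, RepositoryView repositoryData) {
        if (repositoryData.hasSnapshot(entry.snapshot())) {
            throw new IllegalStateException("snapshot [" + entry.snapshot() + "] already exists in the repository");
        }
        if (repositoryData.hasOldVersionSnapshots()) {
            throw new IllegalStateException("repository contains old-version snapshots");
        }
    }
}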
Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)
LGTM, makes sense. I only left very minor comments.
"cannot snapshot while a repository cleanup is in-progress in [" + repositoryCleanupInProgress + "]"); | ||
} | ||
SnapshotsInProgress snapshots = currentState.custom(SnapshotsInProgress.TYPE); | ||
if (snapshots == null || snapshots.entries().isEmpty()) { |
I find this a bit more readable:
if (snapshots != null && snapshots.entries().isEmpty() == false) {
    throw new ConcurrentSnapshotExecutionException(repositoryName, snapshotName, " a snapshot is already running");
}
// Store newSnapshot here to be processed in clusterStateProcessed
...
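Inverting the check turns it into a guard clause: the ConcurrentSnapshotExecutionException is thrown up front, and the rest of the update task (storing newSnapshot for clusterStateProcessed) no longer needs to sit inside the if-block, which is presumably what makes this form read better.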
new Snapshot(repositoryName, snapshotId),
request.includeGlobalState(), request.partial(),
State.INIT,
Collections.emptyList(),
Maybe add a comment on why the list is empty here? Something like
// list of snapshot indices will be resolved later
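For illustration, here is how that suggested comment might sit next to the empty list. InitSnapshotEntry is a hypothetical stand-in for SnapshotsInProgress.Entry, not the actual class or constructor signature:

import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for the real in-progress entry type.
record InitSnapshotEntry(String repository, String snapshot, boolean includeGlobalState,
                         boolean partial, String state, List<String> indices) {

    static InitSnapshotEntry of(String repository, String snapshot,
                                boolean includeGlobalState, boolean partial) {
        return new InitSnapshotEntry(repository, snapshot, includeGlobalState, partial,
            "INIT",
            // list of snapshot indices will be resolved later
            Collections.emptyList());
    }
}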
Thanks Tanguy!
BwC should not be a concern here since the init stage only has meaning for a single master node: any init-stage snapshot is removed on master fail-over, so creating the placeholder entry with repo generation `-2` makes no difference here.
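As a small illustration of that placeholder, a sketch follows; the constant name is an assumption for the sketch, not necessarily the identifier used in the codebase, and only the value `-2` comes from the note above.

// Sketch only: the constant name is assumed for illustration.
final class RepoGenSketch {
    // Placeholder generation used for the INIT-state entry before the real
    // repository generation is known; only meaningful on the current master,
    // since INIT entries are dropped on master fail-over.
    static final long PLACEHOLDER_REPO_GEN = -2L;
}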