Remove Redundant CS Update on Snapshot Finalization #55276
Conversation
This change folds the removal of the in-progress snapshot entry into setting the safe repository generation. Outside of removing an unnecessary cluster state update, this also has the advantage of removing a somewhat inconsistent cluster state where the safe repository generation points at `RepositoryData` that contains a finished snapshot while it is still in-progress in the cluster state, making it easier to reason about the state machine of upcoming concurrent snapshot operations.
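For illustration, a minimal sketch of the idea in plain Java (not the actual Elasticsearch types; `ClusterState`, its fields, and `finalizeSnapshot` are simplified stand-ins): finalization becomes a single cluster state transformation that moves the safe repository generation forward and drops the in-progress entry at the same time, so no intermediate state with a finished-but-still-in-progress snapshot is ever published.

```java
import java.util.function.UnaryOperator;

final class FinalizationSketch {

    // Simplified stand-in for the relevant bits of the cluster state.
    record ClusterState(long safeRepoGeneration, boolean snapshotInProgress) {}

    // One transformation that sets the new safe generation and removes the
    // in-progress snapshot entry, replacing the previous two-step update.
    static UnaryOperator<ClusterState> finalizeSnapshot(long newSafeGeneration) {
        return state -> new ClusterState(newSafeGeneration, false);
    }
}
```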
Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)
```diff
@@ -189,8 +189,7 @@ public void clusterChanged(ClusterChangedEvent event) {
             final RepositoriesMetadata repoMeta =
                 event.state().metadata().custom(RepositoriesMetadata.TYPE);
             final RepositoryMetadata metadata = repoMeta.repository("test-repo");
-            if (metadata.generation() == metadata.pendingGeneration()
-                && metadata.generation() > snapshotEntry.repositoryStateId()) {
+            if (metadata.pendingGeneration() > snapshotEntry.repositoryStateId()) {
```
The concrete situation that this test was covering is technically gone with this change, but I think it's reasonable to keep testing the very similar situation of master fail-over during the repository side of the finalization here now.
Jenkins run elasticsearch-ci/2 (some ML thing)
I've left one comment about naming. I think that technically this change is ok, but I wonder about the separation of SnapshotsService and BlobStoreRepository (which is only one implementation of Repository), where it becomes less and less clear where the responsibilities for each class lie.
```diff
@@ -1301,10 +1304,11 @@ public boolean isReadOnly() {
      * @param repositoryData RepositoryData to write
      * @param expectedGen    expected repository generation at the start of the operation
      * @param writeShardGens whether to write {@link ShardGenerations} to the new {@link RepositoryData} blob
+     * @param stateFilter    filter for the last cluster state update executed by this method
```
It's not a filter, but rather a transformation function. Perhaps adapt the naming?
Renamed :)
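For readers following along, a hedged sketch of what the renamed parameter amounts to (placeholder types and names, not the exact `BlobStoreRepository` signature): the caller supplies a cluster state transformation that is applied in the final update executed by `writeIndexGen`, e.g. removing the in-progress snapshot entry.

```java
import java.util.function.Function;

interface RepositoryDataWriterSketch {

    // Placeholder for org.elasticsearch.cluster.ClusterState.
    final class ClusterState {}

    // The last parameter is a transformation (not a filter/predicate) that is
    // applied to the cluster state in the final update publishing the new
    // repository generation.
    void writeIndexGen(Object repositoryData,
                       long expectedGen,
                       boolean writeShardGens,
                       Function<ClusterState, ClusterState> stateTransformer);
}
```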
+1 in theory. In practice, we're already exposing a lot of the specific internals of the blob store repository (e.g. shard generations are handled in snapshots-in-progress, repo generation, ...) so that ship seems to have sailed already. Also, practically speaking all our repos are eventually backed by a blob store repository for writing. Maybe it would make more sense to eventually adjust the repository interface to be more centered around just writing blobs and move all the state handling out of it completely. But for now I don't see a short way of enabling more concurrent operations without this kind of mixing of the code.
Jenkins run elasticsearch-ci/1 (unrelated reindex failure)
LGTM
Thanks Yannick!
Same as elastic#55276 but for snapshot deletes. This change folds the removal of the snapshot delete state entry into the safe generation step where possible. This means that for repositories that write shard generations, the time the snapshot delete entry stays in the cluster state is shortened considerably, reduced to the time it takes to update the repository metadata. In this case it is fully safe to run other snapshot operations after the metadata update. We cannot do this for repositories that do not write shard generations, so those go through a different path and still submit a separate state update task. Also, this PR fixes a problem with the cooldown period for S3 non-shard-generation repos introduced by elastic#55286: we cannot run the state update outright in the repository because we enforce the cooldown via the listener wrapping. I fixed this by folding the final state update into the listener in this case.
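A hedged sketch of the "fold the final state update into the listener" part (the `Listener` interface and all names below are simplified stand-ins, not the real `ActionListener` API): the state update is submitted only once the wrapped, possibly cooldown-delayed, repository operation has completed, so it cannot bypass the cooldown that the listener wrapping enforces.

```java
import java.util.function.Consumer;

final class CooldownListenerSketch {

    interface Listener<T> {
        void onResponse(T result);
        void onFailure(Exception e);
    }

    // Wraps the original listener so that the final cluster state update only
    // runs after the (possibly delayed) repository operation has finished.
    static <T> Listener<T> foldStateUpdateIntoListener(Listener<T> delegate,
                                                       Consumer<T> submitFinalStateUpdate) {
        return new Listener<>() {
            @Override
            public void onResponse(T result) {
                submitFinalStateUpdate.accept(result);
                delegate.onResponse(result);
            }

            @Override
            public void onFailure(Exception e) {
                delegate.onFailure(e);
            }
        };
    }
}
```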
We do the delete in three steps here:

1. Get the repository data and find the snapshot ids to delete.
2. Put the delete entry in the CS.
3. Run the actual delete.

The test `testConcurrentSnapshotCreateAndDeleteOther` was failing because between `1.` and `2.` a full snapshot completed, moving the repository generation ahead by 1, which we chose to fail on because we expect the repository generation from step `1.` to still be there in step `3.`. In the past, using the repository generation from step `1.` made sense as a safety measure because any rapid increase in repository generation could have spelled trouble on eventually consistent blob stores. Nowadays it is simply needless to fail here, and we can rely on the generation we read from the repository in step `3.` to avoid ever passing a broken repository generation to the repository when deleting. NOTE: This exception was always a possibility but became massively more likely due to improved/faster snapshot finalization via elastic#55276, so it only showed up now. Closes elastic#55702
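As a rough illustration of the fix (all types and names here are hypothetical stand-ins, not the real snapshot delete code): the repository data read in step 1 is only used to resolve the snapshot ids, while the actual delete re-reads the repository data and uses that generation, so a concurrent snapshot bumping the generation between the steps no longer fails the delete.

```java
import java.util.List;
import java.util.function.Supplier;

final class DeleteFlowSketch {

    record RepositoryData(long generation, List<String> snapshotIds) {}

    interface DeleteExecutor {
        void delete(String snapshotName, long safeGeneration);
    }

    static void deleteSnapshot(String snapshotName,
                               Supplier<RepositoryData> readRepositoryData,
                               DeleteExecutor executor) {
        // Step 1: read the repository data to resolve the snapshot to delete.
        RepositoryData observed = readRepositoryData.get();
        if (observed.snapshotIds().contains(snapshotName) == false) {
            throw new IllegalArgumentException("unknown snapshot " + snapshotName);
        }

        // Step 2: the delete entry would be added to the cluster state here (omitted).

        // Step 3: re-read the repository data and rely on *its* generation rather
        // than the one observed in step 1, so a snapshot that completed in the
        // meantime does not cause a spurious failure.
        RepositoryData current = readRepositoryData.get();
        executor.delete(snapshotName, current.generation());
    }
}
```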
Note: We can do the same for snapshot deletion where it is a much more interesting change because most of the work in the delete (deleting all unreferenced blobs) can happen after the new index-N is written and does not require the delete-in-progress entry in the cluster state (assuming the repo is using shard-generations). I just wanted to start with the snapshot finalization since it's much easier to review and has no BwC implications.