
Refactor SnapshotsInProgress to Use RepositoryId for Concurrency Logic #75501

Conversation

@original-brownbear (Member) commented Jul 19, 2021

This refactors the snapshots-in-progress logic to work from RepositoryShardId when working out which parts of the repository are in use by writes, for snapshot concurrency safety. This change does not yet go all the way on this topic; there are a number of possible follow-up improvements to simplify the logic that I'd work through over time.
But for now it allows fixing the remaining known issues that snapshot stress testing surfaced, when combined with the fix in #75530.

These issues all stem from the fact that ShardId is not a stable key across multiple snapshots if snapshots are partial. The broken scenarios all look roughly like this:

  • snapshot-1 for index-A with uuid-A runs and is partial
  • index-A is deleted and re-created, and now has uuid-B
  • snapshot-2 for index-A is started and is queued up behind snapshot-1 for the index
  • snapshot-1 finishes and the logic tries to start the next snapshot for the same shard id
    • this fails because the ShardId is not the same: the index UUIDs differ, and only index name + shard number line up
    • this change fixes all these spots by always taking the round trip via RepositoryShardId (see the sketch below)
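
To make the key-stability point concrete, here is a self-contained sketch with deliberately simplified stand-in types (not the actual Elasticsearch ShardId/RepositoryShardId classes, which carry more fields):

```java
// Simplified stand-ins: the real classes carry more state, but the identity
// semantics contrasted here are the point.
record Index(String name, String uuid) {}

// Cluster-level shard identity: includes the live index UUID.
record ShardId(Index index, int shard) {}

// Repository-level shard identity: keyed by the name the index is tracked
// under in the repository, independent of the live index UUID.
record RepositoryShardId(String indexName, int shard) {}

public class StableKeyDemo {
    public static void main(String[] args) {
        Index beforeRecreate = new Index("index-A", "uuid-A");
        Index afterRecreate = new Index("index-A", "uuid-B");

        // snapshot-1 tracked the shard under the old UUID, snapshot-2 under the new one:
        System.out.println(new ShardId(beforeRecreate, 0).equals(new ShardId(afterRecreate, 0)));
        // -> false: "start the next snapshot for this shard" fails to find the successor

        // Keyed at the repository level, both operations land on the same key:
        System.out.println(new RepositoryShardId("index-A", 0).equals(new RepositoryShardId("index-A", 0)));
        // -> true: the queued snapshot-2 is found when snapshot-1 finishes
    }
}
```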

Planned follow-ups here are:

  • dry up the logic shared between cloning and snapshotting, since both now run essentially the same code in many state-machine steps
  • serialize snapshots-in-progress efficiently instead of re-computing the index and by-repository-shard-id lookups in the constructor every time
    • to that end, refactor the snapshots-in-progress logic away from maps keyed by shard id in almost all spots, keeping just an index-name-to-Index map to work out what exactly is being snapshotted
  • refactor snapshots-in-progress into a map of lists of operations keyed by repository shard id, instead of the list of maps it currently is, to make the concurrency simpler and more obviously correct (see the sketch below)
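
As a rough illustration of that last follow-up, a hypothetical sketch (stand-in types, not the real SnapshotsInProgress API) contrasting the current list-of-maps shape with the proposed map-of-lists shape:

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

record RepositoryShardId(String indexName, int shard) {}
record ShardOp(String snapshot, String state) {}

public class LayoutDemo {
    // Current shape (simplified): one shard-status map per in-progress entry.
    // Answering "what runs on this repository shard, and what is queued behind
    // it?" means looping over every entry and re-deriving repository-level keys.
    static List<Map<Object /* ShardId */, ShardOp>> currentShape = List.of();

    public static void main(String[] args) {
        // Proposed shape: all operations for a repository shard in one ordered
        // collection -- the head is running, the rest are queued in order.
        Map<RepositoryShardId, Deque<ShardOp>> byRepoShard = new HashMap<>();
        RepositoryShardId shard = new RepositoryShardId("index-A", 0);
        byRepoShard.computeIfAbsent(shard, k -> new ArrayDeque<>())
            .add(new ShardOp("snapshot-1", "STARTED"));
        byRepoShard.get(shard).add(new ShardOp("snapshot-2", "QUEUED"));

        // When snapshot-1 completes, the successor is a single lookup away,
        // regardless of any index UUID change in between:
        byRepoShard.get(shard).poll();
        System.out.println("next up: " + byRepoShard.get(shard).peek());
    }
}
```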

closes #75423

relates to #75339 (this should also fix it, but I have to verify by testing with a backport to 7.x)

@original-brownbear original-brownbear marked this pull request as ready for review July 21, 2021 10:12
@elasticmachine elasticmachine added the Team:Distributed (Obsolete) label Jul 21, 2021
@elasticmachine (Collaborator)

Pinging @elastic/es-distributed (Team:Distributed)

@DaveCTurner (Contributor)

I rebased the stress tests on top of this branch (see 3ea18f80f99ec5f420e8f91f921d9eb0a862c850) and still saw the same Missing assignment assertion trip 😢

testoutput-1626863771.tar.gz

@original-brownbear (Member, Author) commented Jul 21, 2021

@DaveCTurner that one should be fixed by #75530, I believe. The index lookup there is broken and incorrectly treats a re-created index with a changed UUID as the deleted index still existing.

Still seeing a different exception locally, though, about unknown completion listeners ... looking into that now as well.
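
For illustration only, a minimal self-contained sketch of the kind of lookup bug being described (hypothetical names; not the actual code changed in #75530):

```java
import java.util.Map;

record Index(String name, String uuid) {}

public class IndexLookupDemo {
    public static void main(String[] args) {
        // The partial snapshot was taken of index-A with uuid-A...
        Index snapshotted = new Index("index-A", "uuid-A");
        // ...but index-A has since been deleted and re-created with uuid-B:
        Map<String, Index> clusterIndices = Map.of("index-A", new Index("index-A", "uuid-B"));

        // Name-only lookup: finds the re-created index and wrongly concludes
        // the snapshotted index is still there.
        boolean nameOnly = clusterIndices.containsKey(snapshotted.name());

        // UUID-aware lookup: the snapshotted index only counts as existing if
        // the UUID also matches.
        Index current = clusterIndices.get(snapshotted.name());
        boolean uuidAware = current != null && current.uuid().equals(snapshotted.uuid());

        System.out.println(nameOnly + " vs " + uuidAware); // true vs false
    }
}
```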

@DaveCTurner (Contributor)

Hmm, I merged #75530 into my branch (see 8fe45fa), and still the Missing assignment assertion trips:

testoutput-1626866221.tar.gz

@original-brownbear (Member, Author)

The above was caused by 064f45a.

Also, I unfortunately found #75598 as a result of going through this.

@original-brownbear (Member, Author)

@DaveCTurner to me it looks like this branch fixes the stress test now, with the recent changes to master merged in. I couldn't reproduce any failures that didn't seem to be issues with the test itself (trying to clone indices that weren't successfully snapshotted).
Maybe take a look when you have a chance :)? This PR is the minimal changeset I can see that deals with the various issues around index UUIDs changing across snapshots, but there's more cleanup planned for this code. We really should get away from the list-of-maps data structure we're currently using, to make it obvious which operation touches which shard at any point without doing a bunch of looping to figure it out.

@DaveCTurner (Contributor) left a review comment

Sure, this LGTM (and to a fixed version of the stress tests too, it seems)

@original-brownbear (Member, Author)

Thanks David!

@original-brownbear original-brownbear merged commit 6592cfe into elastic:master Jul 30, 2021
@original-brownbear original-brownbear deleted the refactor-towards-repository-id branch July 30, 2021 15:46
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Aug 15, 2021
original-brownbear added a commit that referenced this pull request Aug 16, 2021
original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Aug 16, 2021
original-brownbear added a commit that referenced this pull request Aug 16, 2021
@original-brownbear original-brownbear restored the refactor-towards-repository-id branch April 18, 2023 20:55
Labels
>bug, :Distributed Coordination/Snapshot/Restore, >refactoring, Team:Distributed (Obsolete), v7.14.1, v7.15.0, v8.0.0-alpha1
Development

Successfully merging this pull request may close these issues.

AssertionError: Missing assignment for [[index-0][4]] during snapshot