Move all Snapshot Master Node Steps to SnapshotsService (#56365) #59373

original-brownbear · 2020-07-12T16:55:44Z

This refactoring has three motivations:

Separate all master node steps during snapshot operations from all data node steps in code.
Set up next steps in concurrent repository operations and general improvements by centralizing tracking of each shard's state in the repository in SnapshotsService so that operations for each shard can be linearized efficiently (i.e. without having to inspect the full snapshot state for all shards on every cluster state update, allowing us to track more in memory and only fall back to inspecting the full CS on master failover like we do in the snapshot shards service).
- This PR already contains some best effort examples of this, but obviously this could be way improved upon still (just did not want to do it in this PR for complexity reasons)
Make the SnapshotsService less expensive on the CS thread for large snapshots

backport of #56365

This refactoring has three motivations: 1. Separate all master node steps during snapshot operations from all data node steps in code. 2. Set up next steps in concurrent repository operations and general improvements by centralizing tracking of each shard's state in the repository in `SnapshotsService` so that operations for each shard can be linearized efficiently (i.e. without having to inspect the full snapshot state for all shards on every cluster state update, allowing us to track more in memory and only fall back to inspecting the full CS on master failover like we do in the snapshot shards service). * This PR already contains some best effort examples of this, but obviously this could be way improved upon still (just did not want to do it in this PR for complexity reasons) 3. Make the `SnapshotsService` less expensive on the CS thread for large snapshots

elasticmachine · 2020-07-12T16:55:46Z

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

original-brownbear · 2020-07-12T19:36:39Z

Jenkins run elasticsearch-ci/2 (GCS port infra issue)

original-brownbear added :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs backport labels Jul 12, 2020

elasticmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Jul 12, 2020

original-brownbear merged commit 4833861 into elastic:7.x Jul 12, 2020

original-brownbear deleted the 56365-7.x branch July 12, 2020 20:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move all Snapshot Master Node Steps to SnapshotsService (#56365) #59373

Move all Snapshot Master Node Steps to SnapshotsService (#56365) #59373

original-brownbear commented Jul 12, 2020

elasticmachine commented Jul 12, 2020

original-brownbear commented Jul 12, 2020

Move all Snapshot Master Node Steps to SnapshotsService (#56365) #59373

Move all Snapshot Master Node Steps to SnapshotsService (#56365) #59373

Conversation

original-brownbear commented Jul 12, 2020

elasticmachine commented Jul 12, 2020

original-brownbear commented Jul 12, 2020