Allow shards of closed indices to be replicated as regular shards #38024

tlrx · 2019-01-30T11:13:10Z

Note: this pull request is aimed to be merged in the replicated-closed-indices feature branch

This pull request allows shards of indices in CLOSE state to be replicated as normal shards. It changes the MetaDataIndexStateService so that index routing tables of closed indices are kept in cluster state when the index is closed. Index routing tables are modified so that shard routings are reinitialized with the INDEX_CLOSED unassigned information. The IndicesClusterStateService is modified to remove IndexService instances of closed or reopened indices. In combination with the ShardRouting being in INITIALIZING state the shards are recreated on the data nodes to reflect the new state. If the index state is closed, the IndexShard instances will be created using the NoOpEngine as the engine implementation.

This pull request also modifies the RestoreService so that primary terms are increased when indices are restored from snapshots. This is necessary in order to avoid shards to be restored with primary terms that have a value lower than the current primary term (which is increased when the index is closed).

This pull request also mutes two tests that rely on the fact that shard locks are released when an index is closed, which is not the case anymore with replicated closed indices (actually the locks are released but reacquired once the shard is reinitialized after being closed). These tests will be adapted in follow up PRs.

Finally, many things will require to be adapted or improved in follow up PRs (see #33888) but this is the first big step towards replicated closed indices.

elasticmachine · 2019-01-30T11:13:14Z

Pinging @elastic/es-distributed

ywelsch

Left some smaller comments. Looking very good already.

server/src/main/java/org/elasticsearch/snapshots/RestoreService.java

server/src/main/java/org/elasticsearch/cluster/metadata/MetaDataIndexStateService.java

server/src/main/java/org/elasticsearch/cluster/routing/RoutingTable.java

server/src/main/java/org/elasticsearch/cluster/metadata/MetaDataIndexStateService.java

server/src/main/java/org/elasticsearch/index/engine/NoOpEngine.java

ywelsch · 2019-01-31T16:25:57Z

server/src/main/java/org/elasticsearch/indices/cluster/IndicesClusterStateService.java

                assert indexMetaData != null || event.isNewCluster() :
                    "index " + index + " does not exist in the cluster state, it should either " +
                        "have been deleted or the cluster must be new";
-                final AllocatedIndices.IndexRemovalReason reason =
-                    indexMetaData != null && indexMetaData.getState() == IndexMetaData.State.CLOSE ? CLOSED : NO_LONGER_ASSIGNED;


for BWC reasons, I think we will need to keep this.

I guess you're thinking of the search context releasing?

Sorry for not being clear here. I was rather thinking about the case where an older-version master has removed the routing table for a closed index (i.e. old-style closed indices). We still need to handle these here.

server/src/main/java/org/elasticsearch/indices/cluster/IndicesClusterStateService.java

server/src/main/java/org/elasticsearch/search/SearchService.java

server/src/test/java/org/elasticsearch/index/shard/IndexShardIT.java

server/src/test/java/org/elasticsearch/indices/IndicesServiceTests.java

…xisting indices (#38177) When restoring shards of existing indices, the RestoreService also restores the values of primary terms stored in the snapshot index metadata. The primary terms are not updated and could potentially conflict with current index primary terms if the restored primary terms are lower than the existing ones. This situation is likely to happen with replicated closed indices (because primary terms are increased when the index is transitioning from open to closed state, and the snapshotted primary terms are the one at the time the index was opened) (see #38024) and maybe also with CCR. This commit changes the RestoreService so that it updates the primary terms using the maximum value between the snapshotted values and the existing values. Related to #33888

…xisting indices (elastic#38177) When restoring shards of existing indices, the RestoreService also restores the values of primary terms stored in the snapshot index metadata. The primary terms are not updated and could potentially conflict with current index primary terms if the restored primary terms are lower than the existing ones. This situation is likely to happen with replicated closed indices (because primary terms are increased when the index is transitioning from open to closed state, and the snapshotted primary terms are the one at the time the index was opened) (see elastic#38024) and maybe also with CCR. This commit changes the RestoreService so that it updates the primary terms using the maximum value between the snapshotted values and the existing values. Backport elastic/elasticsearch@da6269b Related to elastic#33888

…xisting indices (#38177) When restoring shards of existing indices, the RestoreService also restores the values of primary terms stored in the snapshot index metadata. The primary terms are not updated and could potentially conflict with current index primary terms if the restored primary terms are lower than the existing ones. This situation is likely to happen with replicated closed indices (because primary terms are increased when the index is transitioning from open to closed state, and the snapshotted primary terms are the one at the time the index was opened) (see #38024) and maybe also with CCR. This commit changes the RestoreService so that it updates the primary terms using the maximum value between the snapshotted values and the existing values. Backport da6269b Related to #33888

…dices

tlrx · 2019-02-04T09:29:45Z

Thanks for the review @ywelsch ! I updated the code, can you have another look please?

ywelsch

LGTM. I have left one question

server/src/main/java/org/elasticsearch/index/engine/NoOpEngine.java

tlrx · 2019-02-04T13:20:13Z

Thanks @ywelsch! 🎉

…38327) Relates to #33888 and #38024

Before this change, closed indexes were simply not replicated. It was therefore possible to close an index and then decommission a data node without knowing that this data node contained shards of the closed index, potentially leading to data loss. Shards of closed indices were not completely taken into account when balancing the shards within the cluster, or automatically replicated through shard copies, and they were not easily movable from node A to node B using APIs like Cluster Reroute without being fully reopened and closed again. This commit changes the logic executed when closing an index, so that its shards are not just removed and forgotten but are instead reinitialized and reallocated on data nodes using an engine implementation which does not allow searching or indexing, which has a low memory overhead (compared with searchable/indexable opened shards) and which allows shards to be recovered from peer or promoted as primaries when needed. This new closing logic is built on top of the new Close Index API introduced in 6.7.0 (#37359). Some pre-closing sanity checks are executed on the shards before closing them, and closing an index on a 8.0 cluster will reinitialize the index shards and therefore impact the cluster health. Some APIs have been adapted to make them work with closed indices: - Cluster Health API - Cluster Reroute API - Cluster Allocation Explain API - Recovery API - Cat Indices - Cat Shards - Cat Health - Cat Recovery This commit contains all the following changes (most recent first): * c6c42a1 Adapt NoOpEngineTests after #39006 * 3f9993d Wait for shards to be active after closing indices (#38854) * 5e7a428 Adapt the Cluster Health API to closed indices (#39364) * 3e61939 Adapt CloseFollowerIndexIT for replicated closed indices (#38767) * 71f5c34 Recover closed indices after a full cluster restart (#39249) * 4db7fd9 Adapt the Recovery API for closed indices (#38421) * 4fd1bb2 Adapt more tests suites to closed indices (#39186) * 0519016 Add replica to primary promotion test for closed indices (#39110) * b756f6c Test the Cluster Shard Allocation Explain API with closed indices (#38631) * c484c66 Remove index routing table of closed indices in mixed versions clusters (#38955) * 00f1828 Mute CloseFollowerIndexIT.testCloseAndReopenFollowerIndex() * e845b0a Do not schedule Refresh/Translog/GlobalCheckpoint tasks for closed indices (#38329) * cf9a015 Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#38327) * b9becdd Adapt testPendingTasks() for replicated closed indices (#38326) * 02cc730 Allow shards of closed indices to be replicated as regular shards (#38024) * e53a9be Fix compilation error in IndexShardIT after merge with master * cae4155 Relax NoOpEngine constraints (#37413) * 54d110b [RCI] Adapt NoOpEngine to latest FrozenEngine changes * c63fd69 [RCI] Add NoOpEngine for closed indices (#33903) Relates to #33888

…astic#38024) This commit allows shards of indices in CLOSE state to be replicated as normal shards. It changes the MetaDataIndexStateService so that index routing tables of closed indices are kept in cluster state when the index is closed. Index routing tables are modified so that shard routings are reinitialized with the INDEX_CLOSED unassigned information. The IndicesClusterStateService is modified to remove IndexService instances of closed or reopened indices. In combination with the ShardRouting being in INITIALIZING state the shards are recreated on the data nodes to reflect the new state. If the index state is closed, the IndexShard instances will be created using the NoOpEngine as the engine implementation. This commit also mutes two tests that rely on the fact that shard locks are released when an index is closed, which is not the case anymore with replicated closed indices (actually the locks are released but reacquired once the shard is reinitialized after being closed). These tests will be adapted in follow up PRs. Finally, many things will require to be adapted or improved in follow up PRs (see elastic#33888) but this is the first big step towards replicated closed indices. Relates to elastic#33888

…lastic#38327) Relates to elastic#33888 and elastic#38024

Backport support for replicating closed indices (#39499) Before this change, closed indexes were simply not replicated. It was therefore possible to close an index and then decommission a data node without knowing that this data node contained shards of the closed index, potentially leading to data loss. Shards of closed indices were not completely taken into account when balancing the shards within the cluster, or automatically replicated through shard copies, and they were not easily movable from node A to node B using APIs like Cluster Reroute without being fully reopened and closed again. This commit changes the logic executed when closing an index, so that its shards are not just removed and forgotten but are instead reinitialized and reallocated on data nodes using an engine implementation which does not allow searching or indexing, which has a low memory overhead (compared with searchable/indexable opened shards) and which allows shards to be recovered from peer or promoted as primaries when needed. This new closing logic is built on top of the new Close Index API introduced in 6.7.0 (#37359). Some pre-closing sanity checks are executed on the shards before closing them, and closing an index on a 8.0 cluster will reinitialize the index shards and therefore impact the cluster health. Some APIs have been adapted to make them work with closed indices: - Cluster Health API - Cluster Reroute API - Cluster Allocation Explain API - Recovery API - Cat Indices - Cat Shards - Cat Health - Cat Recovery This commit contains all the following changes (most recent first): * c6c42a1 Adapt NoOpEngineTests after #39006 * 3f9993d Wait for shards to be active after closing indices (#38854) * 5e7a428 Adapt the Cluster Health API to closed indices (#39364) * 3e61939 Adapt CloseFollowerIndexIT for replicated closed indices (#38767) * 71f5c34 Recover closed indices after a full cluster restart (#39249) * 4db7fd9 Adapt the Recovery API for closed indices (#38421) * 4fd1bb2 Adapt more tests suites to closed indices (#39186) * 0519016 Add replica to primary promotion test for closed indices (#39110) * b756f6c Test the Cluster Shard Allocation Explain API with closed indices (#38631) * c484c66 Remove index routing table of closed indices in mixed versions clusters (#38955) * 00f1828 Mute CloseFollowerIndexIT.testCloseAndReopenFollowerIndex() * e845b0a Do not schedule Refresh/Translog/GlobalCheckpoint tasks for closed indices (#38329) * cf9a015 Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#38327) * b9becdd Adapt testPendingTasks() for replicated closed indices (#38326) * 02cc730 Allow shards of closed indices to be replicated as regular shards (#38024) * e53a9be Fix compilation error in IndexShardIT after merge with master * cae4155 Relax NoOpEngine constraints (#37413) * 54d110b [RCI] Adapt NoOpEngine to latest FrozenEngine changes * c63fd69 [RCI] Add NoOpEngine for closed indices (#33903) Relates to #33888

Replicate closed indices

bc92dc0

tlrx added >enhancement v7.0.0 :Distributed Indexing/Distributed A catch all label for anything in the Distributed Area. Please avoid if you can. labels Jan 30, 2019

tlrx requested a review from ywelsch January 30, 2019 11:13

This was referenced Jan 30, 2019

[RCI] Keep index routing table for closed indices #34108

Closed

Replicate closed indices #33888

Closed

Remove assertion in NoOpEngine on the number of commit points

a5a940a

ywelsch suggested changes Jan 31, 2019

View reviewed changes

tlrx mentioned this pull request Feb 1, 2019

RestoreService should update primary terms when restoring shards of existing indices #38177

Merged

Revert changes in RestoreService

acd7f49

tlrx added 2 commits February 4, 2019 10:24

Apply feedback

0801f59

Merge branch 'replicated-closed-indices' into rci-replicate-closed-in…

27bb402

…dices

tlrx requested a review from ywelsch February 4, 2019 09:30

Adapt SharedClusterSnapshotRestoreIT.testRestoreIncreasesPrimaryTerms

5af364d

ywelsch approved these changes Feb 4, 2019

View reviewed changes

server/src/main/java/org/elasticsearch/index/engine/NoOpEngine.java Show resolved Hide resolved

tlrx merged commit 02cc730 into elastic:replicated-closed-indices Feb 4, 2019

tlrx deleted the rci-replicate-closed-indices branch February 4, 2019 13:20

This was referenced Feb 4, 2019

Adapt testPendingTasks() for replicated closed indices #38326

Merged

Adapt testIndexCanChangeCustomDataPath for replicated closed indices #38327

Merged

colings86 added v7.0.0-beta1 and removed v7.0.0 labels Feb 7, 2019

tlrx added a commit that referenced this pull request Feb 11, 2019

Adapt testIndexCanChangeCustomDataPath for replicated closed indices (#…

cf9a015

…38327) Relates to #33888 and #38024

tlrx removed the v7.0.0-beta1 label Feb 18, 2019

henningandersen mentioned this pull request Feb 27, 2019

Adapt the Cluster Health and Cat Indices APIs to closed indices #39364

Merged

tlrx mentioned this pull request Feb 28, 2019

Add support for replicating closed indices #39499

Merged

tlrx added a commit to tlrx/elasticsearch that referenced this pull request Mar 1, 2019

Adapt testIndexCanChangeCustomDataPath for replicated closed indices (e…

224ee2e

…lastic#38327) Relates to elastic#33888 and elastic#38024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow shards of closed indices to be replicated as regular shards #38024

Allow shards of closed indices to be replicated as regular shards #38024

tlrx commented Jan 30, 2019 •

edited

Loading

elasticmachine commented Jan 30, 2019

ywelsch left a comment

ywelsch Jan 31, 2019

tlrx Feb 4, 2019

ywelsch Feb 4, 2019

tlrx commented Feb 4, 2019

ywelsch left a comment

tlrx commented Feb 4, 2019

Allow shards of closed indices to be replicated as regular shards #38024

Allow shards of closed indices to be replicated as regular shards #38024

Conversation

tlrx commented Jan 30, 2019 • edited Loading

elasticmachine commented Jan 30, 2019

ywelsch left a comment

Choose a reason for hiding this comment

ywelsch Jan 31, 2019

Choose a reason for hiding this comment

tlrx Feb 4, 2019

Choose a reason for hiding this comment

ywelsch Feb 4, 2019

Choose a reason for hiding this comment

tlrx commented Feb 4, 2019

ywelsch left a comment

Choose a reason for hiding this comment

tlrx commented Feb 4, 2019

tlrx commented Jan 30, 2019 •

edited

Loading