Do not close bad indices on startup #39500

ywelsch · 2019-02-28T09:49:36Z

With #17187, we verified IndexService creation during initial state recovery on the master, and if that verification failed, the index was imported as closed so that none of its shards were allocated. This was mainly done to prevent endless allocation loops and full log files on data nodes when the index metadata contained broken settings or analyzers. Zen2 loads the cluster state eagerly, and this check currently runs on all nodes (not only the elected master), which can significantly slow down startup on data nodes. Furthermore, with replicated closed indices (#33888) on the horizon, importing an index as closed will no longer prevent its shards from being allocated. Fortunately, the original problem of endless allocation loops no longer exists thanks to #18467, which limits the retries of failed allocations. The solution here is therefore to simply undo #17187: it is no longer necessary, its purpose is covered by #18467, and removing it resolves the issue for Zen2 and replicated closed indices as well.
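To illustrate why a retry limit makes the close-on-startup check redundant, here is a minimal sketch of the idea, not the actual Elasticsearch implementation. The class `RetryLimitSketch`, its method names, and the limit of 5 used in the usage note are all hypothetical and chosen only for illustration. Once a shard has failed to allocate the maximum number of times, further attempts are refused until someone resets the counter, so a broken index can no longer cause an endless allocation loop.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of retry-limited allocation; not Elasticsearch code. */
final class RetryLimitSketch {
    private final int maxRetries;
    // Failed allocation attempts per shard, keyed by an illustrative shard id.
    private final Map<String, Integer> failedAttempts = new HashMap<>();

    RetryLimitSketch(int maxRetries) {
        this.maxRetries = maxRetries;
    }

    /** Records one failed allocation attempt for the given shard. */
    void onAllocationFailed(String shardId) {
        failedAttempts.merge(shardId, 1, Integer::sum);
    }

    /** A shard may be allocated only while its failure count is below the limit. */
    boolean canAllocate(String shardId) {
        return failedAttempts.getOrDefault(shardId, 0) < maxRetries;
    }

    /** Clears the counter, e.g. after the broken settings have been fixed. */
    void resetFailures(String shardId) {
        failedAttempts.remove(shardId);
    }
}
```

With `new RetryLimitSketch(5)`, a shard whose index has a broken analyzer would be attempted five times and then left unassigned rather than retried forever, which bounds both the allocation work and the log output without closing the index.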

elasticmachine · 2019-02-28T09:49:38Z

Pinging @elastic/es-distributed

Do not close bad indices on startup

cf00e0b

ywelsch added >non-issue, v7.0.0, :Distributed Coordination/Cluster Coordination, v8.0.0, v7.2.0 labels Feb 28, 2019

ywelsch requested review from andrershov and tlrx February 28, 2019 09:49

ywelsch mentioned this pull request Feb 28, 2019

A new cluster coordination layer #32006

Closed

61 tasks

tlrx approved these changes Feb 28, 2019

always a pleasure, checkstyle

50cf7af

tlrx approved these changes Feb 28, 2019

ywelsch added 3 commits February 28, 2019 11:14

test imports

0db344c

fix test

1d99e59

Merge remote-tracking branch 'elastic/master' into do-not-close-bad-i…

de81de7

…ndices-on-startup

ywelsch merged commit 349c8a3 into elastic:master Mar 1, 2019

jakelandis added v7.0.0-rc2 and removed v7.0.0 labels Apr 3, 2019

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
