
Only start re-assigning persistent tasks if they are not already being reassigned #76258

Conversation

@benwtrent (Member)

In cluster recovery scenarios, it is possible there has been a flurry of cluster state updates. These updates may be routing updates in an attempt to get indices searchable again on nodes.

Each of these updates may trigger a new persistent task re-assignment update, which queues yet more cluster state update requests.

This can cause unnecessary work, and consequently slow down the cluster's recovery.

This commit guards the cluster state update action for persistent task re-assignment so that only one is queued at a time.

This MAY cause certain persistent tasks to be re-assigned more slowly, but since we periodically recheck for re-assignment, this is acceptable.
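
For illustration, a minimal sketch of the guarding idea described above (this is not the actual PersistentTasksClusterService code; the class and method names are invented):

    import java.util.concurrent.atomic.AtomicBoolean;

    // Minimal sketch: allow at most one queued "reassign persistent tasks"
    // cluster state update at a time.
    class ReassignmentGuard {

        // True while a reassignment cluster state update is queued or running.
        private final AtomicBoolean reassigning = new AtomicBoolean(false);

        void maybeReassignPersistentTasks(Runnable submitClusterStateUpdate) {
            // Only queue a new reassignment if one is not already in flight;
            // otherwise skip and rely on the periodic recheck.
            if (reassigning.compareAndSet(false, true)) {
                try {
                    submitClusterStateUpdate.run();
                } catch (Exception e) {
                    reassigning.set(false); // allow future attempts if submission failed
                    throw e;
                }
            }
        }

        // Called from the update's completion listener, on success or failure.
        void reassignmentFinished() {
            reassigning.set(false);
        }
    }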

@benwtrent requested a review from @droberts195 on August 9, 2021 19:19
@benwtrent added the :Distributed Coordination/Task Management label (Issues for anything around the Tasks API - both persistent and node level) on Aug 9, 2021
@elasticmachine added the Team:Distributed (Obsolete) label (Meta label for distributed team, obsolete; replaced by Distributed Indexing/Coordination) on Aug 9, 2021
@elasticmachine (Collaborator)

Pinging @elastic/es-distributed (Team:Distributed)

@benwtrent added the >bug, v7.15.0, and v8.0.0 labels and removed the Team:Distributed (Obsolete) label on Aug 9, 2021
@mark-vieira (Contributor)

@elasticmachine retest this please

@droberts195 (Contributor)

The changes so far are fine, but I think there's an even more important change that needs to be made as well.

The seemingly innocuous change of https://github.com/elastic/elasticsearch/pull/72260/files#diff-73e70d1002fe8bcafa19e34b892feb628153094db2ce30d77ca2f56ae5752523R316-R317 which went into 7.14 causes a major problem for any type of persistent task where the reason for failure to assign includes detailed per-node information.

It means that if a particular type of persistent task cannot be assigned then it will lead to a vicious circle of: fail to assign -> set assignment failure reason -> update cluster state -> trigger cluster state listener -> try to reassign unassigned persistent tasks -> fail to assign -> set assignment failure reason to something different to what it was before -> update cluster state -> trigger cluster state listener -> etc.

For example, the failure reasons might go:

  1. Could not assign ML job [node B doesn't work because Y][node A doesn't work because X][node C doesn't work because Z]
  2. Could not assign ML job [node A doesn't work because X][node C doesn't work because Z][node B doesn't work because Y]
  3. Could not assign ML job [node A doesn't work because X][node B doesn't work because Y][node C doesn't work because Z]

So even though the reasons are effectively the same, the strings differ.

The "set assignment failure reason to something different to what it was before" step is what changed in 7.14. In 7.13 and earlier, the second cycle would choose the same assignment failure reason as the first because the nodes would be checked in the same order, so the second assignment attempt would not cause a second cluster state update: the cluster state would be identical.

I think this is a serious bug that will affect other users if it's not fixed soon, so it needs to be fixed for 7.14.1.

Two possible fixes I can see are:

  1. Delete https://github.com/elastic/elasticsearch/pull/72260/files#diff-73e70d1002fe8bcafa19e34b892feb628153094db2ce30d77ca2f56ae5752523R316-R317 and replace it with a comment saying it's really important that consecutive calls to getAssignment see the nodes in the same order (if the cluster hasn't changed in between).
  2. Sort the failure reasons for every type of persistent task that includes per-node detail in its failure reasons. Obviously ML falls into this category but other types of persistent task might also need their per-node detail sorting too.

It would also be good to add a test that two consecutive assignment failures with the same cluster state generate the same failure reason.
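
As a rough, self-contained illustration of fix option 2 and the suggested test (the class and helper names below are invented, not the real ML or persistent-tasks code):

    import java.util.ArrayList;
    import java.util.Collections;
    import java.util.List;

    public class StableAssignmentReason {

        // Fix option 2: sort the per-node reasons before joining, so the
        // explanation does not depend on the order the nodes were examined in.
        static String buildReason(List<String> perNodeReasons) {
            List<String> sorted = new ArrayList<>(perNodeReasons);
            Collections.sort(sorted);
            return "Could not assign ML job " + String.join("", sorted);
        }

        public static void main(String[] args) {
            // Two consecutive assignment attempts that visit the nodes in
            // different orders (as can happen since 7.14)...
            String first = buildReason(List.of(
                "[node B doesn't work because Y]",
                "[node A doesn't work because X]",
                "[node C doesn't work because Z]"));
            String second = buildReason(List.of(
                "[node A doesn't work because X]",
                "[node C doesn't work because Z]",
                "[node B doesn't work because Y]"));

            // ...must produce identical reasons; otherwise the second attempt
            // writes a new cluster state and re-triggers the listener loop.
            if (!first.equals(second)) {
                throw new AssertionError("assignment reason is not deterministic");
            }
        }
    }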

@droberts195 (Contributor)

Sort the failure reasons for every type of persistent task that includes per-node detail in its failure reasons. Obviously ML falls into this category but other types of persistent task might also need their per-node detail sorting too.

I had a look and transforms already does this - it puts the per-node detailed reasons in a tree map keyed on node ID. The other types of persistent tasks just use very simple high level reasons, so won't be affected. I think it's just ML that will be affected, although all types of ML persistent tasks.
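
For comparison, a hedged sketch of the transform-style approach mentioned above, accumulating per-node reasons in a tree map keyed on node ID (illustrative names only, not the actual transform or ML code):

    import java.util.Map;
    import java.util.TreeMap;
    import java.util.stream.Collectors;

    public class PerNodeReasons {

        // TreeMap iterates in node-ID order, so the joined explanation is
        // stable across consecutive assignment attempts on an unchanged cluster.
        private final Map<String, String> reasonsByNodeId = new TreeMap<>();

        void addReason(String nodeId, String reason) {
            reasonsByNodeId.put(nodeId, reason);
        }

        String explanation() {
            return reasonsByNodeId.entrySet().stream()
                .map(e -> "[" + e.getKey() + " doesn't work because " + e.getValue() + "]")
                .collect(Collectors.joining());
        }

        public static void main(String[] args) {
            PerNodeReasons reasons = new PerNodeReasons();
            reasons.addReason("node-B", "Y");
            reasons.addReason("node-A", "X");
            // Prints: [node-A doesn't work because X][node-B doesn't work because Y]
            System.out.println(reasons.explanation());
        }
    }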

@henningandersen (Contributor) left a comment

Thanks Ben, this looks good. Can we also add a test to verify the behavior, so that we do not inadvertently disable this somehow in the future?

@henningandersen (Contributor) left a comment

Test looks good, one minor comment on it. I did not review the other added part yet.

@benwtrent added the auto-backport label (Automatically create backport pull requests when merged) on Aug 10, 2021
@droberts195 (Contributor) left a comment

LGTM if you could just make one more tweak

@henningandersen (Contributor) left a comment

LGTM.

assertThat(
    result.getExplanation(),
    equalTo(
        "Not opening job [incompatible_type_job] on node [{_node_name1}{version=8.0.0}], "
Contributor left a comment on the code above

I think we should substitute Version.CURRENT.toString instead of 8.0.0, otherwise this test will break every time we release?

Same issue two lines down.
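
For illustration, the suggestion could look roughly like this (a sketch only; ExpectedExplanation is an invented helper, and only the message prefix from the snippet above is shown):

    import org.elasticsearch.Version;

    class ExpectedExplanation {
        // Build the expected message with Version.CURRENT instead of a
        // hard-coded "8.0.0", so the assertion keeps passing after a release.
        static String expectedPrefix() {
            return "Not opening job [incompatible_type_job] on node [{_node_name1}{version="
                + Version.CURRENT
                + "}], ";
        }
    }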

@benwtrent (Member, Author) left a comment

I can do that

@benwtrent added the auto-merge-without-approval label (Automatically merge pull request when CI checks pass; NB doesn't wait for reviews!) on Aug 10, 2021
@elasticsearchmachine merged commit e8a3b05 into elastic:master on Aug 10, 2021
@elasticsearchmachine (Collaborator)

💔 Backport failed

Branch  Result
7.14    Commit could not be cherry-picked due to conflicts
7.x     Commit could not be cherry-picked due to conflicts

To backport manually run:
backport --pr 76258

@benwtrent deleted the feature/only-reassign-p-tasks-if-not-currently-reassigning branch on August 10, 2021 18:21
benwtrent added a commit to benwtrent/elasticsearch that referenced this pull request Aug 10, 2021
…g reassigned (elastic#76258)

* Only start re-assigning persistent tasks if they are not already being reassigned

* adding tests addressing PR comments

* addressing Pr COmments

* addressing PR comments + style"

* improving test rigor
benwtrent added a commit to benwtrent/elasticsearch that referenced this pull request Aug 10, 2021
…g reassigned (elastic#76258)

* Only start re-assigning persistent tasks if they are not already being reassigned

* adding tests addressing PR comments

* addressing Pr COmments

* addressing PR comments + style"

* improving test rigor
elasticsearchmachine pushed a commit that referenced this pull request Aug 10, 2021
…y being reassigned (#76258) (#76314)

* Only start re-assigning persistent tasks if they are not already being reassigned (#76258)

* Only start re-assigning persistent tasks if they are not already being reassigned

* adding tests addressing PR comments

* addressing Pr COmments

* addressing PR comments + style"

* improving test rigor

* test improvement
elasticsearchmachine pushed a commit that referenced this pull request Aug 10, 2021
…dy being reassigned (#76258) (#76315)

* Only start re-assigning persistent tasks if they are not already being reassigned (#76258)

* Only start re-assigning persistent tasks if they are not already being reassigned

* adding tests addressing PR comments

* addressing Pr COmments

* addressing PR comments + style"

* improving test rigor

* test improvement
@ferozsalam

We were on 7.14.0 and experienced the issue described in #76258 (comment) - upgrading to 7.14.1 (which includes the fix in this PR) fixed the problem. Posting some symptoms here in case it helps anyone searching for solutions to their problems:

  • A large (and growing) number of pending tasks
  • A significant number of these tasks having the source "reassign persistent tasks"

Upgrading to 7.14.1 and rolling all our pods (we run ECK) resolved the issue entirely.

Labels: auto-backport, auto-merge-without-approval, >bug, :Distributed Coordination/Task Management, v7.14.1, v7.15.0, v8.0.0-alpha2