Add linearizability checker for coordination layer #36943
Conversation
Pinging @elastic/es-distributed
LGTM.
I added a few comments which are primarily discussion subjects.
server/src/test/java/org/elasticsearch/cluster/coordination/LinearizabilityCheckerTests.java
@Override
public void onFailure(String source, Exception e) {
    // do not remove event from history, the write might still take place
    // instead, complete history when checking for linearizability
There is something concerning about this. As far as the linearizability checker is concerned, we are really saying that the write that came in may complete at any time between the call and the next complete cluster shutdown.

Looking at how `onFailure` is handled, the reaction varies from ignoring the failure, to logging an error, to propagating the error out. This might be OK, if carefully considered in each case and with the understanding that the action taken may still complete "later".

In reality, the system probably relies on a failure meaning that the action may have been written at the time of failure, but will not be written much later than that. A user may check whether the write was done and repeat it if not. This could lead to doing things twice, depending on key generation. But since the late write will likely not succeed (a user/operator interaction takes seconds), this is unlikely to happen.

I think the main actionable item I have is to better document what `onFailure` means on `ClusterStateTaskListener`; otherwise this is mainly a discussion topic.
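To make the comment in the diff above concrete, here is a minimal sketch of what "complete history when checking for linearizability" could look like; the `Event` model and all helper names are hypothetical stand-ins, not the actual classes in this PR:

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical event model: each operation logs an INVOKE; successful
// operations also log a RESPONSE. A write whose listener saw onFailure
// stays open, because the write may still be applied later.
class HistoryCompletion {

    enum Type { INVOKE, RESPONSE }

    record Event(Type type, int id, Object value) {}

    /**
     * Before running the linearizability check, close every still-open
     * invocation with a synthetic response appended at the very end.
     * This models "the write may take effect at any time between its
     * invocation and the end of the history".
     */
    static List<Event> complete(List<Event> history, Object syntheticResponse) {
        Set<Integer> responded = new HashSet<>();
        for (Event e : history) {
            if (e.type() == Type.RESPONSE) {
                responded.add(e.id());
            }
        }
        List<Event> completed = new ArrayList<>(history);
        for (Event e : history) {
            if (e.type() == Type.INVOKE && responded.contains(e.id()) == false) {
                completed.add(new Event(Type.RESPONSE, e.id(), syntheticResponse));
            }
        }
        return completed;
    }
}
```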
The failures that can occur might not only come from the system (e.g. the publication layer), but also from the user-defined cluster state update function. Task batching can make this even more complicated. Also worth pointing out is that many master-level actions (i.e. subclasses of `TransportMasterNodeAction`) already have a built-in retry mechanism that reacts to publication-level failures.
@@ -1093,13 +1098,22 @@ void runRandomly() {
     }

     try {
-        if (rarely()) {
+        if (randomBoolean() && randomBoolean() && randomBoolean()) {
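For context on this change: `rarely()` in the randomized testing framework fires only a small fraction of the time (commonly around 10%, though that is an assumption about its default), while `randomBoolean() && randomBoolean() && randomBoolean()` fires with probability 0.5³ = 12.5%, so the new condition enqueues tasks slightly more often. A quick standalone sanity check of the two rates:

```java
import java.util.Random;

// Empirically compare the firing rates of the old and new conditions.
// Plain java.util.Random stands in for the test framework's
// rarely()/randomBoolean(); the ~10% rate for rarely() is assumed.
public class FiringRate {
    public static void main(String[] args) {
        Random random = new Random(42);
        int trials = 1_000_000, oldHits = 0, newHits = 0;
        for (int i = 0; i < trials; i++) {
            if (random.nextInt(100) < 10) {    // rarely(): assumed ~10%
                oldHits++;
            }
            if (random.nextBoolean() && random.nextBoolean() && random.nextBoolean()) {
                newHits++;                     // 0.5^3 = 12.5%
            }
        }
        System.out.printf("old: %.3f, new: %.3f%n",
                (double) oldHits / trials, (double) newHits / trials);
    }
}
```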
Running all tests in the class, 75-80% of the linearizability checks are against histories where all invocations come before all responses (form 1). Around half of the remaining checks are against histories of the form (invocation*, response*, invocation*, response*), i.e. two rounds of invocations/responses (form 2). Looking only at histories larger than size 100, the numbers are 63-65% of form 1 and 77-80% of form 2. Larger than size 1000, about 1 in 2 histories was of form 1.

I believe form 1 will only fail if the responses contain values that are not part of one of the invocations; otherwise the linearizability checks will not detect any problems.

Perhaps we need to tweak the randomness to better provoke situations that could reveal linearizability problems?
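To illustrate why form 1 is a weak test case, here is a hypothetical register history in which all invocations precede all responses; every operation overlaps every other one, so the checker may linearize them in any order:

```java
import java.util.List;

// A form-1 history over a register, written as (kind, process, operation):
// all invocations come before all responses.
public class FormOneHistory {
    record Event(String kind, int process, String op) {}

    public static void main(String[] args) {
        List<Event> history = List.of(
                new Event("invoke", 0, "write(1)"),
                new Event("invoke", 1, "write(2)"),
                new Event("invoke", 2, "read()"),
                new Event("respond", 0, "ok"),
                new Event("respond", 1, "ok"),
                new Event("respond", 2, "-> 2"));
        // All three operations are pairwise concurrent, so a checker may
        // linearize them in any of the 3! orders; the read may legally
        // return the initial value, 1, or 2. The check can only fail if a
        // response reports a value no sequential ordering could produce.
        history.forEach(System.out::println);
    }
}
```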
Thank you for the investigation. Tweaking the randomness here is not too straightforward, and has a more general impact on how these Coordinator tests run. I will leave this to a follow-up PR, as I don't want to delay merging the infrastructure in this PR any longer, so that it is available to other follow-up work.
Checks that the core coordination algorithm implemented as part of Zen2 (#32006) supports linearizable semantics. This commit adds a linearizability checker based on the Wing and Gong graph search algorithm with support for compositional checking and activates these checks for all CoordinatorTests.
Today the default stabilisation time is calculated on the assumption that the elected master has no pending tasks to process when it is elected, but this is not a safe assumption to make. This can result in a cluster reaching the end of its stabilisation time without having stabilised. Furthermore in #36943 we increased the probability that each step in `runRandomly()` enqueues another task, vastly increasing the chance that we hit such a situation. This change extends the stabilisation process to allow time for all pending tasks, plus a task that might currently be in flight. Fixes #41967, in which the master entered the stabilisation phase with over 800 tasks to process.
Checks that the core coordination algorithm implemented as part of Zen2 (#32006) supports linearizable semantics. This PR adds a linearizability checker based on the Wing and Gong graph search algorithm with support for compositional checking and activates these checks for all `CoordinatorTests`.
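For background, the Wing and Gong algorithm searches for a linearization by repeatedly choosing a "minimal" pending operation (one whose invocation is not preceded by any other pending operation's response), applying it to a sequential specification of the data type, and backtracking when the observed response disagrees. Below is a minimal sketch for a single register; all names are hypothetical, and the memoization of visited configurations that makes this a graph search is omitted:

```java
import java.util.ArrayList;
import java.util.List;

// Hedged sketch of the Wing & Gong search for a single int register.
// Each operation carries an invocation index and a response index into a
// global sequence of time points; op B follows op A iff A.res < B.inv.
public class WingGongSketch {

    record Op(String kind, int arg, int result, int inv, int res) {}

    /** True if some linearization of ops is consistent with a register. */
    static boolean isLinearizable(List<Op> ops, int initialValue) {
        return search(ops, initialValue);
    }

    private static boolean search(List<Op> pending, int state) {
        if (pending.isEmpty()) {
            return true; // linearized every operation
        }
        for (Op candidate : pending) {
            if (isMinimal(candidate, pending) == false) {
                continue; // some other op finished before this one began
            }
            // Apply the candidate to the sequential spec of a register.
            int nextState;
            if (candidate.kind().equals("write")) {
                nextState = candidate.arg();
            } else { // read
                if (candidate.result() != state) {
                    continue; // read saw a value the spec can't produce here
                }
                nextState = state;
            }
            List<Op> rest = new ArrayList<>(pending);
            rest.remove(candidate);
            if (search(rest, nextState)) {
                return true;
            }
            // otherwise backtrack and try the next minimal op
        }
        return false;
    }

    /** Minimal: no other pending op responded before this op was invoked. */
    private static boolean isMinimal(Op op, List<Op> pending) {
        for (Op other : pending) {
            if (other != op && other.res() < op.inv()) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // write(7) completes before the read begins; the read must see 7.
        List<Op> ops = List.of(
                new Op("write", 7, 0, 0, 1),
                new Op("read", 0, 7, 2, 3));
        System.out.println(isLinearizable(new ArrayList<>(ops), 0)); // true
    }
}
```

Compositional checking builds on the locality property of linearizability: a history is linearizable iff the sub-history of each key is linearizable, so the search can run independently per register rather than over the whole history at once.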