kvserver: narrow down 'finishing a proposal with outstanding reproposal' #97564

tbg · 2023-02-23T12:04:07Z

In #94633, I introduced¹ an assertion that attempted to catch cases
in which we might otherwise accidentally end up applying a proposal
twice.

This assertion had a false positive.

I was able to reproduce the failure within ~minutes via
./experiment.sh in #97173 as of 33dcdef.

Better testing of these cases would be desirable. Unfortunately, while
there is an abstraction over command application (apply.Task), most
of the logic worth testing lives in (*replicaAppBatch) which is
essentially a *Replica with more moving parts attached. This does
not lend itself well to unit testing.

I had a run²³⁴ earlier this year to make log application
standalone, but then didn't have enough time to follow through.
It would be desirable to do so at a later date, perhaps with
the explicit goals of having interactions like the one discussion
in this PR unit become testable.

This assertion was previously trying to assert too much at a distance
and was not only incorrect, but additionally inscrutable.

It was mixing up two assertions, the first one of which is sensible:
If an entry is accepted, it must not be superseded by inflight proposal.
If this were violated, this superseded proposal could also apply,
resulting in a failure of replay protection. This assertion is now
still around as a stand-alone assertion.

The other half of the assertion was more confused: if an entry is
rejected, it was claiming that it couldn't also be superseded. The
thinking was that if a superseding log entry exists, maybe it could
apply, and that would be bad since we just told the waiting client
that their proposal got rejected.

This reasoning is incorrect, as the following example shows. Consider
the following initial situation:

[lease seq is 1]
log idx 99:  unrelated cmd at LAI 10000, lease seq = 1
log idx 100: cmd X at LAI 10000, lease seq = 1

And next:

a new lease enters the log at idx 101 (lease seq = 2)
an identical copy of idx 100 enters the log at idx 102
we apply idx 100, leading to superseding reproposal at idx 103

resulting in the log:

[lease seq is 1]
log idx 99: unrelated cmd at LAI 10000, lease seq = 1
log idx 100: cmd X at LAI 10000, lease seq = 1
log idx 101: lease seq = 2
log idx 102: (same as idx 100)
log idx 103: cmd X at LAI = 20000, lease seq = 1

During application of idx 102, we get a permanent rejection and yet
the entry is superseded (by the proposal at idx 103). This would
erroneously trigger the assertion, even though this is a legal sequence
of events with no detrimental outcomes: the superseding proposal will
always have the same lease sequence as its superseded copies, so it
will also fail.

I initially tried only soften the assertion a little bit. Observing
that the example above led to a permanent rejection, should we only
require that a proposal (which in this assertion is always local) is not
superseded if it got rejected due to its lease index (which implies that
it passed the lease check)? It turns out that this is primarily an
assertion on when superseded proposals are counted as "local" at this
point in the code: if there were multiple copies of this rejected
proposal in the current appTask (i.e. the current CommittedEntries
slice handed to us for application by raft), then all copies are
initially local; and a copy that successfully spawns a superseding
proposal would be made non-local from that point on. On the face
of it, All other copies in the same appTask would now hit the
assertion (erroneously): they are local, they are rejected, so
why don't they enter the branch? The magic ingredient is that
if an entry is superseded when we handle the lease index rejection,
we also unlink the proposal from it. So these never enter this
path since it's not local at this point.

For example, if these are the log entries to apply (all at valid lease
seq):

log idx 99: unrelated cmd at LAI 10000
log idx 100: cmd X at LAI 10000
log idx 101: (identical copy of idx 100)

and idxs 99-101 are applied in one batch, then idx 100 would spawn
a reproposal at a new lease applied index:

log idx 99: unrelated cmd at LAI 10000
log idx 100: cmd X at LAI 10000             <- applied
log idx 101: (identical copy of idx 100)
log idx 100: cmd X at LAI 20000             <- not in current batch

When we apply 101, we observe an illegal lease index, but the proposal
supersedes the entry, so we mark it as non-local and don't enter the
branch that contains the assertion.

The above reasoning is very difficult to understand, and it happens
too far removed from where the interesting state changes happen.

Also, for testing purposes it is interesting to introduce "errors"
in the lease applied index assignment to artificially exercise these
reproposal mechanisms. In doing so, these assertions can trip because
the lease applied index assigned to a reproposal might accidentally
(or intentionally!) match the existing lease applied index, in which
case copies of the command in the same batch now don't consider
themselves superseded.

The value of this testing outweighs the very limited benefit of
this branch of the assertion. An argument could even be made that
this assertion alone as negative benefit due to its complexity.

We are removing it in this commit and will instead work towards
simplifying the mechanisms that played a role in explaining the
asssertion.

Closes #97102.
Closes #97347.
Closes #97447.
Closes #97612.

No release note because unreleased (except perhaps in an alpha).

Epic: none
Release note: None

blathers-crl · 2023-02-23T12:04:12Z

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.}

cockroach-teamcity · 2023-02-23T12:04:20Z

This change is

pkg/kv/kvserver/replica_application_result.go

pav-kv

Is this basically a rollback to pre-#94633 state w.r.t. "finishing a proposal with outstanding reproposal" assert? (+refactoring)

pkg/kv/kvserver/replica_application_result.go

pkg/kv/kvserver/replica_application_state_machine.go

pkg/kv/kvserver/replica_application_result.go

This exposes the madness of this control flow for what it is. There is no behavior change here. We were previously unconditionally overriding `pErr`, potentially with a `nil`, which seems sketchy but we get away with it and now we document this explicitly. Epic: none Release note: None

`rabbitHole.deepen()` Epic: none Release note: None

In cockroachdb#94633, I introduced[^1] an assertion that attempted to catch cases in which we might otherwise accidentally end up applying a proposal twice. This assertion had a false positive. I was able to reproduce the failure within ~minutes via `./experiment.sh` in cockroachdb#97173 as of 33dcdef. Better testing of these cases would be desirable. Unfortunately, while there is an abstraction over command application (`apply.Task`), most of the logic worth testing lives in `(*replicaAppBatch)` which is essentially a `*Replica` with more moving parts attached. This does not lend itself well to unit testing. I had a run[^2][^3][^4] earlier this year to make log application standalone, but then didn't have enough time to follow through. It would be desirable to do so at a later date, perhaps with the explicit goals of having interactions like the one discussion in this PR unit become testable. [^4]: cockroachdb#93309 [^3]: cockroachdb#93266 [^2]: cockroachdb#93239 [^1]: https://github.com/cockroachdb/cockroach/pull/94633/files#diff-50e458584d176deae52b20a7c04461b3e4110795c8c9a307cf7ee6696ba6bc60R238 This assertion was previously trying to assert too much at a distance and was not only incorrect, but additionally inscrutable. It was mixing up two assertions, the first one of which is sensible: If an entry is accepted, it must not be superseded by inflight proposal. If this were violated, this superseded proposal could also apply, resulting in a failure of replay protection. This assertion is now still around as a stand-alone assertion. The other half of the assertion was more confused: if an entry is rejected, it was claiming that it couldn't also be superseded. The thinking was that if a superseding log entry exists, maybe it could apply, and that would be bad since we just told the waiting client that their proposal got rejected. This reasoning is incorrect, as the following example shows. Consider the following initial situation: [lease seq is 1] log idx 99: unrelated cmd at LAI 10000, lease seq = 1 log idx 100: cmd X at LAI 10000, lease seq = 1 And next: - a new lease enters the log at idx 101 (lease seq = 2) - an identical copy of idx 100 enters the log at idx 102 - we apply idx 100, leading to superseding reproposal at idx 103 resulting in the log: [lease seq is 1] log idx 99: unrelated cmd at LAI 10000, lease seq = 1 log idx 100: cmd X at LAI 10000, lease seq = 1 log idx 101: lease seq = 2 log idx 102: (same as idx 100) log idx 103: cmd X at LAI = 20000, lease seq = 1 During application of idx 102, we get a *permanent* rejection and yet the entry is superseded (by the proposal at idx 103). This would erroneously trigger the assertion, even though this is a legal sequence of events with no detrimental outcomes: the superseding proposal will always have the same lease sequence as its superseded copies, so it will also fail. I initially tried only soften the assertion a *little bit*. Observing that the example above led to a *permanent* rejection, should we only require that a proposal (which in this assertion is always local) is not superseded if it got rejected due to its lease index (which implies that it passed the lease check)? It turns out that this is primarily an assertion on when superseded proposals are counted as "local" at this point in the code: if there were multiple copies of this rejected proposal in the current `appTask` (i.e. the current `CommittedEntries` slice handed to us for application by raft), then all copies are initially local; and a copy that successfully spawns a superseding proposal would be made non-local from that point on. On the face of it, All other copies in the same `appTask` would now hit the assertion (erroneously): they are local, they are rejected, so why don't they enter the branch? The magic ingredient is that if an entry is superseded when we handle the lease index rejection, we also unlink the proposal from it. So these never enter this path since it's not local at this point. For example, if these are the log entries to apply (all at valid lease seq): log idx 99: unrelated cmd at LAI 10000 log idx 100: cmd X at LAI 10000 log idx 101: (identical copy of idx 100) and idxs 99-101 are applied in one batch, then idx 100 would spawn a reproposal at a new lease applied index: log idx 99: unrelated cmd at LAI 10000 log idx 100: cmd X at LAI 10000 <- applied log idx 101: (identical copy of idx 100) log idx 100: cmd X at LAI 20000 <- not in current batch When we apply 101, we observe an illegal lease index, but the proposal supersedes the entry, so we mark it as non-local and don't enter the branch that contains the assertion. The above reasoning is very difficult to understand, and it happens too far removed from where the interesting state changes happen. Also, for testing purposes it is interesting to introduce "errors" in the lease applied index assignment to artificially exercise these reproposal mechanisms. In doing so, these assertions can trip because the lease applied index assigned to a reproposal might accidentally (or intentionally!) match the existing lease applied index, in which case copies of the command in the same batch now *don't* consider themselves superseded. The value of this testing outweighs the very limited benefit of this branch of the assertion. An argument could even be made that this assertion alone as negative benefit due to its complexity. We are removing it in this commit and will instead work towards simplifying the mechanisms that played a role in explaining the asssertion. Closes cockroachdb#94633. Closes cockroachdb#97347. No release note because unreleased (except perhaps in an alpha). Epic: none Release note: None

tbg · 2023-02-24T20:07:54Z

TFTR!

Is this basically a rollback

I checked and yes, almost. There is this part that hasn't been brought back: #97564 (comment)

tbg · 2023-02-26T21:50:51Z

bors r=pavelkalinnikov

craig · 2023-02-26T22:22:06Z

Build succeeded:

Bazel Essential CI (Cockroach)

…reproposal'" Reverts cockroachdb#97564. This reverts commit 554a5d5.

…ndex" Reverts cockroachdb#97564. This reverts commit 6283099.

…ndex" Reverts cockroachdb#97564. This reverts commit 548b2f3.

98481: kvserver: revert recent changes to reproposals r=pavelkalinnikov a=tbg Reverts #97606, #97564, #94825, #94633. - Revert "kvserver: disable assertion 'finished proposal inserted'" - Revert "kvserver: narrow down 'finishing a proposal with outstanding reproposal'" - Revert "kvserver: fill gaps in comment near tryReproposeWithNewLeaseIndex" - Revert "kvserver: hoist early return out of tryReproposeWithNewLeaseIndex" - Revert "fixup! kvserver: prevent finished proposal from being present in proposals map" - Revert "kvserver: prevent finished proposal from being present in proposals map" - Revert "kvserver: improve reproposal assertions and documentation" Closes #97973. Epic: CRDB-25287 Release Note: none 98537: sql: check row level ttl change before truncating a table r=chengxiong-ruan a=chengxiong-ruan Fixes: #93443 Release note (sql change): This commit fixed a bug where crdb paniced wehn user tried to truncate a table which is has an ongoing row level ttl change. We still don't support table truncates in this scenario, but a more gentle unimplemented error is returned instead of panic. 98575: cdc: use int64 for emitted bytes telemetry r=miretskiy a=jayshrivastava Previously, the stored `emitted_bytes` field was an int32, which can hold a maximum value of 2.1GB. This value is too small because the logging period is 24h and changefeeds can emit much more than 2.1GB in 24h. This change updates the field to be an int64, which solves this problem. Epic: None Release note: None 98582: ci: allow-list `BUILD_VCS_NUMBER` env var in cloud unit tests r=jlinder a=rickystewart This job was filing issues linking to the wrong commit. Epic: none Release note: None Co-authored-by: Tobias Grieger <[email protected]> Co-authored-by: Chengxiong Ruan <[email protected]> Co-authored-by: Jayant Shrivastava <[email protected]> Co-authored-by: Ricky Stewart <[email protected]>

tbg commented Feb 23, 2023

View reviewed changes

pkg/kv/kvserver/replica_application_result.go Show resolved Hide resolved

tbg commented Feb 23, 2023

View reviewed changes

pkg/kv/kvserver/replica_application_result.go Show resolved Hide resolved

tbg force-pushed the revamp-and-explain-assertions branch from d9e8d39 to f1fa9d5 Compare February 23, 2023 12:43

tbg marked this pull request as ready for review February 23, 2023 12:44

tbg requested a review from a team February 23, 2023 12:44

tbg mentioned this pull request Feb 23, 2023

kvserver: fix false positive in 'outstanding reproposal' assertion #97347

Closed

pav-kv approved these changes Feb 24, 2023

View reviewed changes

This was referenced Feb 24, 2023

roachtest: replicate/up/1to3 failed #97447

Closed

roachtest: clearrange/checks=false/rangeTs=false failed #97612

Closed

tbg added 3 commits February 24, 2023 21:06

kvserver: fill gaps in comment near tryReproposeWithNewLeaseIndex

6283099

`rabbitHole.deepen()` Epic: none Release note: None

tbg force-pushed the revamp-and-explain-assertions branch from f1fa9d5 to 554a5d5 Compare February 24, 2023 20:08

craig bot merged commit 13c58f6 into cockroachdb:master Feb 26, 2023

tbg deleted the revamp-and-explain-assertions branch February 27, 2023 08:46

tbg mentioned this pull request Mar 3, 2023

kvserver: revert recent reproposal changes #97973

Closed

5 tasks

tbg added a commit to tbg/cockroach that referenced this pull request Mar 13, 2023

Revert "kvserver: narrow down 'finishing a proposal with outstanding …

70894d2

…reproposal'" Reverts cockroachdb#97564. This reverts commit 554a5d5.

tbg added a commit to tbg/cockroach that referenced this pull request Mar 13, 2023

Revert "kvserver: fill gaps in comment near tryReproposeWithNewLeaseI…

fcc8742

…ndex" Reverts cockroachdb#97564. This reverts commit 6283099.

tbg added a commit to tbg/cockroach that referenced this pull request Mar 13, 2023

Revert "kvserver: hoist early return out of tryReproposeWithNewLeaseI…

9a44384

…ndex" Reverts cockroachdb#97564. This reverts commit 548b2f3.

tbg mentioned this pull request Mar 13, 2023

kvserver: revert recent changes to reproposals #98481

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kvserver: narrow down 'finishing a proposal with outstanding reproposal' #97564

kvserver: narrow down 'finishing a proposal with outstanding reproposal' #97564

tbg commented Feb 23, 2023 •

edited by pav-kv

Loading

blathers-crl bot commented Feb 23, 2023

cockroach-teamcity commented Feb 23, 2023

pav-kv left a comment •

edited

Loading

tbg commented Feb 24, 2023

tbg commented Feb 26, 2023

craig bot commented Feb 26, 2023

kvserver: narrow down 'finishing a proposal with outstanding reproposal' #97564

kvserver: narrow down 'finishing a proposal with outstanding reproposal' #97564

Conversation

tbg commented Feb 23, 2023 • edited by pav-kv Loading

Footnotes

blathers-crl bot commented Feb 23, 2023

cockroach-teamcity commented Feb 23, 2023

pav-kv left a comment • edited Loading

Choose a reason for hiding this comment

tbg commented Feb 24, 2023

tbg commented Feb 26, 2023

craig bot commented Feb 26, 2023

tbg commented Feb 23, 2023 •

edited by pav-kv

Loading

pav-kv left a comment •

edited

Loading