sql: fix race condition in internal executor #63010
Conversation
Reviewed 1 of 1 files at r1.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner)
pkg/sql/internal_result_channel.go, line 105 at r1 (raw file):
var firstErr error
for {
	res, done, err := c.nextResult(context.TODO())
I started thinking that this context.TODO might also be insufficient for correctness purposes. WDYT?
I wonder if I should change the signature to take in ctx for close and rowsIterator.Close in a separate PR.
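(For illustration only - a minimal sketch of that proposed signature change; the names below are hypothetical placeholders, not the PR's actual rowsIterator.)

package sketch

import "context"

// closer stands in for whatever the iterator wraps; purely hypothetical.
type closer interface {
	close(ctx context.Context) error
}

// rowsIterator here only illustrates the shape of threading the caller's ctx
// through Close instead of falling back to context.TODO inside close.
type rowsIterator struct {
	r closer
}

func (it *rowsIterator) Close(ctx context.Context) error {
	return it.r.close(ctx)
}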
pkg/sql/internal_result_channel.go, line 184 at r1 (raw file):
case res, ok := <-i.dataCh:
	if !ok {
		return ieIteratorResult{}, true, nil
What about this place? I'm starting to think that possibly removing doneCh and using context.CancelFunc might be less error-prone. Thoughts?
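(As an aside, a rough Go sketch of the two signaling styles being compared; none of these names come from the PR, and the real types carry more state.)

package sketch

import "context"

// ieIteratorResult stands in for the real result type.
type ieIteratorResult struct{}

// Done-channel style: every select that touches dataCh must also remember to
// include doneCh, which is easy to get wrong.
type chanWriter struct {
	dataCh chan ieIteratorResult
	doneCh chan struct{} // closed by the reader when it stops consuming
}

func (w *chanWriter) addResult(ctx context.Context, r ieIteratorResult) error {
	select {
	case w.dataCh <- r:
		return nil
	case <-w.doneCh:
		return context.Canceled // stand-in for a "reader is done" sentinel
	case <-ctx.Done():
		return ctx.Err()
	}
}

// CancelFunc style: the reader cancels the producer's context instead, so a
// single <-ctx.Done() case covers both "caller canceled" and "reader is done".
type ctxWriter struct {
	dataCh chan ieIteratorResult
	cancel context.CancelFunc // invoked by the reader's close/finish
}

func (w *ctxWriter) addResult(ctx context.Context, r ieIteratorResult) error {
	select {
	case w.dataCh <- r:
		return nil
	case <-ctx.Done():
		return ctx.Err()
	}
}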
Force-pushed from 010591f to 2d3c4aa.
See how this change makes you feel.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @yuzefovich)
pkg/sql/internal_result_channel.go, line 105 at r1 (raw file):
Previously, yuzefovich (Yahor Yuzefovich) wrote…
I started thinking that this context.TODO might also be insufficient for correctness purposes. WDYT? I wonder if I should change the signature to take in ctx for close and rowsIterator.Close in a separate PR.
We could. It'd be better but I'm not sure it's needed. I'm thinking that draining this all the way isn't great. I'm thinking that maybe the differences between the sync and async impls here are not justified.
pkg/sql/internal_result_channel.go, line 184 at r1 (raw file):
Previously, yuzefovich (Yahor Yuzefovich) wrote…
What about this place? I'm starting to think that possibly removing doneCh and using context.CancelFunc might be less error-prone. Thoughts?
That wouldn't have saved us from this one. What would have saved us from this one is never closing the dataCh and having finish do something different. The other thing I've done to save us is to deduplicate the code.
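(A rough sketch of that shape, with hypothetical names and under the assumption that the original race was a send on dataCh racing with its close: finish only signals through doneCh, guarded by a sync.Once so it stays idempotent, and dataCh is never closed.)

package sketch

import "sync"

type ieIteratorResult struct{}

// ieResultChannel is a placeholder for the unified reader/writer.
type ieResultChannel struct {
	dataCh   chan ieIteratorResult
	doneCh   chan struct{}
	doneOnce sync.Once
}

// finish never closes dataCh; it only closes doneCh, and the sync.Once makes
// it safe to call from either side, any number of times.
func (c *ieResultChannel) finish() {
	c.doneOnce.Do(func() {
		close(c.doneCh)
	})
}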
I like the unification, thanks!
Reviewed 2 of 2 files at r2.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner)
pkg/sql/internal_result_channel.go, line 105 at r1 (raw file):
Previously, ajwerner wrote…
We could. It'd be better but I'm not sure it's needed. I'm thinking that draining this all the way isn't great. I'm thinking that maybe the differences between the sync and async impls here are not justified.
I think that draining is still needed for correctness. In the query execution we need to allow for all metadata (e.g. LeafTxnFinalState) to be propagated throughout the flow to the gateway.
What we could improve is: instead of "draining" artificially once we have received enough rows - by letting the query run to completion - we would actually be draining properly - by making DistSQLReceiver change its status to DrainRequested. A possible way to do that is checking whether our custom errIEResultChannelClosed is returned from AddRow, with something like this:
diff --git a/pkg/sql/distsql_running.go b/pkg/sql/distsql_running.go
index 5c2d8d5a3c..dd4c666904 100644
--- a/pkg/sql/distsql_running.go
+++ b/pkg/sql/distsql_running.go
@@ -724,32 +724,38 @@ func (r *DistSQLReceiver) Push(
 		r.tracing.TraceExecRowsResult(r.ctx, r.row)
 		// Note that AddRow accounts for the memory used by the Datums.
 		if commErr := r.resultWriter.AddRow(r.ctx, r.row); commErr != nil {
-			// ErrLimitedResultClosed is not a real error, it is a
-			// signal to stop distsql and return success to the client.
-			if !errors.Is(commErr, ErrLimitedResultClosed) {
-				// Set the error on the resultWriter too, for the convenience of some of the
-				// clients. If clients don't care to differentiate between communication
-				// errors and query execution errors, they can simply inspect
-				// resultWriter.Err(). Also, this function itself doesn't care about the
-				// distinction and just uses resultWriter.Err() to see if we're still
-				// accepting results.
-				r.resultWriter.SetError(commErr)
-
-				// We don't need to shut down the connection
-				// if there's a portal-related error. This is
-				// definitely a layering violation, but is part
-				// of some accepted technical debt (see comments on
-				// sql/pgwire.limitedCommandResult.moreResultsNeeded).
-				// Instead of changing the signature of AddRow, we have
-				// a sentinel error that is handled specially here.
-				if !errors.Is(commErr, ErrLimitedResultNotSupported) {
-					r.commErr = commErr
+			if errors.Is(commErr, errIEResultChannelClosed) {
+				r.status = execinfra.DrainRequested
+			} else {
+				// ErrLimitedResultClosed is not a real error, it is a
+				// signal to stop distsql and return success to the client.
+				if !errors.Is(commErr, ErrLimitedResultClosed) {
+					// Set the error on the resultWriter too, for the convenience of some of the
+					// clients. If clients don't care to differentiate between communication
+					// errors and query execution errors, they can simply inspect
+					// resultWriter.Err(). Also, this function itself doesn't care about the
+					// distinction and just uses resultWriter.Err() to see if we're still
+					// accepting results.
+					r.resultWriter.SetError(commErr)
+
+					// We don't need to shut down the connection
+					// if there's a portal-related error. This is
+					// definitely a layering violation, but is part
+					// of some accepted technical debt (see comments on
+					// sql/pgwire.limitedCommandResult.moreResultsNeeded).
+					// Instead of changing the signature of AddRow, we have
+					// a sentinel error that is handled specially here.
+					if !errors.Is(commErr, ErrLimitedResultNotSupported) {
+						r.commErr = commErr
+					}
 				}
+				// TODO(andrei): We should drain here. Metadata from this query would be
+				// useful, particularly as it was likely a large query (since AddRow()
+				// above failed, presumably with an out-of-memory error).
+				r.status = execinfra.ConsumerClosed
 			}
-			// TODO(andrei): We should drain here. Metadata from this query would be
-			// useful, particularly as it was likely a large query (since AddRow()
-			// above failed, presumably with an out-of-memory error).
-			r.status = execinfra.ConsumerClosed
 		}
 		return r.status
 	}
pkg/sql/internal_result_channel.go, line 93 at r2 (raw file):
// doneCh is used to indicate that the ReadWriter has been closed.
// doneCh is closed under the doneOnce. The doneCh is only used for the
nit: doneCh is now used for both variants.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner and @yuzefovich)
pkg/sql/internal_result_channel.go, line 105 at r1 (raw file):
Previously, yuzefovich (Yahor Yuzefovich) wrote…
I think that draining is still needed for correctness. In the query execution we need to allow for all metadata (e.g. LeafTxnFinalState) to be propagated throughout the flow to the gateway. What we could improve is: instead of "draining" artificially once we have received enough rows - by letting the query run to completion - we would actually be draining properly - by making DistSQLReceiver change its status to DrainRequested. A possible way to do that is checking whether our custom errIEResultChannelClosed is returned from AddRow, with something like this: [see the diff quoted above]
Hmm, are you implying then that what we have here is wrong? The main thing is that now the code will drain results that have already been sent but it will return an error on the sending side for subsequent additions. I can add code to not close the doneCh in async if that sounds right to you.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner)
pkg/sql/internal_result_channel.go, line 105 at r1 (raw file):
Previously, ajwerner wrote…
Hmm, are you implying then that what we have here is wrong? The main thing is that now the code will drain results that have already been sent but it will return an error on the sending side for subsequent additions. I can add code to not close the doneCh in async if that sounds right to you.
Yes, I'm implying that. I originally started typing out that thought explicitly, then persuaded myself that we were doing the correct thing, but now I believe we're doing an incorrect thing.
Generally speaking, the queries that run via the internal executor could end up being distributed. If that's the case, we'll be using a leaf txn for the execution, and it is required for correctness that we propagate LeafTxnFinalState metadata. You are correctly pointing out that by returning an error in addResult, we will shut down the flow immediately, without properly draining the metadata.
I think before this PR we were doing the right thing in the async case but the wrong thing in the sync case, and with the current change we'll be doing the wrong thing in both cases. I now believe that something like the diff above is needed for correctness.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner)
pkg/sql/internal_result_channel.go, line 105 at r1 (raw file):
Previously, yuzefovich (Yahor Yuzefovich) wrote…
Yes, I'm implying that. I originally started typing out that thought explicitly, then persuaded myself that we were doing the correct thing, but now I believe we're doing an incorrect thing.
Generally speaking, the queries that run via the internal executor could end up being distributed. If that's the case, we'll be using a leaf txn for the execution, and it is required for correctness that we propagate LeafTxnFinalState metadata. You are correctly pointing out that by returning an error in addResult, we will shut down the flow immediately, without properly draining the metadata. I think before this PR we were doing the right thing in the async case but the wrong thing in the sync case, and with the current change we'll be doing the wrong thing in both cases. I now believe that something like the diff above is needed for correctness.
My understanding is that if there is an error on the context, that indicates that the consumer gave up on the query execution (kinda like ROLLBACK), so it is ok to perform the "hard" shutdown using the ConsumerClosed status without draining the flow, but if we're returning the errIEResultChannelClosed error to signal to the producer that we don't need any more rows, we must transition into the DrainRequested state.
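(Restating those two shutdown modes as a hedged sketch - this is not the actual DistSQLReceiver.Push logic, and the diff earlier in the thread is closer to the real proposal.)

package sketch

import (
	"github.com/cockroachdb/cockroach/pkg/sql/execinfra"
	"github.com/cockroachdb/errors"
)

// errIEResultChannelClosed is the sentinel introduced by this PR; it is
// redeclared here only so the sketch stands on its own.
var errIEResultChannelClosed = errors.New("ieResultReader closed")

// consumerStatusAfterAddRowError condenses the two cases described above.
func consumerStatusAfterAddRowError(ctxErr, commErr error) execinfra.ConsumerStatus {
	if ctxErr != nil {
		// The consumer gave up on the query (akin to ROLLBACK): a hard
		// shutdown without draining is acceptable.
		return execinfra.ConsumerClosed
	}
	if errors.Is(commErr, errIEResultChannelClosed) {
		// The reader has enough rows, but metadata such as LeafTxnFinalState
		// still has to reach the gateway, so ask the flow to drain.
		return execinfra.DrainRequested
	}
	// Any other communication error: shut down.
	return execinfra.ConsumerClosed
}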
Can you take that over the finish line?
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner)
Of course! I started thinking about this more, and I think we have a similar problem in the non-internal executor path too (#63032).
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner)
Force-pushed from 2d3c4aa to b606d7d.
This seems better than what's on master. Maybe we should merge it to deflake tests?
I agree that it's an improvement. I opened up a separate PR #63032 for that - which actually uncovered an issue with shutting down the changefeed processors. I think I'd rather skip the flaky test temporarily, take the time with #63032 (it is missing a test, and I don't see a good way to write it), and then merge this fix.
Force-pushed from b606d7d to d8f85e2.
Alright, #63032 landed, so I rebased on top of it and unskipped the flaky test. I'll merge once CI is green (assuming I have an implicit approval from Andrew).
Alright, the build is green, thanks Andrew for working with me on this PR! bors r+
Build failed (retrying...): |
Build failed (retrying...): |
Build succeeded: |
The async and sync implementations were too close to justify two structs.
Also, the async behavior of not stopping the writer in case the reader
called close wasn't desirable. This commit unifies the implementation.
It also ensures that we propagate context errors in all cases triggered
by the closure of the done channel. It also makes closing the channel
idempotent.
Additionally, this commit transitions the execution flow into draining
state without setting our custom error on the resultWriter.
Fixes #62948.
Release note: None