colexec: fix spilling queue #58013
Conversation
Force-pushed from 35a1814 to 364361a.
Force-pushed from 364361a to 0c57b12.
Nice to see this 😂 For reference, this is what I did a while ago but never ended up finishing: https://github.com/asubiotto/cockroach/commit/450cecbf71b0004779054f48f8c718b5b01eb393.
Are there any router benchmarks to check out?
Reviewed 6 of 10 files at r1.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto and @yuzefovich)
pkg/col/coldata/testutils.go, line 63 at r1 (raw file):
// 'maybeHasNulls' field, so we override it manually to be 'true' for
// both nulls vectors if it is 'true' for at least one of them. This is
// acceptable since we still check the bitmaps precisely.
Why do we have to do this? Won't the require.Equal check fail in the previous version of this code if maybeHasNulls is different?
pkg/sql/colexec/routers.go, line 115 at r1 (raw file):
// zeroBatchAdded indicates whether a zero-length batch was added to this
// output (meaning that no more batches will be added).
zeroBatchAdded bool
Instead of adding a boolean, I think it's preferable to add this as a state or use one of the states to indicate this.
pkg/sql/colexec/routers.go, line 188 at r1 (raw file):
// alwaysFlush, if set to true, will always flush o.mu.pendingBatch to
// o.mu.data.
alwaysFlush bool
Do we need to add this to the spilling queue? I think it gives us good testing coverage of what happens when we spill in the routers.
pkg/sql/colexec/routers.go, line 319 at r1 (raw file):
// writing rows to a fast output if we have to write to disk for a single
// slow output.
func (o *routerOutputOp) addBatch(ctx context.Context, batch coldata.Batch) bool {
Why is the selection argument being removed? It means we have to do extra copies to set the desired selection vector on a batch. I think the way I handled this is to add an optional selection argument to the spilling queue, which will use the batch's selection vector if the argument is unset. I prefer that, but I'm also biased, so let's discuss
pkg/sql/colexec/spilling_queue.go, line 180 at r1 (raw file):
q.unlimitedAllocator.PerformOperation(q.diskQueueDeselectionScratch.ColVecs(), func() {
	for i := range q.typs {
		q.diskQueueDeselectionScratch.ColVec(i).Copy(
It looks like when we only have in-memory items, we will deselect into a scratch batch, then additionally append this batch into a tailBatch. We should merge these two steps into one, i.e. deselect and append at the same time.
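The single-pass approach suggested above can be sketched with simplified types (plain int slices standing in for coldata vectors; appendSelected is a hypothetical helper, not the real colexec API): copy the selected tuples directly into the destination instead of first deselecting into a scratch batch.

```go
package main

import "fmt"

// appendSelected copies only the tuples chosen by the selection vector
// sel from src into dst in a single pass, merging the deselection and
// append steps instead of materializing an intermediate scratch batch.
func appendSelected(dst, src []int, sel []int) []int {
	for _, idx := range sel {
		dst = append(dst, src[idx])
	}
	return dst
}

func main() {
	src := []int{10, 20, 30, 40}
	sel := []int{0, 2} // selection vector: keep tuples 0 and 2
	fmt.Println(appendSelected(nil, src, sel)) // [10 30]
}
```

This avoids one full copy of the selected tuples compared to deselect-then-append.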
Force-pushed from 0c57b12 to 4aaf068.
The benchmarks are very good (this is somewhat surprising to me, tbh).
I also prototyped keeping a separate selection vector as you suggested below, and the comparison shows a pretty small improvement when avoiding the copying of the selection onto the selection vector of the batch. To me this minor perf improvement doesn't seem worth the confusion of having separate selection things.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto)
pkg/col/coldata/testutils.go, line 63 at r1 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
Why do we have to do this? Won't the require.Equal check fail in the previous version of this code if maybeHasNulls is different?
Without this override, previously it would fail when it shouldn't. Consider two nulls vectors without any nulls set (i.e. all elements are valid): the bitmaps are a bunch of 0xFF, and the only difference between the two vectors is that maybeHasNulls is set to different values. Such vectors should be treated as equal by this check, but the check would fail. This came up because of the way we track all input tuples in TestSpillingQueue - we create a window into a huge batch, and although there might not be any null values within the window, if there are some in the whole vector (outside of the window), maybeHasNulls is still set to true.
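A minimal sketch of the equality semantics described above, using hypothetical simplified types (nulls and nullsEqual are illustrative stand-ins, not the real coldata API): two vectors with identical validity bitmaps compare equal even when their maybeHasNulls hints differ.

```go
package main

import (
	"bytes"
	"fmt"
)

// nulls is a simplified stand-in for a nulls vector: a validity bitmap
// plus a maybeHasNulls hint that may be conservatively stale (e.g. set
// to true for a window into a larger batch whose nulls all lie outside
// the window).
type nulls struct {
	maybeHasNulls bool
	bitmap        []byte // 1 bit per tuple; 1 = valid, 0 = NULL
}

// nullsEqual treats two nulls vectors as equal when their bitmaps match,
// regardless of the maybeHasNulls hint, mirroring the override in the
// review: the bitmaps are checked precisely, the hint is ignored.
func nullsEqual(a, b nulls) bool {
	return bytes.Equal(a.bitmap, b.bitmap)
}

func main() {
	a := nulls{maybeHasNulls: false, bitmap: []byte{0xFF, 0xFF}}
	b := nulls{maybeHasNulls: true, bitmap: []byte{0xFF, 0xFF}} // stale hint
	fmt.Println(nullsEqual(a, b)) // true: identical bitmaps, differing hints
}
```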
pkg/sql/colexec/routers.go, line 115 at r1 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
Instead of adding a boolean, I think it's preferable to add this as a state or use one of the states to indicate this.
Done.
pkg/sql/colexec/routers.go, line 188 at r1 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
Do we need to add this to the spilling queue? I think it gives us good testing coverage of what happens when we spill in the routers.
I think you might be confused about the meaning of alwaysFlush on master - it disables the coalescing behavior in the router output. So are you wondering whether it is worth adding this knob to the spilling queue (to disable the newly-added coalescing behavior)? I'm not sure - it doesn't seem useful to me if we're always running with the coalescing behavior present in production. Or am I missing something?
pkg/sql/colexec/routers.go, line 319 at r1 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
Why is the selection argument being removed? It means we have to do extra copies to set the desired selection vector on a batch. I think the way I handled this is to add an optional selection argument to the spilling queue, which will use the batch's selection vector if the argument is unset. I prefer that, but I'm also biased, so let's discuss
Personally, I don't like having a separate selection since all batches might already have the selection vector set, so it could be quite confusing - which one is it? Are all tuples in the separate selection also selected according to the selection vector on the batch? Should we check that case?
I find it a lot cleaner to set the desired selection on the batch explicitly.
I agree that performance should be taken into consideration, and I'll run a benchmark to see whether it is a noticeable hit, but my guess is that it won't be.
Update: benchmarks are in the main thread.
pkg/sql/colexec/spilling_queue.go, line 180 at r1 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
This looks like when we only have in-memory items, we will deselect into a scratch batch, then additionally append this batch into a
tailBatch
. We should merge these two steps into one, i.e. deselect and append at the same time.
Note that this code block applies only if we're spilling the batch to disk. In the code that deals with the in-memory batches, the deselection is done during the append.
Reviewed 3 of 10 files at r1, 1 of 1 files at r2.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto and @yuzefovich)
pkg/sql/colexec/routers.go, line 188 at r1 (raw file):
Previously, yuzefovich (Yahor Yuzefovich) wrote…
I think you might be confused about the meaning of alwaysFlush on master - it disables the coalescing behavior in the router output. So are you wondering whether it is worth adding this knob to the spilling queue (to disable the newly-added coalescing behavior)? I'm not sure - it doesn't seem useful to me if we're always running with the coalescing behavior present in production. Or am I missing something?
It's useful to test cases in which we actually spill to disk (even though it's writing to an in-memory file system) in unit tests without having to write a bunch of data. I still believe it's useful for router tests and might be so for other uses of the spilling queue as well now that in-memory buffering is delegated to the queue.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto)
pkg/sql/colexec/routers.go, line 188 at r1 (raw file):
It's useful to test cases in which we actually spill to disk (even though it's writing to an in-memory file system) in unit tests without having to write a bunch of data. I still believe it's useful for router tests and might be so for other uses of the spilling queue as well now that in-memory buffering is delegated to the queue.
Note that there is no coalescing behavior going on when the batch is enqueued to the disk queue, so the old alwaysFlush knob only disables the coalescing behavior when the batch is kept in the in-memory buffer. Currently, this knob is used only in the TestHashRouterOneOutput unit test, and with the knob removed we spill to disk the same number of times as with the knob present.
I have thought some more about the need for this knob, and I still don't see it. Can you come up with a concrete example where disabling the coalescing behavior of the in-memory buffer provides us new test coverage that we don't already get otherwise?
Reviewed 3 of 10 files at r1, 1 of 1 files at r2.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto and @yuzefovich)
pkg/sql/colexec/routers.go, line 188 at r1 (raw file):
Sorry, apparently I didn't end up sending this comment:
so the old alwaysFlush knob only disables the coalescing behavior when the batch is kept in the in-memory buffer.
But now we have coalescing behavior in the spilling queue itself, right?
It's useful to test cases in which we actually spill to disk (even though it's writing to an in-memory file system) in unit tests without having to write a bunch of data. I still believe it's useful for router tests and might be so for other uses of the spilling queue as well now that in-memory buffering is delegated to the queue.
Maybe a more concrete example of what I'm thinking will be more productive:
Assume you write tuple A to the spilling queue and then tuple B. With the spilling queue's in-memory coalescing, tuple A and tuple B are never Enqueued to the underlying disk queue. However, with alwaysFlush, both of them are. I guess technically the disk queue will do its own in-memory buffering, but having this alwaysFlush testing knob allows us to test code paths that Enqueue and Dequeue from the disk queue.
Force-pushed from 4aaf068 to 118f2fc.
Added a knob to limit the number of batches added to the in-memory buffer, as we discussed.
Force-pushed from 98209de to a7055f2.
Reviewed 6 of 6 files at r3.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @asubiotto and @yuzefovich)
pkg/sql/colexec/spilling_queue.go, line 76 at r3 (raw file):
// numEnqueues tracks the number of times enqueue() has been called with
// non-zero batch.
numEnqueues int
Nit: This might be a bit specific. Consider making it a testing callback (onEnqueue) that can optionally be set.
Force-pushed from a7055f2 to d92d7c8.
TFTR!
bors r+
Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @asubiotto)
pkg/sql/colexec/routers.go, line 188 at r1 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
Sorry, apparently I didn't end up sending this comment:
so the old alwaysFlush knob only disables the coalescing behavior when the batch is kept in the in-memory buffer.
But now we have coalescing behavior in the spilling queue itself, right?
It's useful to test cases in which we actually spill to disk (even though it's writing to an in-memory file system) in unit tests without having to write a bunch of data. I still believe it's useful for router tests and might be so for other uses of the spilling queue as well now that in-memory buffering is delegated to the queue.
Maybe a more concrete example of what I'm thinking will be more productive:
Assume you write tuple A to the spilling queue and then tuple B. With the spilling queue's in-memory coalescing, tuple A and tuple B are never Enqueued to the underlying disk queue. However, with alwaysFlush, both of them are. I guess technically the disk queue will do its own in-memory buffering, but having this alwaysFlush testing knob allows us to test code paths that Enqueue and Dequeue from the disk queue.
Done.
pkg/sql/colexec/spilling_queue.go, line 76 at r3 (raw file):
Previously, asubiotto (Alfonso Subiotto Marqués) wrote…
Nit: This might be a bit specific. Consider making it a testing callback (onEnqueue) that can optionally be set.
Done.
Build failed (retrying...)
Need to fix the test. bors r-
Canceled.
Force-pushed from d92d7c8 to e2d602e.
bors r+
Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @asubiotto)
pkg/sql/colexec/spilling_queue.go, line 76 at r3 (raw file):
Previously, yuzefovich (Yahor Yuzefovich) wrote…
Done.
I reverted the change to the original version because I don't see a clean way of forcing the spilling queue to use the disk queue from inside of the callback (simply calling maybeSpillToDisk doesn't work). I think it is reasonable because in case we need to have a more general callback, we can introduce it then.
Build succeeded.
This commit refactors the enqueue method of the spilling queue to deep-copy the passed-in batches if they are kept in memory. The previous behavior was suboptimal because it was forcing the caller to always allocate a new batch. Additionally, the spilling queue will now perform a coalescing step by attempting to append as many tuples to the tail in-memory batch as possible. The in-memory batches are allocated with dynamically increasing capacity.
This allows us to significantly simplify the code of the router outputs, which were performing the coalescing step previously.
Additionally, this commit fixes a couple of uses of the enqueue method (the router outputs and the merge joiner) in which they forgot to enqueue a zero-length batch, which is necessary when the disk queue is initialized.
Fixes: #47062.
Release note: None
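The coalescing step described in the commit message can be sketched with toy types (batch, spillingQueue, enqueue, and batchCapacity are all illustrative stand-ins, not the real colexec implementation): incoming tuples are deep-copied onto the tail in-memory batch until it fills up, so small batches never reach the disk queue one by one.

```go
package main

import "fmt"

const batchCapacity = 4 // hypothetical max tuples per in-memory batch

// batch is a toy stand-in for a columnar batch: just a slice of tuple IDs.
type batch []int

// spillingQueue sketches the coalescing enqueue: tuples are appended to
// the tail in-memory batch until it is full, and only then is a new
// batch started.
type spillingQueue struct {
	inMemory []batch
}

func (q *spillingQueue) enqueue(b batch) {
	for _, tuple := range b {
		n := len(q.inMemory)
		if n == 0 || len(q.inMemory[n-1]) == batchCapacity {
			// Tail batch is full (or absent): start a new one with a
			// fixed capacity; the real queue grows capacity dynamically.
			q.inMemory = append(q.inMemory, make(batch, 0, batchCapacity))
			n++
		}
		// Deep-copy the tuple into the tail batch (the coalescing step),
		// so the caller is free to reuse its own batch.
		q.inMemory[n-1] = append(q.inMemory[n-1], tuple)
	}
}

func main() {
	var q spillingQueue
	q.enqueue(batch{1, 2})
	q.enqueue(batch{3, 4})
	q.enqueue(batch{5, 6})
	// Three length-2 batches coalesce into [1 2 3 4] and [5 6].
	fmt.Println(len(q.inMemory)) // 2
}
```

Without the coalescing step, the same three enqueues would keep three length-2 batches, which is the allocation pattern the refactor avoids.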