sql: cleanup fetchers #83010

Merged 2 commits from cleanup-fetchers into cockroachdb:master on Jun 24, 2022
Conversation

@yuzefovich (Member) commented Jun 16, 2022

sql: clean up Fetcher interfaces a bit

This commit removes a couple of arguments (`traceKV`,
`forceProductionBatchSize`) from the `StartScan*` fetcher methods in
favor of passing them on `Init`. Additionally, several fields are
removed from `row.Fetcher` in favor of accessing the args struct
directly.

The only meaningful change is that the `traceKV` flag is now correctly
propagated in the column backfiller code path when it is set up in
the distributed case.

Release note: None
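
To make the shape of this change concrete, here is a minimal sketch (not the actual CockroachDB signatures; every type, field, and function here is illustrative) of moving the per-call knobs into the init-time args struct:

```go
package fetchargs

import (
	"context"
	"fmt"
)

// fetcherInitArgs is a hypothetical stand-in for the fetcher's args struct;
// the real struct has many more fields.
type fetcherInitArgs struct {
	TraceKV                  bool // previously a parameter of every StartScan* call
	ForceProductionBatchSize bool // likewise moved from StartScan* to Init
}

// fetcher reads these knobs from args directly instead of copying them into
// separate fields.
type fetcher struct {
	args fetcherInitArgs
}

func (f *fetcher) Init(args fetcherInitArgs) {
	f.args = args
}

// StartScan no longer takes traceKV / forceProductionBatchSize parameters;
// it reads them from the args provided at Init time.
func (f *fetcher) StartScan(ctx context.Context, spans []string) error {
	if f.args.TraceKV {
		fmt.Printf("Scan %v\n", spans) // stand-in for KV tracing output
	}
	return nil
}
```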

sql: clean up the lifecycle of fetchers

Previously, the lifecycle of different fetcher objects was a mess.
Consider the sequence of fetchers when used by the join reader with the
old non-streamer code path:
`rowexec.joinReader` -> `row.Fetcher` -> `row.KVFetcher` -> `row.txnKVFetcher`.
`row.Fetcher` was initialized once, but then on every call to
`StartScan`, we would create a new `row.txnKVFetcher` and then wrap it
with a new `row.KVFetcher` (during an internal `StartScanFrom` call). In
other words, throughout the lifetime of the join reader, its fetcher
would create a new pair of objects for each input row batch. This setup
is very unintuitive and previously led to some bugs with memory
accounting.

I believe such a setup was created organically, without giving too much
thought to it. Some considerations should be pointed out:

  • in some cases, we have some state from the previous fetch that we want
    to discard
  • in some cases, we provide the `row.Fetcher` with a custom
    `KVBatchFetcher` implementation.

This commit refactors all of this to make it much more sane. In
particular, we now create only a single `row.KVFetcher` object that is
powered by a single `row.txnKVFetcher` or `row.txnKVStreamer`
implementation throughout the whole lifetime of the `row.Fetcher`. In the
main code path, the callers are now expected to use only the `StartScan`
method, which correctly discards unnecessary state from the previous
call. This is achieved by adding a new method to the `KVBatchFetcher`
interface.

This commit supports the use case with custom `KVBatchFetcher`s too by
asking the caller to explicitly specify a knob during the initialization
of the `row.Fetcher` - in that case, only `StartScanFrom` calls are
allowed. There, we still close the `KVBatchFetcher` from the previous
call (tbh I believe this is not necessary since these custom
`KVBatchFetcher`s don't have anything to clean up, but it's probably
safer to keep the old behavior here).

Furthermore, this commit pushes some arguments from `StartScan` into
`Init` - most notably, the txn is now passed only once. However, there
are some use cases (like a column backfill, done in chunks) where the
txn might change throughout the lifetime of the fetcher - we allow
updating it later if needed.

This also allows us to unify the streamer and the non-streamer
code paths - to remove some of the duplicated code as well as push the
usage of the streamer lower in the stack.

Release note: None
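
A minimal sketch of the new lifecycle, with heavily simplified, hypothetical types (the real interfaces also carry spans, batch limits, and memory accounting): one `KVBatchFetcher` implementation lives for the whole lifetime of the fetcher, and each `StartScan` resets it via the newly added `SetupNextFetch`-style method instead of allocating a fresh `txnKVFetcher`/`KVFetcher` pair.

```go
package fetchlifecycle

import "context"

// kvBatchFetcher is a hypothetical reduction of the KVBatchFetcher interface.
// SetupNextFetch is the newly added method that discards state left over from
// the previous fetch so the same object can serve the next StartScan call.
type kvBatchFetcher interface {
	SetupNextFetch(ctx context.Context, spans []string) error
	NextBatch(ctx context.Context) (moreKVs bool, err error)
	Close(ctx context.Context)
}

// kvFetcher wraps a single kvBatchFetcher (a txnKVFetcher- or
// txnKVStreamer-like implementation) for its entire lifetime.
type kvFetcher struct {
	batchFetcher kvBatchFetcher
}

// rowFetcher owns one kvFetcher, created once in Init and reused.
type rowFetcher struct {
	kv *kvFetcher
}

// StartScan reuses the long-lived batch fetcher instead of wrapping a new one
// per input row batch, which is what the old code path effectively did.
func (rf *rowFetcher) StartScan(ctx context.Context, spans []string) error {
	return rf.kv.batchFetcher.SetupNextFetch(ctx, spans)
}
```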

@cockroach-teamcity (Member) commented:
This change is Reviewable

@yuzefovich yuzefovich force-pushed the cleanup-fetchers branch 7 times, most recently from f13171d to f4f984b Compare June 17, 2022 02:22
@yuzefovich yuzefovich requested review from michae2, rharding6373 and a team June 17, 2022 02:22
@yuzefovich yuzefovich marked this pull request as ready for review June 17, 2022 02:22
@yuzefovich yuzefovich requested review from a team as code owners June 17, 2022 02:22
@yuzefovich yuzefovich requested review from a team and stevendanna and removed request for a team June 17, 2022 02:22
@yuzefovich (Member, Author) commented:
This was inspired by Michael's suggestion.

@miretskiy (Contributor) left a comment

changefeed changes are 👍

@rharding6373 (Collaborator) left a comment

Nice cleanup!

Reviewed 20 of 20 files at r1, 29 of 29 files at r2.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @michae2, @stevendanna, and @yuzefovich)


pkg/sql/row/fetcher.go line 391 at r2 (raw file):

// SetTxn updates the Fetcher to use the provided txn.
func (rf *Fetcher) SetTxn(txn *kv.Txn) {
	rf.setTxnAndSendFn(txn, makeKVBatchFetcherDefaultSendFunc(txn))

What if there was already a sendFn set? Wouldn't this overwrite it?


pkg/sql/row/kv_batch_streamer.go line 83 at r2 (raw file):

) error {
	if bytesLimit != rowinfra.NoBytesLimit {
		return errors.AssertionFailedf("unexpectedly non-zero bytes limit for txnKVStreamer")

nit: s/unexpectedly/unexpected


pkg/sql/row/kv_fetcher.go line 272 at r2 (raw file):

// SetupNextFetch overrides the same method from the wrapped KVBatchFetcher in
// order reset this KVFetcher.

nit: s/in order/in order to

@yuzefovich (Member, Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @michae2, @rharding6373, and @stevendanna)


pkg/sql/row/fetcher.go line 391 at r2 (raw file):

Previously, rharding6373 (Rachael Harding) wrote…

What if there was already a sendFn set? Wouldn't this overwrite it?

Yes, this would overwrite it; I added some commentary.

This method is exposed to support the column backfill use case, where a single row.Fetcher object is reused many times throughout the column backfill operation, which is broken down into chunks, each processed under a new txn. The previous usage pattern of creating a new row.txnKVFetcher for each chunk was doing essentially this - creating a new sendFn for each chunk.
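
For illustration, a hypothetical backfill loop (all names and types here are invented for the sketch, not the real kv.Txn / row.Fetcher APIs) that reuses one fetcher and rebinds it to a fresh txn per chunk via a SetTxn-style method:

```go
package backfillsketch

import "context"

// txn and fetcher are illustrative stand-ins.
type txn struct{}

type fetcher struct{ currentTxn *txn }

// SetTxn mirrors the idea from the PR: rebind the fetcher (and, in the real
// code, its sendFn) to a new transaction without re-initializing it.
func (f *fetcher) SetTxn(t *txn) { f.currentTxn = t }

type chunk struct{ startKey, endKey string }

// processChunk would fetch and backfill the rows in [startKey, endKey) using
// f.currentTxn; it is a stub here.
func processChunk(ctx context.Context, f *fetcher, c chunk) error {
	return nil
}

// backfill reuses a single fetcher across all chunks, giving each chunk its
// own transaction, as the column backfiller does.
func backfill(ctx context.Context, f *fetcher, chunks []chunk) error {
	for _, c := range chunks {
		f.SetTxn(&txn{}) // new txn per chunk
		if err := processChunk(ctx, f, c); err != nil {
			return err
		}
	}
	return nil
}
```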

@michae2 (Collaborator) left a comment

Thank you for doing this. It turned out really, really nice. I was not expecting all the unifications of streamer and non-streamer code!

I have a few nits. I'm almost done reading and will finish tonight.

Reviewed 20 of 20 files at r1, 14 of 29 files at r2, 3 of 3 files at r3, all commit messages.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @rharding6373, @stevendanna, and @yuzefovich)


pkg/sql/row/fetcher.go line 227 at r3 (raw file):

	// WillUseCustomKVFetcher, if true, indicates that the caller will only use
	// StartScanFrom() method and will be providing its own KVFetcher.
	WillUseCustomKVFetcher bool

nit: Maybe this is pedantic, but wouldn't WillUseCustomKVBatchFetcher be more accurate? And likewise, shouldn't the comment say "its own KVBatchFetcher" instead of "its own KVFetcher"?


pkg/sql/row/fetcher.go line 403 at r3 (raw file):

// setTxnAndSendFn peeks inside of the KVFetcher to update the underlying
// txnKVFetcher with the new txn and sendFn.
func (rf *Fetcher) setTxnAndSendFn(txn *kv.Txn, sendFn sendFunc) {

Could we do this in a txnKVFetcher method rather than here? I know it's a pain to plumb another method call down to KVFetcher / KVBatchFetcher / txnKVFetcher, but reaching into the txnKVFetcher like this seems like a layer violation.

Or maybe a compromise would be: we put the changes to txnKVFetcher in a txnKVFetcher method, but then we peek inside kvFetcher.KVBatchFetcher.(*txnKVFetcher) to call the method instead of adding the method to KVBatchFetcher and KVFetcher? That at least shrinks the surface area of the layer violation slightly. Would that be reasonable?


pkg/sql/row/fetcher.go line 500 at r3 (raw file):

) error {
	if rf.args.StreamingKVFetcher != nil {
		return errors.AssertionFailedf("StartInconsistentScan is called instead of StartScanFrom")

nit: Should this say "... instead of StartScan" instead of "... instead of StartScanFrom"?


pkg/sql/row/kv_fetcher.go line 138 at r3 (raw file):

	diskBuffer kvstreamer.ResultDiskBuffer,
) *KVFetcher {
	streamer := kvstreamer.NewStreamer(

Nice that you can push this lower!

@yuzefovich (Member, Author) left a comment

Indeed, I'm quite happy with this refactor, thanks for the idea!

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @michae2, @rharding6373, and @stevendanna)


pkg/sql/row/fetcher.go line 227 at r3 (raw file):

Previously, michae2 (Michael Erickson) wrote…

nit: Maybe this is pedantic, but wouldn't WillUseCustomKVBatchFetcher be more accurate? And likewise, shouldn't the comment say "its own KVBatchFetcher" instead of "its own KVFetcher"?

You're right, fixed. I guess I wanted to make things shorter, but it seems better to be precise here.


pkg/sql/row/fetcher.go line 403 at r3 (raw file):

Previously, michae2 (Michael Erickson) wrote…

Could we do this in a txnKVFetcher method rather than here? I know it's a pain to plumb another method call down to KVFetcher / KVBatchFetcher / txnKVFetcher, but reaching into the txnKVFetcher like this seems like a layer violation.

Or maybe a compromise would be: we put the changes to txnKVFetcher in a txnKVFetcher method, but then we peek inside kvFetcher.KVBatchFetcher.(*txnKVFetcher) to call the method instead of adding the method to KVBatchFetcher and KVFetcher? That at least shrinks the surface area of the layer violation slightly. Would that be reasonable?

I agree that this is rather hacky, and I like your suggestion - done. I also added some assertions to make things more explicit (in the future, we might need to do a similar thing for txnKVStreamer, but for now we only need to support updating the txn on txnKVFetcher).
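
A rough sketch of that compromise, with simplified, hypothetical types and names: the row fetcher type-asserts down to the concrete txnKVFetcher instead of widening the KVBatchFetcher/KVFetcher interfaces, and an assertion failure makes any other implementation (e.g. a txnKVStreamer) fail loudly.

```go
package setsendfnsketch

import (
	"context"
	"fmt"
)

type sendFunc func(ctx context.Context) error

// kvBatchFetcher stands in for the KVBatchFetcher interface; note that it does
// NOT grow a txn-updating method.
type kvBatchFetcher interface {
	NextBatch(ctx context.Context) (moreKVs bool, err error)
}

// txnKVFetcher is the only implementation that currently supports swapping in
// a new sendFn (and thus a new txn).
type txnKVFetcher struct {
	sendFn sendFunc
}

func (f *txnKVFetcher) NextBatch(ctx context.Context) (bool, error) { return false, nil }

func (f *txnKVFetcher) setSendFn(fn sendFunc) { f.sendFn = fn }

type kvFetcher struct {
	batchFetcher kvBatchFetcher
}

type rowFetcher struct {
	kv *kvFetcher
}

// setSendFn peeks at the concrete type wrapped by the kvFetcher and fails
// loudly if it is not a txnKVFetcher, mirroring the added assertions.
func (rf *rowFetcher) setSendFn(fn sendFunc) error {
	f, ok := rf.kv.batchFetcher.(*txnKVFetcher)
	if !ok {
		return fmt.Errorf("unexpected KVBatchFetcher implementation %T", rf.kv.batchFetcher)
	}
	f.setSendFn(fn)
	return nil
}
```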

@michae2 (Collaborator) left a comment

:lgtm_strong: 💯 🏍️

Reviewed 29 of 29 files at r4, 29 of 29 files at r5, all commit messages.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @rharding6373 and @stevendanna)

@yuzefovich (Member, Author) commented:

@rharding6373 do you wanna take another look?

@yuzefovich yuzefovich dismissed rharding6373’s stale review June 24, 2022 16:42

All feedback has been addressed.

@yuzefovich (Member, Author) commented:

I'm assuming that Rachael is on board with this change.

Thanks for the reviews!

bors r+

@craig (bot, Contributor) commented Jun 24, 2022

Build succeeded:

@craig craig bot merged commit 324c837 into cockroachdb:master Jun 24, 2022
@yuzefovich yuzefovich deleted the cleanup-fetchers branch June 24, 2022 17:59