
distsql: make the number of DistSQL runners dynamic #84946

Merged 2 commits into cockroachdb:master on Aug 1, 2022

Conversation

@yuzefovich (Member) commented Jul 22, 2022

distsql: make the number of DistSQL runners dynamic

This commit improves the infrastructure around the pool of "DistSQL
runners" that are used for issuing SetupFlow RPCs in parallel.
Previously, we had a hard-coded pool of 16 goroutines, which was
probably insufficient in many cases. This commit changes the default
pool size to 4 x N(CPUs) so that it is proportional to how beefy the
node is (under the expectation that the larger the node, the more
distributed queries it will be handling). Four was chosen as the
multiple so that we keep the previous default of 16 on machines with
4 CPUs.

Additionally, this commit introduces a mechanism to dynamically adjust
the number of runners based on a cluster setting. Whenever the setting
is reduced, some of the workers are stopped; whenever it is increased,
new workers are spun up accordingly. A coordinator goroutine listens on
two channels: one signaling server quiescence and another carrying the
new target pool size. Whenever a new target size is received, the
coordinator spins up or shuts down one worker at a time until that
target size is reached. The workers, however, never access the server
quiescence channel directly; instead, each relies on the coordinator to
tell it to exit (either by closing a shared channel on quiescence or by
sending a single message on it when the target size is decreased).
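A minimal sketch of that coordinator/worker shape (all names here are
hypothetical illustrations, not the actual CockroachDB code):

package runnerpool

type coordinator struct {
	// quiescing is closed when the server shuts down. Only the
	// coordinator selects on it; workers never see it.
	quiescing chan struct{}
	// newTarget carries the desired pool size whenever the cluster
	// setting changes.
	newTarget chan int
	// stopWorker is how the coordinator tells workers to exit: one
	// message per worker on a downsize, or a close() on quiescence.
	stopWorker chan struct{}
	// work carries the queued SetupFlow requests.
	work chan func()
	// numWorkers is only touched by the coordinator goroutine.
	numWorkers int
}

func (c *coordinator) run() {
	for {
		select {
		case <-c.quiescing:
			// Tell every worker to exit at once.
			close(c.stopWorker)
			return
		case target := <-c.newTarget:
			// Adjust the pool one worker at a time until the new
			// target size is reached.
			for c.numWorkers < target {
				go c.worker()
				c.numWorkers++
			}
			for c.numWorkers > target {
				c.stopWorker <- struct{}{}
				c.numWorkers--
			}
		}
	}
}

func (c *coordinator) worker() {
	for {
		select {
		case <-c.stopWorker:
			// Exits both on a downsize (single message received) and
			// on quiescence (channel closed).
			return
		case fn := <-c.work:
			fn()
		}
	}
}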

Fixes: #84459.

Release note: None

distsql: change the flow setup code a bit

Previously, when setting up a distributed plan, we would wait for all
SetupFlow RPCs to come back before setting up the flow on the gateway.
Most likely (in the happy scenario) all of those RPCs will be
successful, so we can parallelize the happy path a bit by setting up
the local flow while the RPCs are in flight, which is what this commit
does. This seems especially beneficial given the change in the previous
commit to increase the number of DistSQL runners on beefy machines: we
are now more likely to issue SetupFlow RPCs asynchronously.

Release note: None
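A hedged sketch of that reordering, with hypothetical stand-ins for the
real SetupFlow RPC and the gateway's local setup:

package sketch

import "context"

type flowSpec struct{ nodeID int }

// issueSetupFlowRPC and setupLocalFlow are hypothetical stand-ins for
// the real SetupFlow RPC and the gateway's local flow setup.
func issueSetupFlowRPC(ctx context.Context, f flowSpec) error { return nil }
func setupLocalFlow(ctx context.Context) error                { return nil }

func setupAllFlows(ctx context.Context, remote []flowSpec) error {
	// Kick off the SetupFlow RPCs without waiting for them.
	resultCh := make(chan error, len(remote))
	for _, f := range remote {
		f := f
		go func() { resultCh <- issueSetupFlowRPC(ctx, f) }()
	}
	// Set up the flow on the gateway while the RPCs are in flight; in
	// the happy path this overlaps with the network round trips.
	if err := setupLocalFlow(ctx); err != nil {
		return err
	}
	// Only now wait for the remote setups to come back.
	for range remote {
		if err := <-resultCh; err != nil {
			return err
		}
	}
	return nil
}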

@cockroach-teamcity (Member) commented: This change is Reviewable

@yuzefovich force-pushed the distsql-runners branch 2 times, most recently from b26e334 to b8f1343 on July 22, 2022 23:53
@yuzefovich (Member, Author) commented:

cc @dt I'm curious if you could take a look at the first commit - this is what I was thinking about when asking on Slack last week.

var settingDistSQLRunnersCPUMultiple = settings.RegisterIntSetting(
settings.TenantWritable,
"sql.distsql.runners_cpu_multiple",
"the value multiplied by the number of CPUs on a node to determine "+
@dt (Member) commented on pkg/sql/distsql_running.go line 65 at r1:

Do we ever imagine wanting to set this to less than numCores?

You could have the value just be the desired worker-per-node count, and
then have the default value be 4*numCores?
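Taking this suggestion together with the default that appears in a
later hunk (4*int64(runtime.GOMAXPROCS(0))), the resulting shape would
look roughly like the following sketch; the setting name, description,
and validation function here are guesses, not the exact merged code:

package sql

import (
	"runtime"

	"github.com/cockroachdb/cockroach/pkg/settings"
	"github.com/cockroachdb/errors"
)

var settingDistSQLNumRunners = settings.RegisterIntSetting(
	settings.TenantWritable,
	"sql.distsql.num_runners",
	"determines the number of DistSQL runner goroutines used for "+
		"issuing SetupFlow RPCs",
	// The default of 4x the number of CPUs keeps the previous
	// hard-coded value of 16 on machines with 4 CPUs.
	4*int64(runtime.GOMAXPROCS(0)),
	func(v int64) error {
		if v < 0 {
			return errors.New("cannot be set to a negative value")
		}
		return nil
	},
)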

return
}
}
// Whenever the corresponding setting is updated, we need to notify the
@dt (Member) commented on pkg/sql/distsql_running.go line 160 at r2:

If you were so inclined, you could expand this comment with a note of
the fact that initRunners is only called once per server lifetime, so
this won't leak an unbounded number of onChange callbacks, since in
general an onChange not in an init() is suspect.
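As a sketch of why that holds (all names hypothetical): initRunners is
called exactly once per server lifetime, so exactly one callback is
ever registered.

func (dsp *DistSQLPlanner) initRunners(ctx context.Context) {
	// ... spin up the initial pool of workers ...

	// Safe: initRunners is called once per server lifetime, so this
	// registers exactly one onChange callback rather than leaking an
	// unbounded number of them.
	settingDistSQLNumRunners.SetOnChange(&dsp.st.SV, func(ctx context.Context) {
		newSize := int(settingDistSQLNumRunners.Get(&dsp.st.SV))
		dsp.runnerCoordinator.newTarget <- newSize
	})
}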

@yuzefovich (Member, Author) left a comment:

Thanks for taking a look!

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @dt)


pkg/sql/distsql_running.go line 65 at r1 (raw file):

Previously, dt (David Taylor) wrote…

Do we ever imagine wanting to set this to less than numCores?

You could have the value just be the desired worker-per-node count, and
then have the default value be 4*numCores?

Yeah, good point, done.


pkg/sql/distsql_running.go line 160 at r2 (raw file):

Previously, dt (David Taylor) wrote…

If you were so inclined, you could expand this comment with a note of
the fact that initRunners is only called once per server lifetime, so
this won't leak an unbounded number of onChange callbacks, since in
general an onChange not in an init() is suspect.

Done.

@yuzefovich marked this pull request as ready for review July 25, 2022 22:22
@yuzefovich requested a review from a team as a code owner July 25, 2022 22:22
@yuzefovich requested reviews from michae2 and cucaroach July 25, 2022 22:23
@yuzefovich requested a review from a team as a code owner July 26, 2022 23:22
@yuzefovich removed the request for review from a team July 26, 2022 23:22
@yuzefovich (Member, Author) commented:

The last commit in this PR exposed a possible data race which is now fixed in the first commit. That first commit is in #85101 and should be ignored here.

@yuzefovich (Member, Author) commented:

Rebased on top of #85101, so now all commits in the PR are relevant.

@cucaroach (Contributor) left a comment:

Reviewed 2 of 3 files at r5, 3 of 3 files at r7.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @cucaroach, @dt, @michae2, and @yuzefovich)


pkg/sql/distsql_running.go line 72 at r7 (raw file):

	// original value of 16 on machines with 4 CPUs.
	4*int64(runtime.GOMAXPROCS(0)), /* defaultValue */
	func(v int64) error {

What happens if this is set to 0? Does the setup flow phase fail or something, with an error getting sent back to the gateway?

@yuzefovich (Member, Author) left a comment:

TFTRs!

bors r+

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @cucaroach, @dt, and @michae2)


pkg/sql/distsql_running.go line 72 at r7 (raw file):

Previously, cucaroach (Tommy Reilly) wrote…

What happens if this is set to 0? Does the setup flow phase fail or something, with an error getting sent back to the gateway?

In that case, the setup goroutine on the gateway issues all SetupFlow RPCs for remote nodes sequentially, followed by performing the setup on the gateway itself. The pool of DistSQL runners allows those RPCs to be issued in parallel, but if the pool is used up, the setup goroutine doesn't wait for a worker to free up and instead does the "work" itself.
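A sketch of that hand-off with hypothetical names: the send to the
runner channel is non-blocking, so a pool size of zero simply means the
default branch always runs.

package sketch

// runnerRequest stands in for a queued SetupFlow request.
type runnerRequest struct {
	// run issues the SetupFlow RPC.
	run func()
}

// dispatch hands req to the runner pool without ever blocking on it.
func dispatch(runnerCh chan<- runnerRequest, req runnerRequest) {
	select {
	case runnerCh <- req:
		// An idle runner picked up the request; the RPC is issued in
		// parallel with the rest of the setup.
	default:
		// No idle runner is available (including when the pool size is
		// 0): the setup goroutine does the "work" itself, issuing the
		// RPC synchronously.
		req.run()
	}
}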

@craig (bot) merged commit 314baa5 into cockroachdb:master on Aug 1, 2022

@craig (bot) commented Aug 1, 2022:

Build succeeded.
