feat: take the concurrencyLimit from feature flags and keep in dependencies #4564
Conversation
Interesting, it seems reasonable to me.
I am curious if @nathanielc has thoughts on this approach.
if concurrencyQuota > int(execOptions.ConcurrencyLimit) {
	concurrencyQuota = int(execOptions.ConcurrencyLimit)
} else if concurrencyQuota == 0 {
	concurrencyQuota = 1
}
Curious about this branch here. If we had a trivial query whose execution graph just had a ReadWindowAggregate node, I guess concurrencyQuota could be 0. Do we require it to be positive even if there are no non-sources?
I might be mistaken, but my guess is there needs to be at least one goroutine to work the consecutive transport belonging to the source nodes. The source nodes themselves, I think, just deposit messages to the outgoing dataset, and that's as far as the source goroutines take it. There needs to be a dispatcher goroutine that reads those messages and writes to the CSV writer.
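A minimal sketch of that shape, with hypothetical names (this is not Flux's actual transport API): the source goroutine only deposits messages, so at least one dispatcher goroutine must exist to drain them, even for a source-only graph.

```go
package main

import "fmt"

// message is a stand-in for what a source node deposits on its
// outgoing dataset (hypothetical type, not the real Flux transport).
type message struct{ payload string }

// produce mimics a source node: it deposits messages on its outgoing
// channel and then finishes; it does not deliver them any further.
func produce(out chan<- message, n int) {
	defer close(out)
	for i := 0; i < n; i++ {
		out <- message{payload: fmt.Sprintf("row %d", i)}
	}
}

// dispatch mimics the one required dispatcher goroutine: it drains the
// deposited messages and hands them onward (here a slice stands in for
// the CSV writer).
func dispatch(in <-chan message) []string {
	var rows []string
	for m := range in {
		rows = append(rows, m.payload)
	}
	return rows
}

func main() {
	out := make(chan message)
	go produce(out, 3)
	for _, r := range dispatch(out) {
		fmt.Println(r)
	}
}
```

With a concurrency quota of 0 there would be no goroutine left to play the `dispatch` role, which is consistent with requiring the quota to be positive.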
Anecdotally, during recent refactors when I failed to mark any nodes as roots (my mistake), the concurrency quota was set to zero, which produced an error. There didn't seem to be any conditional around that check, since it failed for from/range/filter, which I think would have been rewritten as a single source.
Lines 80 to 85 in dc08c57

func validatePlan(p *plan.Spec) error {
	if p.Resources.ConcurrencyQuota == 0 {
		return errors.New(codes.Invalid, "plan must have a non-zero concurrency quota")
	}
	return nil
}
Right, makes sense.
Given Go's convention of having a useful zero value, I wonder if we should change the meaning of concurrency quota to be the number of additional goroutines after the required one. Or maybe 0 should just mean the default of 1.
Nothing needs to change here for this PR; it just seems a little weird.
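A hedged sketch of the zero-as-default idea (a hypothetical helper, not code from this PR), where the Go zero value stays useful:

```go
package main

import "fmt"

// normalizeQuota illustrates the suggestion above: treat a zero
// concurrency quota as "use the default of 1", and otherwise clamp
// the quota to the configured limit. Names are illustrative only.
func normalizeQuota(quota, limit int) int {
	if quota == 0 {
		return 1 // zero value means the default of one goroutine
	}
	if limit > 0 && quota > limit {
		return limit // never exceed the configured concurrency limit
	}
	return quota
}

func main() {
	fmt.Println(normalizeQuota(0, 8))  // zero quota defaults to 1
	fmt.Println(normalizeQuota(16, 8)) // clamped to the limit, 8
	fmt.Println(normalizeQuota(4, 8))  // in-range quota passes through
}
```

Under this reading, validatePlan's non-zero check would become unnecessary, since a zero quota is simply interpreted rather than rejected.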
Pull the concurrencyLimit from feature flags at the start of the execution process, before planning, and stash it in the ExecutionOptions, which live in the ExecutionDependencies. From there it can then be modified by planner rules. This allows parallelization rules to raise the limit if they parallelize a query. At the same time we move the defaultMemoryLimit from the planner and into the execution options. We also move the computation of memory limit and concurrency quota from the planner and into the executor. Included are test cases covering the existing and the new method of determining query concurrency.
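A minimal sketch of the flow this PR describes, using hypothetical names (the real types live in Flux's execute and plan packages): the limit is read from feature flags before planning, a planner rule may raise it, and the executor computes the final quota.

```go
package main

import "fmt"

// ExecutionOptions stands in for the options stashed in the execution
// dependencies; the fields here are illustrative, not Flux's exact ones.
type ExecutionOptions struct {
	ConcurrencyLimit int
	MemoryLimit      int64
}

// fromFeatureFlags mimics reading the limit at the start of execution,
// before planning (the flag lookup is faked with a plain map).
func fromFeatureFlags(flags map[string]int) *ExecutionOptions {
	return &ExecutionOptions{
		ConcurrencyLimit: flags["concurrencyLimit"],
		MemoryLimit:      1 << 20, // stand-in for defaultMemoryLimit
	}
}

// raiseLimit mimics a parallelization planner rule bumping the limit
// after it parallelizes part of a query.
func raiseLimit(opts *ExecutionOptions, factor int) {
	opts.ConcurrencyLimit *= factor
}

// concurrencyQuota mimics the executor computing the quota from the
// plan, clamped by the options, with zero mapped to one as in the diff.
func concurrencyQuota(nonSourceNodes int, opts *ExecutionOptions) int {
	quota := nonSourceNodes
	if quota > opts.ConcurrencyLimit {
		quota = opts.ConcurrencyLimit
	} else if quota == 0 {
		quota = 1
	}
	return quota
}

func main() {
	opts := fromFeatureFlags(map[string]int{"concurrencyLimit": 4})
	raiseLimit(opts, 2)                     // a rule parallelized the query: 4 -> 8
	fmt.Println(concurrencyQuota(10, opts)) // clamped to 8
	fmt.Println(concurrencyQuota(0, opts))  // floored at 1
}
```

The point of keeping the limit in the execution dependencies is that both the planner rules and the executor see the same mutable options, so a rule's raised limit is visible when the quota is computed.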
Looks good to me.
LGTM. I agree with Chris that the handling of that 0 case feels a little clumsy, but I don't really see what we can do about it. 🤷