feat(rush,node-core-library): allow weighted async concurrency #4672

aramissennyeydd · 2024-05-01T22:48:50Z

Summary

Follow-up to #4092. The goal of this PR is to let Rush users define the weight of an operation. This is especially useful for operations that take more than their fair share of the available concurrency (either CPU or memory). Examples include jest's max-workers or other CPU-intensive tasks, as well as memory-intensive tasks where you want to run only one heavy task at a time without dropping the overall Rush parallelism.

This code is very similar to David's original PR, with a few modifications:

The implementation is now very similar to Async.forEachAsync().
Operation weights can only be whole numbers.
A new WeightedOperationPlugin adds the weight to the operation and validates that the weight is a whole number >= 0.
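The validation rule described above (whole number, >= 0) can be sketched roughly as follows. This is only an illustration: `assertValidWeight` is a hypothetical helper name, not the actual code in WeightedOperationPlugin.

```typescript
// Hypothetical helper (not the actual WeightedOperationPlugin code):
// accepts a weight only if it is a whole number greater than or equal to 0.
function assertValidWeight(operationName: string, weight: number): number {
  // Number.isInteger rejects fractions, NaN and Infinity in one check.
  if (!Number.isInteger(weight) || weight < 0) {
    throw new Error(
      `Operation "${operationName}" has an invalid weight (${weight}); expected a whole number >= 0.`
    );
  }
  return weight;
}
```

A weight of 0 passes this check, which matches the PR's test case for weight 0, while fractional weights such as 0.5 are rejected.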

There was some earlier discussion around the name weight. I think it's a decent name, and it's the one I would reach for when naming this as well. In this PR, concurrencyUnits refers to the weighted sum of the currently active operations.

Details

The decision to use whole numbers for concurrency is both for maintainer sanity and because I was running into a deadlock with concurrency 1 and various weights < 1. Whole numbers seem to fix this, since there's no longer any risk of floating-point math causing weird issues. I'm definitely looking for additional use cases to test that theory on, though.

I didn't do it in this PR, but theoretically, Async.forEachAsync could be reimplemented in terms of Async.forEachWeightedAsync with each item having a weight of 1.
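To make that idea concrete, here is a minimal sketch of a weighted loop and an unweighted wrapper that assigns every item a weight of 1. The names `forEachWeighted`, `forEachUnweighted` and `IWeighted` are hypothetical, and the rule "start the next item while fewer than `concurrency` units are in progress" is an assumption about the general approach, not the node-core-library implementation.

```typescript
interface IWeighted<T> {
  item: T;
  weight: number; // assumed to be a whole number >= 0
}

// Hypothetical sketch; not the Async implementation from node-core-library.
function forEachWeighted<T>(
  items: ReadonlyArray<IWeighted<T>>,
  callback: (item: T) => Promise<void>,
  concurrency: number
): Promise<void> {
  return new Promise<void>((resolve, reject) => {
    let unitsInProgress = 0;
    let running = 0;
    let nextIndex = 0;
    let failed = false;

    const pump = (): void => {
      if (failed) {
        return;
      }
      // Assumed admission rule: start items while at least one unit is free.
      while (nextIndex < items.length && unitsInProgress < concurrency) {
        const { item, weight } = items[nextIndex++];
        // Cap the weight so one item never counts for more than the whole budget.
        const units = Math.min(weight, concurrency);
        unitsInProgress += units;
        running++;
        // The async wrapper turns a synchronous throw into a rejection.
        (async () => callback(item))().then(
          () => {
            unitsInProgress -= units;
            running--;
            pump();
          },
          (error) => {
            failed = true;
            reject(error);
          }
        );
      }
      if (running === 0 && nextIndex >= items.length) {
        resolve();
      }
    };

    pump();
  });
}

// The unweighted variant is a thin wrapper that gives every item a weight of 1.
function forEachUnweighted<T>(
  items: ReadonlyArray<T>,
  callback: (item: T) => Promise<void>,
  concurrency: number
): Promise<void> {
  return forEachWeighted(
    items.map((item) => ({ item, weight: 1 })),
    callback,
    concurrency
  );
}
```

With every weight equal to 1, the number of units in progress equals the number of running callbacks, so the wrapper behaves like a plain concurrency limit.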

How it was tested

Added unit tests and tested that this runs on the cobuild sandbox repo.

Impacted documentation

Any references to rush parallelism may benefit from this.

aramissennyeydd · 2024-05-01T22:49:18Z

@dmichon-msft One more for ya 😅 , let me know if you want to revive your original PR. Happy to close this if so!

common/changes/@microsoft/rush/sennyeya-weighted-graph_2024-05-01-22-51.json

common/changes/@rushstack/node-core-library/sennyeya-weighted-graph_2024-05-01-22-51.json

iclanton

Nice!

common/reviews/api/node-core-library.api.md

libraries/node-core-library/src/Async.ts

aramissennyeydd · 2024-05-02T17:22:30Z

Thought of one more case we'd want to cover, but not sure what the right path forward is.

Say for example concurrency = 4 and we have an array of items with weights of 1, 2, 4 respectively. With this implementation, all 3 operations will be queued at once.

That feels like the opposite of the behavior we'd want: the operation of size 4 should be queued first and finish, and then the smaller tasks should complete after it. Weights of 0 should be allowed to execute at the same time as a full-concurrency operation.

There's a block, https://github.com/aramissennyeydd/rushstack/blob/6184642eae6682d919edada2a989426fb3c298d5/libraries/rush-lib/src/logic/operations/OperationExecutionManager.ts#L43-L48 that I think we could flip from a.criticalPathLength - b.criticalPathLength to b.criticalPathLength - a.criticalPathLength to reflect this behavior?
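The scenario above can be reproduced with a small synchronous sketch. It assumes the admission rule is "start the next item while fewer than `concurrency` units are in progress", which is inferred from the description in this comment. `startedInFirstPass` is a hypothetical helper, and sorting by weight is only a stand-in for the ordering change being proposed, not the actual criticalPathLength comparison.

```typescript
// Hypothetical helper: returns the weights of the items that would start
// immediately, before any of them finishes, under the assumed admission rule.
function startedInFirstPass(weights: number[], concurrency: number): number[] {
  const started: number[] = [];
  let unitsInProgress = 0;
  for (const weight of weights) {
    if (unitsInProgress >= concurrency) {
      break; // no free unit left, so later items must wait
    }
    started.push(weight);
    unitsInProgress += Math.min(weight, concurrency);
  }
  return started;
}

const exampleConcurrency = 4;

// Items in their original order: all three start at once (7 units in total).
const asSubmitted = startedInFirstPass([1, 2, 4], exampleConcurrency);

// Heaviest first (illustrative ordering): only the weight-4 item starts.
const heaviestFirst = startedInFirstPass(
  [1, 2, 4].slice().sort((a, b) => b - a),
  exampleConcurrency
);

console.log(asSubmitted, heaviestFirst);
```

Under this rule a weight-0 item still cannot start while a full-concurrency item is running, so the last wish in the comment would need a different admission check.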

libraries/node-core-library/src/Async.ts

libraries/rush-lib/src/logic/operations/WeightedOperationPlugin.ts

libraries/rush-lib/src/schemas/rush-project.schema.json

aramissennyeydd · 2024-05-06T12:54:35Z

@iclanton Thanks for the reviews on this! Excited to use this internally! 🎉

aramissennyeydd · 2024-05-06T16:52:36Z

@iclanton Does this need a rush release? I don't see a new @microsoft/rush-lib release with the PhasedScriptAction changes.

benkeen · 2024-09-17T17:10:57Z

Hi @aramissennyeydd - thanks for contributing the feature! However, this option isn't yet documented and I'm not clear on exactly how to use it from the typings info:

"The number of concurrency units that this operation should take up. The maximum concurrency units is determined by the -p flag."

We're porting over our build to use Rush cobuilds, which will run all work in a single phased command. However, certain operations can't be run in parallel with others: we need to disable rush parallelization (-p=1) on those operations, which led me to this option.

For cobuilds, a trick is to set the overall task to use something like -p=25% on the rush command, encouraging distribution of the work across multiple agents. I've been experimenting with different values for weight for our tasks that need -p=1, but it's extremely difficult to understand precisely what's happening: we're still getting failures consistent with what would happen if they were being run in parallel. Your remark above made me think perhaps this isn't usable for us yet?

Say for example concurrency = 4 and we have an array of items with weights of 1, 2, 4 respectively. With this implementation, all 3 operations will be queued at once.

Any tips?

aramissennyeydd · 2024-09-17T17:22:07Z

@benkeen I was in a similar situation which is what led me to implement this feature. Basically, we had 2 very expensive packages that were so resource intensive that if they ran on the same agent they would crash the build. The only way to move forward there was to either set parallelism to 1 (a non-starter) or go down this path.

What we've done with this is set parallelism to a known value, say 8, and then manually defined the weights for the expensive operations. The algorithm currently works such that if you set those 2 operations to weight 8, they cannot be scheduled on the same machine. That's worked very well for us.

Can you share more about your use case? Why can't these operations run in parallel with others? Is this something that could be adjusted using the rush operation graph instead?

benkeen · 2024-09-17T17:36:38Z

That's encouraging! Thanks for the response :)

Yup, sounds very similar indeed. A while back we started using a new MSW-Storybook style of integration tests for our packages. There are a lot of positive things to say about them: you get more bang for your buck with the tests; they do a full test of the component with API calls; they're quite readable; you need fewer unit tests that often aren't awfully valuable in and of themselves & add maintenance. But the big problem was that they take time to run (20-30 seconds isn't unusual) and don't parallelize well at all. We ended up creating a custom stage on our pipeline with -p=1 to ensure we only run 1 package's tests at a time.

What we've done with this is set parallelism to a known value, say 8, and then manually defined the weights for the expensive operations

Innnnnteresting. Our build agents have a total concurrency of 16, so it sounds like I'd do this:

set the rush command to -p=4 (i.e. 25% for us)
in the operation in rush-project.json, set weight to 4 for any operation where we don't want parallelism.

Sound about right?
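As a rough illustration of the second step, the relevant part of rush-project.json could look like the object below, written here as a TypeScript literal. The `weight` field comes from this PR. The `operationSettings` and `operationName` field names are assumed from the rush-project.json schema, and the operation name "test" is a hypothetical placeholder.

```typescript
// Assumed shape of the relevant slice of rush-project.json (not the full schema).
interface IOperationSettingsSketch {
  operationName: string; // hypothetical operation name in this example
  weight?: number; // whole number >= 0, per this PR
}

interface IRushProjectSketch {
  operationSettings: IOperationSettingsSketch[];
}

// With the command run at -p=4, a weight of 4 makes this operation take up
// the whole concurrency budget on the agent that runs it.
const rushProjectConfig: IRushProjectSketch = {
  operationSettings: [{ operationName: "test", weight: 4 }]
};

// Print the JSON form that would go in the config file.
console.log(JSON.stringify(rushProjectConfig, null, 2));
```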

aramissennyeydd · 2024-09-17T18:45:33Z

@benkeen You've probably already run into this, but we had issues with MSW v1 memory leaks in unit tests and moved to v2 to fix some of that.

For the parallelism, that sounds right. You'll still get some other operations with weight < 4 running in parallel, but we've found that this doesn't generally affect CI times, and if it does you can pin those to a higher weight as well. Setting the weight equal to the rush global parallelism will guarantee that the operation will not run in parallel with any other operation that has the same weight set. Other operations can creep in if they start before the operation with weight 4 is picked up.
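The behavior described here can be checked with a small tick-based simulation. It is a sketch under assumptions: the scheduler starts queued operations in order while fewer than `concurrency` units are in progress, `concurrency` is at least 1, and the operation names and durations are made up for the example. `simulateTimeline` is a hypothetical helper, not part of Rush.

```typescript
interface ISimulatedOperation {
  name: string;
  weight: number;
  duration: number; // in ticks, assumed >= 1
}

// Returns, for each tick, the names of the operations running during that tick.
function simulateTimeline(ops: ISimulatedOperation[], concurrency: number): string[][] {
  const queue = ops.slice();
  let running: { op: ISimulatedOperation; endsAt: number }[] = [];
  const timeline: string[][] = [];

  for (let tick = 0; queue.length > 0 || running.length > 0; tick++) {
    // Drop operations that have finished by this tick.
    running = running.filter((r) => r.endsAt > tick);
    let units = running.reduce((sum, r) => sum + Math.min(r.op.weight, concurrency), 0);

    // Assumed admission rule: start queued operations while a unit is free.
    while (queue.length > 0 && units < concurrency) {
      const op = queue.shift()!;
      running.push({ op, endsAt: tick + op.duration });
      units += Math.min(op.weight, concurrency);
    }

    if (running.length > 0) {
      timeline.push(running.map((r) => r.op.name));
    }
  }
  return timeline;
}

// Hypothetical workload at concurrency 4: two heavy test operations (weight 4)
// and two light operations (weight 1).
const timeline = simulateTimeline(
  [
    { name: "lint", weight: 1, duration: 3 },
    { name: "testA", weight: 4, duration: 2 },
    { name: "testB", weight: 4, duration: 2 },
    { name: "build", weight: 1, duration: 1 }
  ],
  4
);
console.log(timeline);
```

In this run the light lint operation starts first and keeps running alongside each heavy operation, while testA and testB never share a tick.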

benkeen · 2024-09-17T19:20:24Z

Thanks @aramissennyeydd, appreciate the help.

aramissennyeydd added 7 commits April 29, 2024 16:39

start working on weighted graph

919dff6

adds a new forEachWeightedAsync method to Async that uses operation weights to scale concurrent units

ea5bae4

add test cases for async weighted

f16e6a9

add a test case for weight 0

cc5b1df

add a header comment

8ed8fb8

fix-api-report

dbc8402

fix weightedoperationplugin

0a1ebc5

aramissennyeydd requested review from iclanton, octogonz, apostolisms, D4N14L and dmichon-msft as code owners May 1, 2024 22:48

aramissennyeydd added 2 commits May 1, 2024 18:51

add changesets

fed6a23

fix linting

f935016

iclanton reviewed May 2, 2024

common/changes/@microsoft/rush/sennyeya-weighted-graph_2024-05-01-22-51.json

iclanton reviewed May 2, 2024

common/changes/@rushstack/node-core-library/sennyeya-weighted-graph_2024-05-01-22-51.json

iclanton reviewed May 2, 2024

common/reviews/api/node-core-library.api.md

libraries/node-core-library/src/Async.ts

aramissennyeydd and others added 4 commits May 1, 2024 22:34

move the weighting behavior into an overload

06c7679

Apply suggestions from code review

32ad704

Co-authored-by: Ian Clanton-Thuon

update changeset

1ac2b95

remove unnecessary tsdoc things

75701f7

iclanton reviewed May 2, 2024

common/reviews/api/node-core-library.api.md

aramissennyeydd added 6 commits May 1, 2024 22:48

make weight required

56a70a6

fix api report

a04fe82

only use weights when weighted is set to true

86ae47a

add a test for weighted being disabled

63857ef

moving the comment to the public function

3406734

fix linting concern

80dae8f

handle larger than concurrency weights

ca1ffc6

iclanton reviewed May 3, 2024

aramissennyeydd and others added 3 commits May 3, 2024 14:48

Apply suggestions from code review

cc2a498

Co-authored-by: Ian Clanton-Thuon

Update libraries/rush-lib/src/schemas/rush-project.schema.json

a4bbbff

Co-authored-by: Ian Clanton-Thuon

address code review questions

6163ca9

aramissennyeydd requested a review from iclanton May 3, 2024 19:45

aramissennyeydd added 2 commits May 3, 2024 15:49

add documentation for weighted

343ef0e

fix api report

27ce6a1

iclanton approved these changes May 3, 2024

libraries/node-core-library/src/Async.ts

iclanton reviewed May 3, 2024

common/reviews/api/node-core-library.api.md

aramissennyeydd added 2 commits May 3, 2024 18:15

add weights for map too

84d8ce8

fix api report

0598f0d

iclanton approved these changes May 4, 2024

libraries/node-core-library/src/Async.ts

libraries/node-core-library/src/Async.ts

common/reviews/api/node-core-library.api.md

Apply suggestions from code review

a526067

Co-authored-by: Ian Clanton-Thuon

iclanton merged commit 21afaae into microsoft:main May 5, 2024
5 checks passed

This was referenced May 8, 2024

bug(rush-lib,operation-weighting): UNASSIGNED_OPERATION causing memory leak #4684

Merged

[node-core-library] iterator weighting isn't fully respected by Async#forEachAsync #4688

Merged
