Add simplified critical path scheduler to improve build times #2177
Conversation
The existing algorithm doesn't work because it strictly requires that all outputs are visited before updating an edge, so any task downstream from a task with multiple out-edges may get ignored. The fix is to always propagate your critical time to the next input node, and re-queue that node only if the offered critical time is higher than its current one.
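The propagation described above can be sketched as follows. This is a minimal illustration, not ninja's actual code: the `Edge` struct, `run_time_estimate`, and `ComputeCriticalTimes` are hypothetical stand-ins, and `inputs` here points directly at producer edges rather than going through ninja's `Node` type.

```cpp
#include <cstdint>
#include <queue>
#include <vector>

// Hypothetical minimal model of the fix: each edge carries a critical
// time; we propagate critical times backwards from the requested
// outputs, re-queueing a producer edge only when a consumer offers it
// a higher critical time than it already has.
struct Edge {
  int64_t run_time_estimate = 1;  // 1 for normal edges, 0 for phony
  int64_t critical_time = -1;     // -1 means "not yet computed"
  std::vector<Edge*> inputs;      // producer edges of this edge's inputs
};

void ComputeCriticalTimes(std::vector<Edge*>& requested_outputs) {
  std::queue<Edge*> work;
  for (Edge* e : requested_outputs) {
    e->critical_time = e->run_time_estimate;
    work.push(e);
  }
  while (!work.empty()) {
    Edge* e = work.front();
    work.pop();
    for (Edge* in : e->inputs) {
      // Offer our critical time to the producer. Only update and
      // re-queue it if the offer raises its critical time; this is
      // what lets tasks downstream of multi-output edges be revisited.
      int64_t offered = e->critical_time + in->run_time_estimate;
      if (offered > in->critical_time) {
        in->critical_time = offered;
        work.push(in);
      }
    }
  }
}
```

On a diamond-shaped graph (one generator feeding two compiles feeding one link), this assigns the generator the highest critical time, so it is scheduled first.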
1. Move `EdgePriorityQueue` to graph.h and inherit from `priority_queue`
2. Add comment about `edge->critical_time()`
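A queue like the one named in the commit message might look like this. This is a sketch under assumptions, not the PR's actual code: the `Edge` type here is a stand-in with only a `critical_time` field, and the `Clear()` helper is illustrative.

```cpp
#include <cstdint>
#include <queue>
#include <vector>

// Stand-in edge type; ninja's real Edge lives in graph.h.
struct Edge {
  int64_t critical_time = 0;
};

struct EdgePriorityLess {
  bool operator()(const Edge* a, const Edge* b) const {
    // std::priority_queue pops the *largest* element first, so
    // comparing "less by critical time" yields a max-heap: the edge
    // with the highest critical time is scheduled first.
    return a->critical_time < b->critical_time;
  }
};

// Inheriting from std::priority_queue gives access to the protected
// underlying container `c`, which allows extras such as Clear().
struct EdgePriorityQueue
    : std::priority_queue<Edge*, std::vector<Edge*>, EdgePriorityLess> {
  void Clear() { c.clear(); }
};
```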
AddTarget cannot add edges to the ready queue before the critical time has been computed.
I would have preferred the approach from #2019. The number of edges is a quite inaccurate estimate of the build time of the critical path. Also, I feel the mentioned problem¹ is orthogonal and should be solved differently, e.g. with pools. Both are symptoms that depend heavily on the particular project and do not necessarily generalize to other projects. Have you considered prioritizing only the critical path (as if with infinite parallelism), and scheduling other edges in parallel to it according to other priorities, such as minimizing peak resource usage²?

Footnotes:
1. The historic build time is inaccurate, too, if e.g. ccache is used.
This is a simplified alternative to #2019.
It still adds a critical path scheduler based on weighted edges in the build graph, but instead of using historical runtime it simply assigns a priority of 1 to everything except phony edges. In doing so, it prioritizes jobs by their depth in the build graph. This is better than random scheduling because jobs with dependents unlock more work when they complete, so the command runner is starved less often.
For example, if your build has some code-generated sources, then the code generator is prioritized above compiling non-generated sources, so all source files are unblocked and able to compile in parallel.
Compared to weighting by historic runtime, this will be worse in cases where build times are limited by a single long-running compile job at a low depth and with no dependencies. However, it does avoid the problem of systematically launching the resource-intensive jobs all at once.