Better scheduler for parallelized linker #2029

rui314 · 2021-09-22T03:18:51Z

Parallelized linkers are gaining popularity these days. lld, which is multi-threaded, is very popular for large-scale program. A yet another linker, mold (https://github.com/rui314/mold), is more parallelized than lld. (Disclaimer: I'm the original author of the two linkers.)

The problem we observe with ninja + parallelized linker is that ninja spawns more subprocesses than necessary. Theoretically, we can't make a build faster by spawning more subprocesses once the CPU usage is saturated. In fact, doing so is likely to slows down the build due to higher memory pressure.

I believe ninja could do better by incorporating the existence of multi-threaded linker into scheduling decision. Currently, I guess ninja assumes all subprocesses are single-threaded. Is there any effort to improve ninja in this area?

rui314/mold#117

ukai · 2021-09-22T15:06:38Z

use pool?
https://ninja-build.org/manual.html#ref_pool

rui314 · 2021-09-23T01:34:48Z

Pool might work, but in order to use that, it looks like users have to manually edit an auto-generated build.ninja file. I wish ninja to detect parallelized linker and adjust scheduling decision accordingly.

ilyapopov · 2021-09-27T12:46:32Z

See also #991

hadrielk · 2022-06-02T09:04:31Z

@rui314 I think many people don't write build.ninja by hand, but use CMake to generate it for them. For such cases, a solution does exist: JOB_POOL_COMPILE and JOB_POOL_LINK.

Basically it lets them limit how many simultaneous compiler and/or linker jobs there are.

For example, this will limit the number of compilation jobs to 64 and linker jobs to 4, globally for the project:

set_property(GLOBAL PROPERTY JOB_POOLS "comp_jobs=64" "link_jobs=4")
set(CMAKE_JOB_POOL_COMPILE comp_jobs)
set(CMAKE_JOB_POOL_LINK link_jobs)

One can also set the pools per target.

(and I should note: that it's still limited even further by the -j <N> jobs argument given to Ninja, or its auto-detected jobs value if the argument is not given)

ilyapopov · 2022-06-02T13:22:59Z

@hadrielk

Unfortunately, that does not solve the problem. The setup you suggest would allow up 64 compile jobs and up to 4 link jobs at the same time (but no more than -j N jobs in total, counting a link job as one, and not taking into account that each occupies many cpus). What is requested is to specify that we want up to 64 compile jobs or 4 link jobs. There is currently no way to specify that.

hadrielk · 2022-06-02T17:33:38Z

@ilyapopov yeah, you're not wrong.

But in practice I think it happens to work out that way anyway - or at least it does at my day job. Because Ninja spawns the compiler rules first, building all/most of the .o, all at the beginning... and then has a long tail in the back half of the overall build, linking stuff.

Or at least that's how it appears to us monitoring it at my day job. But that may be very specific to our setup. (~1,400 targets being built, mostly unity files, of both .so libs and execs, using distcc, ~3 hour build time)

We've actually been looking into how to smooth out the workload, to force a more even/balanced scheduling algorithm.

hadrielk · 2022-06-02T17:59:29Z

BTW, while on this topic... if you look through the various open issues for Ninja, you'll find a bunch asking for different knobs for controlling what gets built, when.

I think part of the reason for that, is that one of the most important and interesting aspects to build systems is not how fast and well they can load and build a DAG and determine what-changed and needs rebuilding... but rather it's the scheduling decisions after they have such information.

Or at least it is for us at my employer's, where a full build only takes seconds to build a DAG and determine changes, but takes hours to actually build - and better scheduling can make a drastic difference.

The problem is I don't think there's one "best" algorithm. There're too many different resource constraints and needs for different users.

So we're thinking of forking Ninja and adding python plugin support, so that Ninja can let a loaded python callback decide the scheduling once the set of edges are chosen (...or really, within CommandRunner::CanRunMore() and Plan::FindWork(), so for each iteration of the Builder's while-loop).

That way we could prototype different scheduling ideas. Invoking python for this step would probably be fast enough to even just use permanently, rather than just for prototyping.

Has anyone already done this type of thing?

ilyapopov · 2022-06-02T18:07:54Z

I have not seen anyone adding scripting for scheduling, but there is a PR #2019 to add critical path scheduling.

hadrielk mentioned this issue Jun 11, 2022

Parallelism Variables #2125

Closed

This comment was marked as abuse.

Sign in to view

lb90 mentioned this issue Nov 21, 2024

Ninja: default pool depth for link.exe mesonbuild/meson#13937

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better scheduler for parallelized linker #2029

Better scheduler for parallelized linker #2029

rui314 commented Sep 22, 2021 •

edited

Loading

ukai commented Sep 22, 2021

rui314 commented Sep 23, 2021

ilyapopov commented Sep 27, 2021

hadrielk commented Jun 2, 2022

ilyapopov commented Jun 2, 2022 •

edited

Loading

hadrielk commented Jun 2, 2022 •

edited

Loading

hadrielk commented Jun 2, 2022

ilyapopov commented Jun 2, 2022

This comment was marked as abuse.

This comment was marked as abuse.

Better scheduler for parallelized linker #2029

Better scheduler for parallelized linker #2029

Comments

rui314 commented Sep 22, 2021 • edited Loading

ukai commented Sep 22, 2021

rui314 commented Sep 23, 2021

ilyapopov commented Sep 27, 2021

hadrielk commented Jun 2, 2022

ilyapopov commented Jun 2, 2022 • edited Loading

hadrielk commented Jun 2, 2022 • edited Loading

hadrielk commented Jun 2, 2022

ilyapopov commented Jun 2, 2022

This comment was marked as abuse.

This comment was marked as abuse.

rui314 commented Sep 22, 2021 •

edited

Loading

ilyapopov commented Jun 2, 2022 •

edited

Loading

hadrielk commented Jun 2, 2022 •

edited

Loading