Add a flag to remain persistent. #1389

jmgao · 2018-02-20T22:52:05Z

Large projects (e.g. Android, with its 1GB+ ninjafiles) can take
multiple seconds to parse ninjafiles, leading to a very painful
edit-compile-test cycle. Add a flag that causes ninja to remain
persistent and run a build upon receiving input to stdin, speeding up
a no-op incremental build of a directory in Android from 10 seconds to
under 200 milliseconds.

Large projects (e.g. Android, with its 1GB+ ninjafiles) can take multiple seconds to parse ninjafiles, leading to a very painful edit-compile-test cycle. Add a flag that causes ninja to remain persistent and run a build upon receiving input to stdin, speeding up a no-op incremental build of a directory in Android from 10 seconds to under 200 milliseconds.

jmgao · 2018-02-21T02:27:31Z

(in the off chance this is actually acceptable for upstream and not just something that should be done in a local hack, there should be an assertion that the line read in by getline is empty, to allow for future extensibility without breaking backward compat)

evmar · 2018-02-21T05:30:29Z

How did you get 1gb ninja files? At even 1kb of text per file (which is 10-20 lines of text per file) that's still a million build steps?

jmgao · 2018-02-21T06:07:50Z

@colincross and @danw can provide more context, but some cursory inspection says that 1kB per file is probably conservative. grepping for an arbitrary file finds a rule/build combo that's 3kB.

We seem to have some especially huge lines, there's one that's 5 MB, four that are 2.8 MB, and four more that are around 1 MB. They seem to mostly be giant conglomeration rules like "build all of the javadocs" or "build all of the native sources" or "build the dialer apk".

jmgao · 2018-02-21T06:22:09Z

(Also, it looks like we have about half a million build steps when targeting an arbitrarily selected device. There's a bit of combinatorial explosion with resources, but paging through the list, most of them seem legit. We just have way too much source...)

evmar · 2018-02-21T07:19:32Z

I'm pretty surprised ninja works at all at that scale. There are various places where we likely do O(N) scans of data structures. I guess you are lucky!

…

On Feb 20, 2018 10:22 PM, "Josh Gao" ***@***.***> wrote: (Also, it looks like we have about half a million build steps when targeting an arbitrarily selected device. There's a bit of combinatorial explosion with resources, but paging through the list, most of them seem legit. We just have way too much source...) — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#1389 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAAPBzp9pakJfhw81Eh4fIuNzHOUB1agks5tW7YSgaJpZM4SMzPr> .

danw · 2018-02-21T11:45:03Z

For our internal "master" tree, building for the Pixel 2 (taimen) device, our ninja files now add up to 1.5GB. Some stats on that configuration:

    603k     build statements
    135k 22% phony build statements
    849k     nodes
    625k 73% nodes that are outputs of build statements
    223k 26% input-only nodes

    105k actions run in a "normal" local build
    196k actions run in a build server build (building more tests and other artifacts)

We're seeing 10+ seconds of ninja-related startup time, last I checked this was largely due to:

Path -> Node* lookups (lots and lots of them)
Parsing/allocation of EvalStrings
Loading of the ninja_deps/ninja_log files. These can reach ~100MB, and we're running ninja with the same ninja directory 3 times (bootstrapping the generator).

The first two are fairly optimized code paths already for what they do, and my only significantly successful experiment has been to defer some of the work until we know that we really need it: https://android-review.googlesource.com/c/platform/external/ninja/+/461005

Realistically, for this scale, I've been looking at a binary file format -- precomputing as much as possible and using gzip'd string tables instead of variable substitution. That's actually where some of the stats above came from -- it brings the file size down to 104MB, and early testing with a multithreaded go implementation is showing <1/4 the time to do a similar set of tasks to ninja.

On the extreme side, I've gotten our generator to produce an 8.5GB ninja file when trying to produce a single build that builds many configurations at once, but that build isn't particularly realistic on most machines anymore.

evmar · 2018-02-21T17:09:03Z

(I don't mean to derail your larger thrust here, just wanted to point out some smaller options that might help a bit.)

One of the reasons we have the $variable support is to allow for some refactoring of input files -- like if a bunch of targets all have the same -I flags you could share them. I appreciate that maybe that isn't convenient for you depending on how your generator code is laid out, but the snippet you showed (where it was a separate rule per compile command) would significantly bloat the file.

A more common pattern is something like:

cflags = cflags used by most code
rule cc
  command = clang++ $cflags etc
build exceptionaltarget: cc inputs
  cflags = special cflags used by this target

to factor the long flags lines together.

Ultimately of course all of these strings must be expanded into full command lines in memory, but if the bottleneck is textual input -> in-memory representation then that would probably help.

A rule per target is surely constructing a lot of extra unneeded data structures. You could even maybe do something like

rule arbitrarycommand
  deps = gcc
  command = $cmd

build whatever.o: arbitrarycommand morestuff
  desc = ...
  cmd = lots of stuff here

to factor out the construction of a lot of intermediate Rule objects and share the "deps = gcc" line repeatedly.

colincross · 2018-02-21T18:34:28Z

We mix the results of two generators, blueprint and kati. Blueprint was designed around ninja, and creates sensible ninja files with only a few rules and repeated cflags in variables. Kati is converting makefiles to ninja, and produces a much cruder ninja file with a rule per build statement and no use of variables (not even $in and $out), which results in all the input files being listed in both the rule and build statements.

danw · 2018-02-22T03:36:11Z

While variables reduce the file size, that's not necessarily a win for loading time, expanding the variables isn't particularly cheap. Keeping them unique & human-understandable also leads to some of our variable names being longer than the value that they represent :(

From the 1.3GB -> 1.5GB jump in the last week, we did notice that we're duplicating some dependencies that weren't strictly necessary. So once my patches land, we'll be back to "only" 1GB :)

jhasse · 2019-04-10T07:53:11Z

Closing in favor of #1438.

jmgao force-pushed the persistent branch from dae0788 to 78adfff Compare February 21, 2018 00:14

jdarpinian mentioned this pull request May 12, 2018

Alternative implementation of a persistent build server mode. #1438

Closed

jhasse mentioned this pull request Nov 14, 2018

Introduce binary manifest format #1093

Closed

nico mentioned this pull request Nov 15, 2018

Consider adding a second, "heavy" binary #1499

Open

jhasse closed this Apr 10, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a flag to remain persistent. #1389

Add a flag to remain persistent. #1389

jmgao commented Feb 20, 2018

jmgao commented Feb 21, 2018

evmar commented Feb 21, 2018

jmgao commented Feb 21, 2018

jmgao commented Feb 21, 2018

evmar commented Feb 21, 2018 via email

danw commented Feb 21, 2018

evmar commented Feb 21, 2018 •

edited

Loading

colincross commented Feb 21, 2018

danw commented Feb 22, 2018

jhasse commented Apr 10, 2019

Add a flag to remain persistent. #1389

Add a flag to remain persistent. #1389

Conversation

jmgao commented Feb 20, 2018

jmgao commented Feb 21, 2018

evmar commented Feb 21, 2018

jmgao commented Feb 21, 2018

jmgao commented Feb 21, 2018

evmar commented Feb 21, 2018 via email

danw commented Feb 21, 2018

evmar commented Feb 21, 2018 • edited Loading

colincross commented Feb 21, 2018

danw commented Feb 22, 2018

jhasse commented Apr 10, 2019

evmar commented Feb 21, 2018 •

edited

Loading