
Incremental compilation RFC #1298

Merged
merged 7 commits into rust-lang:master on Nov 6, 2015

Conversation

nikomatsakis
Contributor

High-level strategy for incremental compilation.

cc @rust-lang/compiler

Rendered

@nikomatsakis nikomatsakis added the T-compiler Relevant to the compiler team, which will review and decide on the RFC. label Sep 28, 2015
@larsbergstrom

Awesome!

This is about incremental builds for a single crate, right? If so, it's worth calling that out.

Also, if I'm correct, these caches are not meant to be shared across build machines, right?

@aturon
Member

aturon commented Sep 28, 2015

@nikomatsakis The summary talks about debug builds specifically, but IIRC we discussed how this would apply to release builds as well? (I.e., a story a bit like parallel codegen units, where you'd be trading incrementality against optimization potential due to passing LLVM smaller units of code)

@nikomatsakis
Contributor Author

@larsbergstrom actually, I believe incremental builds across crates can be done relatively easily, though I didn't discuss it. I will add a TODO item to summarize how that would work.

@aturon yes I updated the summary, my mistake.

@nikomatsakis
Contributor Author

@larsbergstrom added a brief note about cross-crate dependencies

@nikomatsakis
Contributor Author

@larsbergstrom

Also, if I'm correct, these caches are not meant to be shared across build machines, right?

That is correct.

@eefriedman
Contributor

How does member function name (x.foo()) lookup work in this scheme, particularly in the case of autoderef? Presumably a failed lookup has to create a dependency on something, but it's not clear what exactly that "something" is.

@nikomatsakis
Contributor Author

@eefriedman

How does member function name (x.foo()) lookup work in this scheme, particularly in the case of autoderef? Presumably a failed lookup has to create a dependency on something, but it's not clear what exactly that "something" is.

That "something" is the IR tables that indicate what traits are in scope at a given point, as well as those that collect all the impls for a trait (I did not add an exhaustive listing to the RFC). Those will presumably be linked up something like the following:

  • there will be an edge from the containing module/scope to the tables indicating what traits are in scope, such that if a new use statement is added, portions of those tables are invalidated.
  • method search will be adding edges from the table of traits to the fns that include method calls.
  • the coherence pass will add edges from each impl to IR node representing the set of traits of that impl
  • trait search will add edges from the set of traits for a given impl to the fn using it
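
In very rough pseudo-Rust, just to illustrate the shape of those edges (the node names here are hypothetical, not the actual compiler types):

```rust
#[derive(Clone, PartialEq, Eq, Hash, Debug)]
enum DepNode {
    Module(String),        // the module/scope containing `use` statements
    TraitsInScope(String), // table of traits visible in that module
    ImplsOfTrait(String),  // the set of impls collected for a trait
    TypeckFn(String),      // type-checking (including method lookup) of a fn
}

struct DepGraph {
    edges: Vec<(DepNode, DepNode)>, // (source, target): target depends on source
}

impl DepGraph {
    fn add_edge(&mut self, from: DepNode, to: DepNode) {
        self.edges.push((from, to));
    }
}

// Record the reads performed while resolving `x.foo()` inside `caller`.
fn record_method_lookup(graph: &mut DepGraph, module: &str, trait_name: &str, caller: &str) {
    // A new `use` in the module invalidates the traits-in-scope table...
    graph.add_edge(DepNode::Module(module.into()), DepNode::TraitsInScope(module.into()));
    // ...and any fn whose method search consulted that table must be redone.
    graph.add_edge(DepNode::TraitsInScope(module.into()), DepNode::TypeckFn(caller.into()));
    // Adding/removing an impl changes the impl set for the trait, which the
    // lookup also read (even a failed lookup reads these tables).
    graph.add_edge(DepNode::ImplsOfTrait(trait_name.into()), DepNode::TypeckFn(caller.into()));
}
```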

That is roughly the idea. Make sense?

@eefriedman
Contributor

Yes, that makes sense; thanks.

strategies can be used to enable lazy or parallel compilation at later
times. (Eventually, though, it might be nice to restructure the
compiler so that it operates in more of a demand driven style, rather
than a series of sweeping passes.)

@bstrie
Contributor

Mind elaborating on what "demand driven style" entails and how it differs from our current approach?

@retep998
Member

As an example of what I think is incremental compilation done right, see MSVC. Not only does it have an incremental compilation + linking mode that works fairly well, but it also has an incremental LTCG mode where it does full link time optimization, just incrementally.

impl Type {       // Path: <root>::foo::<impl1>
    fn bar() {..} // Path: <root>::foo::<impl1>::bar
}
impl Type { }     // Path: <root>::foo::<impl2>

@shepmaster
Member

Since you don't indicate that every path has a unique integer, this seems to imply that you'd have to know if there are any duplicate children before you start naming, or have some amount of mutability to go back and "fix" the first child when you see the second child.

Is there a possibility to simply leave the first one as <impl> and then mark the second one as <impl2>?

@shepmaster
Member

As I understand it, a large benefit of incremental compilation is speed, but there's no mention of tests that attempt to quantify or ensure that the new world order will be faster. Is there anything more beyond time cargo build?

- Object files
- This represents the final result of running LLVM. It may be that
the best strategy is to "cache" compiled code in the form of an
rlib that is progessively patched, or it may be easier to store

"progessively" → "progressively"

@comex

comex commented Sep 29, 2015

Not a very helpful comment, but: 👍👍👍👍👍

@nikomatsakis
Contributor Author

@bstrie

Mind elaborating on what "demand driven style" entails and how it differs from our current approach?

By demand-driven style, what I meant was that we would build a dependency graph that we use to drive compilation. So, for example, we would begin by saying "we need to trans the main function" (assuming an application), so let's try to do that. But to trans the main function, we must know that it is correct, so that would require us to borrow check main. But to borrow check main, we must know it is type correct, so we would first type check it. This in turn would require knowing what its names refer to, so we would run name resolution. During type-checking, we would collect the signature of each fn that gets called, and each of those fns would then get explored as well, since we can't type-check main without knowing those signatures. Once we're done with main, we'd do the same procedure for every other fn in the crate. At the end, we'd have translated everything, but we do so depth-first rather than breadth-first (see the sketch below). Make sense?
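
In pseudo-Rust, the control flow might look roughly like this (hypothetical function names; memoization, error handling and the actual passes are elided):

```rust
use std::collections::HashSet;

struct Compiler {
    done: HashSet<String>,
}

impl Compiler {
    fn trans(&mut self, f: &str) {
        if !self.done.insert(f.to_string()) {
            return; // already translated
        }
        self.borrow_check(f);
        // ...emit LLVM IR for `f`...
    }

    fn borrow_check(&mut self, f: &str) {
        self.type_check(f); // must be type-correct before borrowck
        // ...run borrowck on `f`...
    }

    fn type_check(&mut self, f: &str) {
        self.resolve_names(f); // need to know what its names refer to
        for callee in self.callees_of(f) {
            self.signature_of(&callee); // need callee signatures to check `f`
        }
        // ...run typeck on `f`...
    }

    fn resolve_names(&mut self, _f: &str) { /* name resolution for `f` */ }
    fn signature_of(&mut self, _f: &str) { /* collect the callee's signature */ }
    fn callees_of(&self, _f: &str) -> Vec<String> { Vec::new() }
}

fn compile_crate(c: &mut Compiler, fns: &[String]) {
    for f in fns {
        c.trans(f); // start from `main`, then every remaining fn in the crate
    }
}
```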

@nikomatsakis
Contributor Author

@shepmaster

Since you don't indicate that every path has a unique integer, this seems to imply that you'd have to know if there are any duplicate children before you start naming, or have some amount of mutability to go back and "fix" the first child when you see the second child.

In the actual implementation, every path element also has a disambiguating integer. This begins as zero, but when we create a new def-id, we check if the parent already has a child with that name and, if so, increment the disambiguating integer as many times as we have to until we get a unique name. I can tweak the RFC to reflect the impl more precisely.
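
A tiny sketch of that scheme (hypothetical types, not the actual rustc representation):

```rust
use std::collections::HashSet;

#[derive(Clone, PartialEq, Eq, Hash)]
struct PathElem {
    name: String,
    disambiguator: u32, // 0 unless a sibling already uses this name
}

#[derive(Default)]
struct Children {
    seen: HashSet<PathElem>,
}

impl Children {
    fn new_child(&mut self, name: &str) -> PathElem {
        let mut elem = PathElem { name: name.to_string(), disambiguator: 0 };
        // Bump the integer until the (name, disambiguator) pair is unique.
        while self.seen.contains(&elem) {
            elem.disambiguator += 1;
        }
        self.seen.insert(elem.clone());
        elem
    }
}
```

With this, the second `impl` simply gets disambiguator 1; the first child never needs to be revisited.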

@nikomatsakis
Contributor Author

@shepmaster

I believe this is an understood limitation, but it may be worth pointing out again that this wouldn't allow bar to be inlined into foo, otherwise you'd end up with two versions of bar.

I don't know what you mean here, actually. Do you mean that if we inlined, then the graph would be wrong? Because that is not the case: this graph refers to the front end's view of things, which is captured before inlining etc. takes place. When we actually do codegen, if foo and bar are placed into the same codegen unit, then yes, LLVM may choose to do inlining (and that would be reflected in the dependency graph). I don't think I have a good example graph showing how that would work, but it's described textually in the section on optimization.

@nikomatsakis
Contributor Author

Hear ye, hear ye. This RFC is now entering final comment period.

@michaelwoerister
Member

I think this RFC is good to go. Conceptually it seems sound to me and it contains enough of a concrete outline to start implementing.

@Ericson2314
Contributor

I get that as a mere rust user that has never contributed to rustc, I'm basically pontificating on these design decisions that don't affect any public interface. But might somebody comment on whether the alternative of building the dependency graph explicitly and then processing it (lazily or otherwise) as I wrote earlier was considered?

@nikomatsakis
Contributor Author

But might somebody comment on whether the alternative of building the dependency graph explicitly and then processing it (lazily or otherwise) as I wrote earlier was considered?

Sorry, I meant to reply to your comment earlier. I did consider that design and I suspect that ultimately we will actually do a bit of both --- however, I very much want to prevent the dependency graph and the code from falling out of sync. We have definitely had bad experience in this respect. Simply building a graph a priori can very easily fall into this trap. If we do build up a graph up-front, I want to try and refactor the code such that requesting data where there is no graph edge fails (perhaps by asserting that the graph edge exists, or by restructuring the API in some way that it's not even possible).
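
To make that concrete (purely a sketch, with made-up names): reads would go through a wrapper that fails unless the currently active task has a declared edge to the data being read.

```rust
use std::collections::{HashMap, HashSet};

#[derive(Clone, PartialEq, Eq, Hash, Debug)]
struct DepNode(String);

struct TrackedMap<V> {
    data: HashMap<DepNode, V>,
    current_task: Option<DepNode>,
    edges: HashSet<(DepNode, DepNode)>, // (reader task, node being read)
}

impl<V> TrackedMap<V> {
    fn get(&self, node: &DepNode) -> &V {
        let task = self
            .current_task
            .as_ref()
            .expect("reads are only allowed inside a task");
        // Requesting data without a corresponding graph edge is a hard error.
        assert!(
            self.edges.contains(&(task.clone(), node.clone())),
            "missing dependency edge from {:?} to {:?}",
            task,
            node
        );
        &self.data[node]
    }
}
```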


@Ericson2314
Contributor

@nikomatsakis Thank you, that is very reassuring. I absolutely agree on the soundness issue; in fact I'd say that without refactoring to make sure the graph traversals are correct by construction, there's hardly any point in taking my route. It sounds like your view is that the implicit dependency route is a good way to accurately catch all dependencies without forcing the big refactor, but that explicit dependencies are a decent end goal?

@nikomatsakis
Contributor Author

I think there will always be some of both. Some dependencies at least cannot be constructed "up front" but rather must be discovered -- for example, we have to do method resolution and type-checking to know what other fns are referenced and hence which dependencies exist.


@Ericson2314
Contributor

Ah. I envisioned stuff like that working by the traversal of one graph creating another.

@Ericson2314
Contributor

To clarify. Suppose we have something like token tree -(macros)-> collection of items -(type-checking and method resolution...)-> collection of MIR -(llvm)-> collection of bitcode.

To really do laziness right with this, not only would the graphs be traversed lazily, but also created lazily. The MIR for each function would be bundled with a thunk to generate the MIR for all referenced functions.
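
A toy version of what I mean, with made-up types (`OnceCell` standing in for whatever thunk/laziness machinery one would actually use):

```rust
use std::cell::OnceCell;
use std::rc::Rc;

struct Mir { /* lowered body of one fn */ }

struct MirNode {
    mir: Mir,
    // Each referenced fn is represented by a thunk; forcing it lazily
    // creates the next piece of the graph.
    callees: Vec<LazyMir>,
}

struct LazyMir {
    cell: OnceCell<Rc<MirNode>>,
    build: Box<dyn Fn() -> Rc<MirNode>>,
}

impl LazyMir {
    fn force(&self) -> Rc<MirNode> {
        self.cell.get_or_init(|| (self.build)()).clone()
    }
}
```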

[For any Nix users out there (cough @eddyb cough) this is related to doing things like import (import ./foo.nix).]

@Ericson2314
Contributor

Finally, I mentioned earlier I'd love to write some generic library to persist/cache all that. To make that a bit more concrete, I was thinking of something like https://github.com/dmbarbour/haskell-vcache or https://github.com/mirage/irmin along with some infrastructure to serialize thunks.

@bkoropoff

This looks great to me. The greatest challenge is going to be building a dependency graph that is as precise as possible (to get maximum benefit) without introducing unsoundness. I don't see any silver bullets here; just "be really careful" and "test a lot".

There may be an interesting class of source code changes affecting lifetime or variance inference where typechecking artifacts are invalidated, but it is theoretically possible to avoid invalidating trans artifacts since lifetimes are erased by then. I haven't thought of any concrete examples that would be worth exploiting, however.
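
For illustration only (a made-up example of the class of change I mean, not necessarily one worth exploiting):

```rust
// Before the edit, the signature spelled the lifetime out explicitly:
//
//     fn first<'a>(v: &'a Vec<String>) -> &'a str { &v[0] }
//
// After the edit it relies on elision. The item's hash (and therefore the
// typeck artifacts that read it) changes, but after lifetime erasure the
// generated code is identical, so the trans artifacts are theoretically
// still valid.
fn first(v: &Vec<String>) -> &str {
    &v[0]
}
```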

@michaelwoerister
Member

One thing that is not mentioned in the RFC at all yet is monomorphization and the consequences it has.
The general case of a dependency graph with generic items will look more like the following:

BODY(foo) ----------------------------> TYPECK(foo) ----------------> MIR(foo)
                                          ^ ^ ^ ^                      |
SIG(foo) ----> COLLECT(foo)               | | | |         +------------+------------+
                 |                        | | | |         |            |            |
                 +--> ITEM_TYPE(foo) -----+ | | |         v            v            v
                 +--> PREDICATES(foo) ------+ | |      LLVM(foo'1)  LLVM(foo'2)  LLVM(foo'3)
                                              | |         |            |            |
SIG(bar) ----> COLLECT(bar)                   | |         v            v            v
                 |                            | |    OBJECT(foo'1) OBJECT(foo'2) OBJECT(foo'3)
                 +--> ITEM_TYPE(bar) ---------+ |
                 +--> PREDICATES(bar) ----------+

One complication I can see here is that we can only know after type-checking which monomorphizations are still used, but the proposed algorithm already wants to garbage-collect the on-disk cache right after building the HIR. This has to be accounted for somehow.
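
As a concrete (hypothetical) example of the shape above, foo'1, foo'2 and foo'3 would be three monomorphizations that are only discovered once callers like bar are type-checked and translated:

```rust
fn foo<T: Clone>(x: T) -> T {
    // The generic MIR for `foo` is produced once; LLVM/object artifacts are
    // produced per monomorphization.
    x.clone()
}

fn bar() {
    foo(1u8);            // foo'1 = foo::<u8>
    foo("hello");        // foo'2 = foo::<&str>
    foo(vec![0i32]);     // foo'3 = foo::<Vec<i32>>
}
```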

@nikomatsakis
Contributor Author

True. We don't actually know what monomorphizations we want until trans; type-checking doesn't expand things out. I was thinking about monomorphizations at some point, but I don't remember just what I had in mind. Regarding GCing of monomorphizations, I think I was originally thinking that we would just keep all monomorphizations of foo until foo changed. This does mean though that we might keep some monomorphizations we no longer need (because they were only being used by bar, and bar changed). It's also true that the "cache" on disk would have to include the types in the key, something that the RFC doesn't really discuss explicitly.

I've also been thinking about what it would take to do an early target that JUST saves LLVM IR and object code. This will require doing a few things slightly differently, but seems like a good first "spike goal":

  1. We would always recompute the signatures for all items, whether they've changed or not. This is because of the next point.
  2. As you point out, we'll have to type-check the bodies for generic fns that are potentially called, as we may need new monomorphizations thereof. Probably the easiest way to start would be type-checking all bodies too, or at least all generic bodies. I think the easiest way to address this would be by saving and re-loading the MIR, which once it is in use ought not to be that hard.


@arielb1
Contributor

arielb1 commented Oct 23, 2015

We already save the type-checked body of monomorphizable fns.

@nikomatsakis
Contributor Author

@arielb1

We already save the body of monomorphizable fns.

Yes, but what we are mostly talking about is preserving the monomorphized LLVM bitcode.

Well, I guess I was saying that for a first draft, it might not be worth trying to reuse the type-checked body at first. This is because currently we save the body as part of the metadata in the final end-product, and it would be work (however little) to save that data somewhere else. Clearly eventually we want to. I'm mostly just trying to work out what is the smallest thing we can get working to start.


@arielb1
Contributor

arielb1 commented Oct 23, 2015

@nikomatsakis

Maybe convert all translation to use inlining and save the serialized data (we would also need to have some way of stably comparing it for this to work). Using serialized MIR instead of serialized AST may make this easier, but I feel like the issues are orthogonal.

@nikomatsakis
Contributor Author

@arielb1 I'm not clear on what problem you are proposing to solve here? (I don't even see that there is a problem that needs solving)


@michaelwoerister
Member

  1. We would always recompute the signatures for all items, whether they've changed or not. This is because
  2. As you point out, we'll have to type-check the bodies for generic fns that are potentially called, as we may need new monomorphizations thereof. Probably the easiest way to start would be type-checking all bodies too, or at least all generic bodies. I think the easiest way to address this would be by saving and re-loading the MIR, which once it is in use ought not to be that hard.

Isn't it proposed anyway that the complete set of items is hashed on every compilation?
I think it should not be a problem to just cache object code for starters. Only the dependency graph must be complete and not produce false negatives.

## Basic usage

The basic usage will be that one enables incremental compilation using
a compiler flag like `-C incremental-compilation=TMPDIR`. The `TMPDIR`

Do you expect that Cargo will pass this flag by default for all projects?

@nikomatsakis
Contributor Author

Huzzah! The compiler team has decided to accept this RFC. The expectation is that the actual impl will discover numerous surprises (we've already found a few) that require adjustments, and that we will come back and update the RFC to be more in line with the final design when that has shaken out a bit.

@nikomatsakis nikomatsakis merged commit 59b01f1 into rust-lang:master Nov 6, 2015
@matthewhammer

There's lots of interesting talk about incremental computation in this thread, which is great!

In case anyone was wondering about PL research literature on this topic, these researchers have also been thinking about incremental, demand-driven compilation / computation:

The first paper is more recent, and specialized to a situation similar to the one described in the discussion above (incremental compilation, using demand-driven, dynamic dependency graphs). The second paper gives a general approach for such incremental, demand-driven computations. There is follow-on work on adapton.org.

@White-Oak

Any update on the state of implementation?

@jonas-schievink
Contributor

@White-Oak Creation of a dependency graph is being done in rust-lang/rust#30532

@comex comex mentioned this pull request Apr 21, 2016
@Centril Centril added the A-incremental Proposals relating to incremental compilation. label Nov 23, 2018