Add scrooge toolchain #1116

liucijus · 2020-09-28T08:11:08Z

Scrooge toolchain, part of #940

I've grouped deps mostly mechanically, and also glued some of the code together to have less dependency groups. I think it' a perfect time to review grouping, naming and if needed refactor them.

liucijus · 2020-09-28T08:13:35Z

@blorente, @wisechengyi I would appreciate your input!

blorente · 2020-09-28T16:28:53Z

Thank you for putting this together! I ran out of time today, but will review it by EOD tomorrow.

ittaiz · 2020-09-28T18:11:48Z

@ianoc @beala-stripe @andyscott your inputs would be appreciated as well

ianoc · 2020-09-28T18:24:06Z

Does this avoid the host/target deps mixing problem that caused the revert for the protobuf rules?

liucijus · 2020-09-29T07:30:29Z

Does this avoid the host/target deps mixing problem that caused the revert for the protobuf rules?

Probably it doesn't avoid, the question if that's a problem or not. I would appreciate if someone can point this out.

ianoc · 2020-09-29T15:05:45Z

Mixing host/target deps and having both sets of deps on the dependencies for targets is a problem yeah -- bloats all targets by doubling the jars involved, duplicate classes and slows down compilation. We had to revert using deps on the toolchain of the protobuf stuff since it was bad enough to users/experience

ianoc · 2020-09-29T15:07:36Z

one issue related to this was #797

blorente · 2020-09-29T16:51:52Z

I'm still in the process of trying to use this change with our code, but I noticed something:

If I understand correctly, if I want to express "I want //external:io_bazel_rules_scala/dependency/thrift/scrooge_core to instead be //some/internal/target", I'd need to at least declare a declare_deps_provider for every current one that expresses a dependency on it, so compile_classpath_provider, aspect_compile_classpath_provider and compiler_classpath_provider.

This is because the rules under twitter_scrooge are not toolchain-aware, and therefore they can't just pull the deps they need from a common toolchain.

If this is the case, I think it would be good to make the rules toolchain-aware (would be happy to work on it), and then separate the dependencies to have one provider for each dep, allowing for easy swapping in and out.

This way, I think we'd also avoid mixing host and target deps, as they can pull what they need from the toolchain when building the classpath.

blorente · 2020-09-29T17:04:22Z

For what it's worth, I think this change as it exists now won't mix the host/target dependencies in the same way as #797, as long as people are careful with which dep providers overwrite.

liucijus · 2020-09-30T06:56:58Z

I'm still in the process of trying to use this change with our code, but I noticed something:

If I understand correctly, if I want to express "I want //external:io_bazel_rules_scala/dependency/thrift/scrooge_core to instead be //some/internal/target", I'd need to at least declare a declare_deps_provider for every current one that expresses a dependency on it, so compile_classpath_provider, aspect_compile_classpath_provider and compiler_classpath_provider.

Yes

This is because the rules under twitter_scrooge are not toolchain-aware, and therefore they can't just pull the deps they need from a common toolchain.

If this is the case, I think it would be good to make the rules toolchain-aware (would be happy to work on it), and then separate the dependencies to have one provider for each dep, allowing for easy swapping in and out.

deps provider is what I call a group of deps. How much granularity and how those groups are used are what need to be designed here. There may be multiple ways to design it: have less groups, but include the same dep into multiple groups, or have fine grained providers for each dep. I think conceptually having 1 to 1 mapping between provider and the dep is less flexible. For example, adding a new dependency will require changes in the rule implementation. BUT, from my expedience, most of the mappings we have rarely change and it's hard to predict how it will change if ever. I believe as a user of the scrooge rules you know better which of these patterns are better for the ruleset.

@blorente feel free to make scrooge rules toolchain aware. Let me know if you want me close this PR.

blorente

For what it's worth, I think this change as it exists now won't mix the host/target dependencies in the same way as #797, as long as people are careful with which dep providers overwrite.
On a second look, it does seem like scrooge_{scala, java}_library targets leak their host dependencies (called implicit_deps in twitter-scrooge) to the compile classpath (the JavaInfo they build here exports the merged list of all dependencies calculated here).

However, this is something that happened before this change, and possibly out of scope for it.

It's worth noting that, in general, a scrooge_{scala, java}_library will only export the compiled jars of other scrooge_*_library targets, plus its implicit_deps. We could investigate whether it's possible to stop exporting the implicit_deps (by changing how we build the JavaInfo, here), but it may be out of scope for this PR, since it's pre-existing behaviour.

@blorente feel free to make scrooge rules toolchain aware. Let me know if you want me close this PR.

Unfortunately, due to other circumstances I don't think I'll have the time to do this properly in the near future, so landing this sounds good, and I can work on making it toolchain-aware later.

I think conceptually having 1 to 1 mapping between provider and the dep is less flexible. For example, adding a new dependency will require changes in the rule implementation.

Yeah, this is true. If we want to allow users to customize as much as possible without touching the rule, your current grouping, "by usage", is the best, with the possible tweak of my comment above. It makes things a bit harder for our use case of "I want to say exactly where this dep comes from, in every instance of it", but it still allows it, so it's good :)

It's unfortunate that we can't create dependencies between dep providers. That way, we'd be able to define dep providers that "provide a single dep", and other providers that "provide a classpath".

blorente · 2020-10-01T13:14:49Z

twitter_scrooge/BUILD

+)
+
+declare_deps_provider(
+    name = "scrooge_generator_provider",


Given that the other groupings are named "by intention", could we name this something like scrooge_worker_classpath_provider?

I feel like the goal here is to allow someone to grow and shrink these classpaths separately, so even if someone wants to implement a rule that uses "just the scrooge generator", they still shouldn't to use this dep provider.

ianoc · 2020-10-01T15:19:32Z

@blorente I might be misunderstanding your post, was tough to parse the links -- but I think the notion is mixing up host dependencies vs compile dependencies. Its not good to export the scrooge compiler but the issue isn't that its the scrooge compiler coming, its dependencies from the host toolchain (since the scrooge compiler can exist in both the host and target toolchains).

liucijus · 2020-10-05T08:45:30Z

I think we have two options with this toolchain:

Do not merge and wait until toolchains fully support configuration transitions (not clear when)
Merge, for folks that need transitions, they can get them with bazel 3.5.0 with a flag --incompatible_override_toolchain_transition

Other alternatives welcome! Can we agree on which option we go? I am personally biased towards option 2.

blorente · 2020-10-05T09:29:14Z

Personally, I'm okay with option 2. As it stands, after bdc6952, this PR is an improvement over the previous version of the code, and while it complicates things for Twitter a bit, it also brings amazing flexibility.

ianoc · 2020-10-05T15:09:35Z

For (2) should we be following bazelbuild/bazel#11584 which seems to suggest using rule options until everyone has migrated and things can be flipped. I believe that would probably limit the splash damage of any change to inside rules_scala and avoid other rules of people's breaking?

Given this can break ~any other rule set if you flip this option from reading the issues on it, I'd be -1 to requiring it in for scrooge without duplicate classes. It makes the usual case a regression. -- but if we can flip it for the rule + add a test to ensure we don't have duplicated classes(the proto test can probably be copied and pasted over is my guess). We would just have the rules require the latest bazel which we've done before.

liucijus · 2020-10-06T07:08:33Z

I think the only reasonable implementation for (2) is to use --incompatible_override_toolchain_transition flag. Otherwise users will be forced to upgrade bazel during the migration. If having flag is too confusing, I think it's worth waiting until new toolchain transitions are enabled by default on bazel.

ianoc · 2020-10-06T16:15:16Z

@liucijus Its not that I think the flag is confusing, but my understanding the flag changes the behavior of all rules and isn't localized. To use the flag you need a recent version of bazel anyway. We've mandated bazel updates before to update the rules. So unless you update bazel you'd have a regression in performance/size of outputs, and possibly some correctness issues as was seen in the scalapb, which feel like bigger issues than a version bump? to me anyway

ianoc · 2020-10-06T16:15:49Z

(personally I think requiring bazel 3.5 isn't a big deal, eventually ~everyone is going to have to do this in order to follow the migration guideline of this feature I believe)

ittaiz · 2020-10-11T19:41:50Z

I'm ok with requiring 3.5 given we can clearly articulate the need (and it sounds that here we can)

ittaiz · 2020-10-14T08:32:17Z

I've thought about it some more (thanks @liucijus for clarifying some more on the various options) and I think @ianoc's approach is best. Let's require 3.5 and add the incompatible_use_toolchain_transition to the rule definition.
I understand we'll need to break people again when incompatible_use_toolchain_transition will be removed/renamed/whatever and I think that price is worth it (compared to requiring people to turn on the flag for every rule set)

liucijus · 2020-10-19T11:20:11Z

I have added a test to verify if there are host deps in the classpath

liucijus · 2020-10-19T11:20:44Z

I think this PR is ready to be merged

liucijus · 2020-10-21T08:34:23Z

Just to clarify, I haven't added incompatible_use_toolchain_transition = True to rules, because I was unable to reproduce a problem with a test: 6b71671

ittaiz · 2020-10-21T12:12:45Z

Thanks. Sounds reasonable.
@blorente ill merge this on Friday in case you want to try and show the need for the attribute with a failing test by then

blorente · 2020-10-26T11:59:40Z

Sorry for not responding quicker, I was on PTO and didn't have a chance to look at it. This PR looks good, thanks @liucijus for adding the test!

* Add scrooge toolchain * Rename dep provider for scrooge generator * Migrate thrift and scrooge rules cfg from host to exec * Add test to ensure scrooge host and target deps are not mixed

liucijus requested a review from ittaiz as a code owner September 28, 2020 08:11

googlebot added the cla: yes label Sep 28, 2020

blorente reviewed Oct 1, 2020

View reviewed changes

liucijus mentioned this pull request Oct 9, 2020

Keep all maven deps in the central place for easier version management #1113

Merged

liucijus mentioned this pull request Oct 15, 2020

Require Bazel 3.5.0 #1122

Merged

Vaidas Pilkauskas added 4 commits October 16, 2020 17:29

Add scrooge toolchain

d47555c

Rename dep provider for scrooge generator

eb70abd

Migrate thrift and scrooge rules cfg from host to exec

ede2931

Add test to ensure scrooge host and target deps are not mixed

6b71671

liucijus force-pushed the scrooge-toolchain branch from bdc6952 to 6b71671 Compare October 19, 2020 07:55

ittaiz merged commit 1d6cc4f into bazelbuild:master Oct 23, 2020

liucijus mentioned this pull request Dec 2, 2020

create a scrooge toolchain #402

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add scrooge toolchain #1116

Add scrooge toolchain #1116

liucijus commented Sep 28, 2020

liucijus commented Sep 28, 2020

blorente commented Sep 28, 2020

ittaiz commented Sep 28, 2020

ianoc commented Sep 28, 2020

liucijus commented Sep 29, 2020

ianoc commented Sep 29, 2020

ianoc commented Sep 29, 2020

blorente commented Sep 29, 2020

blorente commented Sep 29, 2020

liucijus commented Sep 30, 2020

blorente left a comment

blorente Oct 1, 2020

liucijus Oct 2, 2020

ianoc commented Oct 1, 2020

liucijus commented Oct 5, 2020

blorente commented Oct 5, 2020

ianoc commented Oct 5, 2020

liucijus commented Oct 6, 2020

ianoc commented Oct 6, 2020

ianoc commented Oct 6, 2020

ittaiz commented Oct 11, 2020

ittaiz commented Oct 14, 2020

liucijus commented Oct 19, 2020

liucijus commented Oct 19, 2020

liucijus commented Oct 21, 2020

ittaiz commented Oct 21, 2020

blorente commented Oct 26, 2020

Add scrooge toolchain #1116

Add scrooge toolchain #1116

Conversation

liucijus commented Sep 28, 2020

liucijus commented Sep 28, 2020

blorente commented Sep 28, 2020

ittaiz commented Sep 28, 2020

ianoc commented Sep 28, 2020

liucijus commented Sep 29, 2020

ianoc commented Sep 29, 2020

ianoc commented Sep 29, 2020

blorente commented Sep 29, 2020

blorente commented Sep 29, 2020

liucijus commented Sep 30, 2020

blorente left a comment

Choose a reason for hiding this comment

blorente Oct 1, 2020

Choose a reason for hiding this comment

liucijus Oct 2, 2020

Choose a reason for hiding this comment

ianoc commented Oct 1, 2020

liucijus commented Oct 5, 2020

blorente commented Oct 5, 2020

ianoc commented Oct 5, 2020

liucijus commented Oct 6, 2020

ianoc commented Oct 6, 2020

ianoc commented Oct 6, 2020

ittaiz commented Oct 11, 2020

ittaiz commented Oct 14, 2020

liucijus commented Oct 19, 2020

liucijus commented Oct 19, 2020

liucijus commented Oct 21, 2020

ittaiz commented Oct 21, 2020

blorente commented Oct 26, 2020