Tests with "exclusive" tag do not get results from remote cache #3791

grandseiken · 2017-09-22T10:45:02Z

Description of the problem / feature request / question:

I am trying out the remote cache functionality described here. Currently I'm running against a local hazelcast instance. This works great most of the time. One thing I noticed is that tests with the exclusive tag, despite being cached as normal in the local cache, will not seem to use the remote cache and will always rerun if there is no passing result in the local cache. This seems like an oversight?

If possible, provide a minimal example to reproduce the problem:

BUILD

cc_test(
  name = "test",
  srcs = ["test.cc"],
  tags = ["exclusive"], # comment this out to fix the problem
)

test.cc

#include <chrono>
#include <thread>

int main() {
  std::this_thread::sleep_for(std::chrono::seconds(10));
  return 0;
}

.bazelrc

startup --host_jvm_args=-Dbazel.DigestFunction=SHA1

build --spawn_strategy=remote
build --experimental_strict_action_env
build --remote_rest_cache=http://localhost:5701/hazelcast/rest/maps/cache

test --spawn_strategy=remote
test --experimental_strict_action_env
test --remote_rest_cache=http://localhost:5701/hazelcast/rest/maps/cache

Environment info

Operating System:
Ubuntu
Bazel version (output of bazel info release):
release 0.5.3
Hazelcast version
3.8.6

Anything else, information or logs or outputs that would be helpful?

Here's an example run. The second test will pass in 0.0s if either the exclusive tag is removed, or the second bazel clean command is omitted.

$ bazel clean && bazel test //... && bazel clean && bazel test //...                                         
INFO: Reading 'startup' options from /home/stu/tmp/.bazelrc: --host_jvm_args=-Dbazel.DigestFunction=SHA1
INFO: Starting clean (this may take a while). Consider using --async if the clean takes more than several minutes.
INFO: Reading 'startup' options from /home/stu/tmp/.bazelrc: --host_jvm_args=-Dbazel.DigestFunction=SHA1
INFO: Analysed target //:test (9 packages loaded).
INFO: Found 1 test target...
Target //:test up-to-date:
  bazel-bin/test
INFO: Elapsed time: 10.405s, Critical Path: 10.20s
INFO: Build completed successfully, 7 total actions
//:test                                                                  PASSED in 10.0s

Executed 1 out of 1 test: 1 test passes.
There were tests whose specified size is too big. Use the --test_verbose_timeout_warnings command line option to see which ones these are.
INFO: Reading 'startup' options from /home/stu/tmp/.bazelrc: --host_jvm_args=-Dbazel.DigestFunction=SHA1
INFO: Starting clean (this may take a while). Consider using --async if the clean takes more than several minutes.
INFO: Reading 'startup' options from /home/stu/tmp/.bazelrc: --host_jvm_args=-Dbazel.DigestFunction=SHA1
INFO: Analysed target //:test (9 packages loaded).
INFO: Found 1 test target...
Target //:test up-to-date:
  bazel-bin/test
INFO: Elapsed time: 10.209s, Critical Path: 10.02s
INFO: Build completed successfully, 7 total actions
//:test                                                                  PASSED in 10.0s

Executed 1 out of 1 test: 1 test passes.
There were tests whose specified size is too big. Use the --test_verbose_timeout_warnings command line option to see which ones these are.

The text was updated successfully, but these errors were encountered:

damienmg · 2017-09-22T11:46:34Z

/cc @philwo because I remember there was some discussion because it also drop out of sandbox when we do that.

hlopko · 2017-10-10T15:28:08Z

@buchgr

philwo · 2018-07-19T14:29:54Z

Reassigning to @buchgr, because remote execution.

RNabel · 2019-08-05T14:25:06Z

@buchgr Is there a timeline for fixing this?

buchgr · 2019-08-06T07:53:00Z

@RNabel yes. should be in 0.29.0

phb · 2019-08-20T23:44:56Z

@buchgr it looks like your fix #8983 got rolled back and as far as I can tell, not included in 0.29

buchgr · 2019-08-21T09:34:51Z

yeah it did get rolled back :(. I broke tons of code internally. I ll try to rollforward for the 1.0 release.

mandrean · 2019-12-02T12:33:36Z

@buchgr any updates on this ticket?

buchgr · 2019-12-02T15:29:44Z

Yes, unfortunately it got rolled back because it broke many internal tests. I'll need to fix those before I can roll it forward. I don't have the cycles to do this right now.

Qinusty · 2020-01-30T15:04:36Z

This patch is extremely beneficial, any timeline towards getting this merged?

buchgr · 2020-02-03T13:28:15Z

I unfortunately no longer work on Bazel and won't have the time to fix all the internal tests broken by this change.

olib963 · 2020-02-21T15:01:45Z

Is there anyone that would be able to pick this up or is it possible for someone external to pick this work up? This would be a really helpful, we currently have to work around this by calling the exclusive tests ourselves after the tests that can be run in parallel.

cocreature · 2020-05-18T11:54:58Z

Hi, we’ve been using the patch from #8983 for a couple of months now without any issues. What I noticed is that this change enables both sandboxing and caching. Without knowing how the internal tests failed, could it be that you just need to add a local tag to disable sandboxing?

mcwilson07 · 2020-05-28T14:00:34Z

We would be really interested in getting this into Bazel as well. We have tests that need to run exclusively but we occasionally miss dependencies that would be caught with sandboxing.

stepango · 2020-06-04T19:45:36Z

Is there any timelines to get it fixed? If not how about introducing the flag to enable #8983 conditionally, in this case many teams could benefit from exclusive tests caching as well as keep internal tests green. Thoughts?

JayThomason · 2020-08-24T22:26:13Z

Here is a concrete use-case for why this bug is important:
We have a lot of tests that require usage of the GPU. These tests use the exclusive tag to prevent OOM issues that can arise when running multiple tests that require the GPU at once. Because of this bug these tests have to run every single time that we do a test CI build.

In my opinion this bug breaks one of the core features of bazel: targets are being built which do not need to be built because the dependencies did not change.

JayThomason · 2020-08-27T00:26:15Z

After some hours I was able to figure out a workaround for my use case.

We have a macro that wraps our test rule which I edited such that when the "exclusive" tag is set for a test instead of just creating that test target normally we actually create two test targets.

The first target is tagged exclusive as normal but also gets tagged gpu_test. This ensures that developers can still run bazel test foo/... and still have all their tests run as expected.

The second target is tagged manual and gpu_test. This test is effectively identical to the first but it will be excluded from any sort of foo/... queries in bazel.

The effect is that for CI we run bazel test with --test_tag_filters=-exclusive,-gpu_test to exclude the exclusive targets from our bazel test step while also adding a new test step where we query for the tests tagged as manual and gpu_test and pass those to bazel test with --jobs=1 to effectively run the tests one at a time while maintaining remote caching.

Note that the purpose of the gpu_test tag is to ensure that we aren't running other unnecessary manual tests in this step.

philwo · 2020-09-21T17:18:23Z

@coeuvre For reference, this was fixed in cl/260916180, but rolled back in cl/261644804.

This tells Bazel that the test requires 4 CPUs to run. On enormous machines, tests will run in parallel, on laptops tests will run serially or maybe 2 at a time. This strikes a better balance than exclusive tags. It also enables caching of ref tests; there is a known issue that prevents caching for `exclusive` tests: bazelbuild/bazel#3791

coeuvre · 2020-10-15T02:26:11Z

#8983 is rolled forward with fix as 5e5eb86:

Add --incompatible_exclusive_test_sandboxed flag so users can enable this feature conditionally.
With that flag enabled, users who want to run exclusive tests locally can add a 'local' tag.

MikhailTymchukFT · 2020-12-21T20:04:00Z

Which release contains this flag?

coeuvre · 2020-12-29T04:00:46Z

It should be contained in release 4.0. You can track the release here #12455.

gasparev · 2022-11-28T17:37:48Z

Hi, I'm trying to fix this but I need to test the change on the downstream projects. From the doc, it seems that the labels incompatible-change and migration-ready are needed. Can someone add them?

fmeum · 2022-11-28T18:30:02Z

@gasparev Could you create a separate tracking issue as explained in the docs? You can then ping @meteorcloudy to have him add the necessary flags.

Solves #3791 #16871 bazel-contrib/SIG-rules-authors#40 Steps to do before merging: - [x] Bazel checks green #16868 - [x] Downstream projects green https://buildkite.com/bazel/bazelisk-plus-incompatible-flags/builds/1348 Closes #16867. PiperOrigin-RevId: 493257706 Change-Id: I0f46d092a47a7c08e2436183c9cdc2cd92a0c379

gasparev · 2022-12-06T14:00:06Z

This can be closed after 23580aa

damienmg added P1 I'll work on this now. (Assignee required) type: bug labels Sep 22, 2017

hlopko added the category: performance label Oct 10, 2017

hlopko assigned philwo Oct 10, 2017

philwo assigned buchgr and unassigned philwo Jul 19, 2018

buchgr added P2 We'll consider working on this in future. (Assignee optional) and removed P1 I'll work on this now. (Assignee required) labels Jul 19, 2018

meisterT added team-Execution and removed category: performance labels Nov 29, 2018

jin added team-Remote-Exec Issues and PRs for the Execution (Remote) team and removed team-Execution labels Jan 14, 2019

buchgr removed their assignment Jan 9, 2020

philwo assigned coeuvre Sep 21, 2020

alexeagle mentioned this issue Mar 22, 2022

Flip default for --incompatible_exclusive_test_sandboxed bazel-contrib/SIG-rules-authors#40

Open

gasparev mentioned this issue Nov 28, 2022

Flip incompatible_exclusive_test_sandboxed #16867

Closed

2 tasks

gasparev mentioned this issue Nov 29, 2022

incompatible_exclusive_test_sandboxed: flip #16871

Closed

coeuvre closed this as completed Dec 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tests with "exclusive" tag do not get results from remote cache #3791

Tests with "exclusive" tag do not get results from remote cache #3791

grandseiken commented Sep 22, 2017 •

edited

Loading

damienmg commented Sep 22, 2017

hlopko commented Oct 10, 2017

philwo commented Jul 19, 2018

RNabel commented Aug 5, 2019

buchgr commented Aug 6, 2019

phb commented Aug 20, 2019

buchgr commented Aug 21, 2019

mandrean commented Dec 2, 2019

buchgr commented Dec 2, 2019

Qinusty commented Jan 30, 2020

buchgr commented Feb 3, 2020

olib963 commented Feb 21, 2020

cocreature commented May 18, 2020

mcwilson07 commented May 28, 2020

stepango commented Jun 4, 2020

JayThomason commented Aug 24, 2020

JayThomason commented Aug 27, 2020

philwo commented Sep 21, 2020

coeuvre commented Oct 15, 2020

MikhailTymchukFT commented Dec 21, 2020

coeuvre commented Dec 29, 2020

gasparev commented Nov 28, 2022

fmeum commented Nov 28, 2022

gasparev commented Dec 6, 2022

Tests with "exclusive" tag do not get results from remote cache #3791

Tests with "exclusive" tag do not get results from remote cache #3791

Comments

grandseiken commented Sep 22, 2017 • edited Loading

Description of the problem / feature request / question:

If possible, provide a minimal example to reproduce the problem:

Environment info

Anything else, information or logs or outputs that would be helpful?

damienmg commented Sep 22, 2017

hlopko commented Oct 10, 2017

philwo commented Jul 19, 2018

RNabel commented Aug 5, 2019

buchgr commented Aug 6, 2019

phb commented Aug 20, 2019

buchgr commented Aug 21, 2019

mandrean commented Dec 2, 2019

buchgr commented Dec 2, 2019

Qinusty commented Jan 30, 2020

buchgr commented Feb 3, 2020

olib963 commented Feb 21, 2020

cocreature commented May 18, 2020

mcwilson07 commented May 28, 2020

stepango commented Jun 4, 2020

JayThomason commented Aug 24, 2020

JayThomason commented Aug 27, 2020

philwo commented Sep 21, 2020

coeuvre commented Oct 15, 2020

MikhailTymchukFT commented Dec 21, 2020

coeuvre commented Dec 29, 2020

gasparev commented Nov 28, 2022

fmeum commented Nov 28, 2022

gasparev commented Dec 6, 2022

grandseiken commented Sep 22, 2017 •

edited

Loading