ci: attempt to shard huge test targets more #98834

healthy-pod · 2023-03-17T02:49:13Z

This change increases the shard count of some test
targets that could benefit from more sharding. It
also splits unit tests in TeamCity into two runs:
ccl unit tests and non-ccl unit tests. The current
build config will run the non-ccl unit tests, and a
new build config will run the ccl tests (those
under pkg/ccl). This cuts unit test wall time in
half while keeping machine time almost the same.

Release note: None
Epic: none

blathers-crl · 2023-03-18T20:37:32Z

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.

jlinder · 2023-03-21T14:37:41Z

pkg/BUILD.bazel

@@ -3376,45 +3376,59 @@ test_suite(
 )

 test_suite(
-    name = "small_tests",
+    name = "ccl_tests",


Why not maintain the small, medium, large, enormous categories for CCL tests?

Good question. We don't use them, though we can easily add them if we need to.

jlinder · 2023-03-21T14:50:03Z

pkg/ccl/logictestccl/tests/3node-tenant/BUILD.bazel

@@ -12,8 +12,11 @@ go_test(
        "//pkg/sql/logictest:testdata",  # keep
        "//pkg/sql/opt/exec/execbuilder:testdata",  # keep
    ],
-    shard_count = 16,
-    tags = ["cpu:2"],
+    shard_count = 48,


A few questions:

How does shard count interact with cpu:2 (or cpu:1 in other cases)?

Why change some to 48 but not others?

> Why change some to 48 but not others?

We upload the Bazel trace profile to artifacts on each unit test run. I looked at some runs, found targets that had large shards or were bottlenecking the run, and sharded them more: mainly logictests, backupccl, schemachangerccl, and kvserver. If a logictests package has fewer than 48 tests, it gets n shards, where n is the number of tests.

> How does shard count interact with cpu:2 (or cpu:1 in other cases)?

Good question. Looking.

> How does shard count interact with cpu:2 (or cpu:1 in other cases)?

Each shard gets n cores if cpu:n is used. I tried changing it but didn't notice any difference, so I preferred to keep it unchanged.
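
For readers following the thread, here is a minimal Go sketch of the two rules described above: a logictest package gets one shard per test up to a cap of 48, and each shard reserves n cores when the target is tagged cpu:n. The names `maxShards`, `shardCount`, and `coresReserved` are hypothetical and are not taken from the PR. The floor of one shard for an empty package is also an assumption.

```go
package main

import "fmt"

// maxShards is the cap mentioned in the thread for logictest packages.
const maxShards = 48

// shardCount returns one shard per test, capped at maxShards.
// The floor of 1 for empty packages is an assumption of this sketch.
func shardCount(numTests int) int {
	if numTests < 1 {
		return 1
	}
	if numTests < maxShards {
		return numTests
	}
	return maxShards
}

// coresReserved returns how many cores a target reserves if all of its
// shards run at the same time and each shard is tagged cpu:<cpusPerShard>.
func coresReserved(shards, cpusPerShard int) int {
	return shards * cpusPerShard
}

func main() {
	for _, n := range []int{10, 48, 200} {
		s := shardCount(n)
		fmt.Printf("tests=%d shard_count=%d cores_with_cpu2=%d\n", n, s, coresReserved(s, 2))
	}
}
```

Under these assumptions, a target with 48 shards and cpu:2 would reserve 96 cores if every shard were scheduled at once, which may be why changing the tag made no visible difference on machines with fewer cores.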

rickystewart

Looks fine to me.

rickystewart · 2023-03-22T18:16:59Z

pkg/cmd/generate-bazel-extra/main.go

    tags = [
        "-broken_in_bazel",
        "-flaky",
        "-integration",
        "%[1]s",
+        "-ccl_test"


Nit: Keep the list alphabetized? (Put -ccl_test after -broken_in_bazel)

benbardin · 2023-03-22T19:12:05Z

LGTM for backupccl

healthy-pod · 2023-03-22T23:29:46Z

TFTRs!

bors r=rickystewart

craig · 2023-03-23T00:39:12Z

Build succeeded:

Bazel Essential CI (Cockroach)

healthy-pod force-pushed the shard-tests branch 7 times, most recently from 7e0de19 to aa5994c Compare March 18, 2023 20:37

healthy-pod force-pushed the shard-tests branch 11 times, most recently from 0da47f0 to b311d92 Compare March 20, 2023 19:26

healthy-pod requested review from jlinder, rail and rickystewart March 20, 2023 21:03

healthy-pod force-pushed the shard-tests branch from b311d92 to f91c8a7 Compare March 20, 2023 21:20

jlinder reviewed Mar 21, 2023

healthy-pod force-pushed the shard-tests branch 5 times, most recently from c5b816a to 8d35f3d Compare March 21, 2023 21:24

healthy-pod requested review from a team as code owners March 22, 2023 17:59

healthy-pod requested review from a team, herkolategan, renatolabs, cucaroach, bananabrick, benbardin and miretskiy and removed request for a team, herkolategan, renatolabs, cucaroach, bananabrick, benbardin and miretskiy March 22, 2023 17:59

rickystewart approved these changes Mar 22, 2023

healthy-pod force-pushed the shard-tests branch from 0d5f80e to 69921eb Compare March 22, 2023 21:33

craig bot merged commit 384c55d into cockroachdb:master Mar 23, 2023

andyyang890 mentioned this pull request Apr 3, 2023

logictest: ensure generated ccl build files have the right tags #99521

Merged
