Action inputs to dispatch n-runs of a single test in CI #6297

carlydf · 2024-07-16T05:04:30Z

What changed?

Add workflow dispatch options to the functional tests GitHub Actions workflow so that we can run n iterations of a single functional test with a configurable timeout.

There is also an option to run n iterations of a single unit test, although it may be faster to run that locally.

WARNING: For functional tests, the run will definitely be OOM-killed for n>=100, and likely for n>=50 too. I suggest starting with n=20 to see how memory usage goes, then increasing from there. Different DBs may also use different amounts of RAM.
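
For the local unit-test route, repeating a single test maps onto standard `go test` flags (`-run`, `-count`, `-timeout`). A minimal sketch, assuming a hypothetical test name and package path (neither is taken from this PR); the helper only prints the command so it can be inspected before running:

```shell
#!/usr/bin/env bash
# Print (not execute) a `go test` command that repeats one test n times.
# The test name and package path are placeholders, not taken from this PR.
repeat_test_cmd() {
  local test_name="$1" pkg="$2" n="${3:-20}" timeout="${4:-30m}"
  # -count=n runs the selected test n times; anchoring the -run pattern
  # with ^ and $ avoids matching other tests with a similar name.
  printf "go test %s -run '^%s\$' -count=%s -timeout=%s\n" \
    "$pkg" "$test_name" "$n" "$timeout"
}

# Hypothetical usage: 20 iterations with a 30 minute overall timeout.
repeat_test_cmd TestExample ./tests 20 30m
```

Copy the printed line into a shell to run it. Note that `-timeout` covers the whole invocation, so it needs to grow with n.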

Why?

To aid in the diagnosis and treatment of flaky tests.

How did you test it?

Tested in GitHub Actions.
Here is the action run normally, with no new input parameters: https://github.com/temporalio/temporal/actions/runs/11261156403
Here is the action run on one test multiple times: https://github.com/temporalio/temporal/actions/runs/11261236304

While we still have Buildkite, the test results will be uploaded there. You can find them by going to https://buildkite.com/organizations/temporal/analytics/suites/temporal-public/runs?branch=all+branches and looking for a recent run with "job: functional-test" and the commit hash you used.
Here is the Buildkite output of the run above: https://buildkite.com/organizations/temporal/analytics/suites/temporal-public/runs/39901afd-35f1-8171-9257-e0bada374824

Potential risks

Our functional test pipeline could be broken by this PR, but we would notice that almost immediately.

Documentation

How to run it yourself

1. Go to https://github.com/temporalio/temporal/actions/workflows/run-tests.yml
2. Click "Run workflow" in the upper right
3. Set Commit SHA to the latest commit on the branch
4. Select your desired options
5. Click the green "Run workflow" button
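
The same dispatch can also be triggered from the GitHub CLI with `gh workflow run`, which accepts `--ref` and repeated `-f key=value` inputs. A minimal sketch, assuming hypothetical input names (`commit` and `n_runs` below are illustrations only; check the `workflow_dispatch` inputs in run-tests.yml for the real ones). The helper prints the command rather than running it:

```shell
#!/usr/bin/env bash
# Print (not execute) a `gh workflow run` command for run-tests.yml.
# Usage: dispatch_cmd <branch> [key=value ...]
# Each key=value pair is passed through as a workflow input via -f.
dispatch_cmd() {
  local ref="$1"
  shift
  local cmd="gh workflow run run-tests.yml --repo temporalio/temporal --ref $ref"
  local kv
  for kv in "$@"; do
    cmd="$cmd -f $kv"
  done
  printf '%s\n' "$cmd"
}

# Hypothetical input names and values; replace them with the workflow's real inputs.
dispatch_cmd my-branch commit=0123abc n_runs=20
```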

Is hotfix candidate?

rodrigozhou · 2024-07-18T07:18:05Z

Can we create a separate workflow file for this purpose?
run-tests.yml is already complicated enough that adding this seems like unnecessary complexity.
Also, as you noted, running misc-checks is unnecessary when running a single test.

dnr · 2024-07-20T00:03:24Z

> Can we create a separate workflow file for this purpose? run-tests.yml is already complicated enough that adding this seems like unnecessary complexity. Also, as you noted, running misc-checks is unnecessary when running a single test.

A separate workflow file that we run only rarely will rot and is likely to be broken by the time someone wants to use it. Integrating it into the main one makes it much more likely to stay maintained and working.

.github/workflows/run-tests.yml

carlydf and others added 30 commits July 10, 2024 12:59

create Run Test N-times gh action

f5301c6

change name of step

377d713

make existing test workflow allow single test run

c05a1a1

fix false check syntax

6fb13fd

don't run unit tests in single functional test mode

82acc5c

fix needs

56b97cf

hardcode single test mode

320ed3d

run functional-test always()

94a6f2c

print what the makefile runs

774403d

temporary code change to test flaky test

cd5160b

add db options

f055929

hard code sqlite and postgres12

451f80c

delete comment below runs-on

ceeaca4

fix db issue?

6ddda0b

removed duplicate matrix step, updated options

fc7ead3

define setup job outputs

ce29c65

configurable timeout

8fbe6df

test string timeout

bc76889

fix timeout pipe

69a0a37

try dynamic shard matrix

ee8e476

try other dynamic matrix format

be78354

add db matrix

0a9d473

fix misuse of single quotes

f6b52ea

hard-code sqlite for single test mode

5647d44

quote the db names

dd629bc

remove comment above runs-on and dynamically config test_db

9a78b44

Merge branch 'main' into cdf/rerun-functional-test

a129047

change job if condition so steps run when needed

7292977

Merge branch 'cdf/rerun-functional-test' of github.com:temporalio/temporal into cdf/rerun-functional-test

4980a86

fix lint and clean code

189b6a9

fix cassandra db input

a31d78b

carlydf added 18 commits July 23, 2024 19:48

multi-select test db

f6f6873

remove postgres12_pgx test db

e2e0d15

fix bash syntax error

2540f46

try a different jq syntax

e9ce2a7

merge in main

ed9233b

delete bad mktemp --tmpdir flag

fdcaa7f

revert --tempdir flag change

b7d9ccc

put back newlines

fc558d9

temporarily remove make ci-build-misc

2db8d87

remove cache-docker-images dependency

5951bb3

add prints

7417f3d

save changes

16ee81e

echo vars then run functional test n times

edb5343

fix syntax

90c4962

Merge branch 'main' of github.com:temporalio/temporal into cdf/rerun-functional-test

f16eb70

offer custom test directory

89b47d1

offer n-times run for unit test also

20e1f1e

print test command in run

b70af29

carlydf changed the title ~~Action inputs to dispatch n-runs of a single functional test~~ Action inputs to dispatch n-runs of a single test in CI Oct 4, 2024

carlydf commented Oct 7, 2024

.github/workflows/run-tests.yml (outdated, resolved)

justinp-tt approved these changes Oct 8, 2024

carlydf added 2 commits October 9, 2024 11:32

take dbs as list

cb7d6c1

make misc-checks a no-op in single test mode

c857565

carlydf enabled auto-merge (squash) October 9, 2024 19:06

carlydf merged commit 65a58d9 into main Oct 9, 2024
59 of 60 checks passed

carlydf deleted the cdf/rerun-functional-test branch October 9, 2024 19:17
