test: add test that verifies subsequent tasks are NEVER_ATTEMPTED when previous action is canceled #437

YutongLi291 · 2024-10-10T21:41:01Z

What was the problem/requirement? (What/Why)

The scheduler doesn’t pipeline more than one task run in a session until the first task run in that session has completed.

We should verify this behaviour: if a task run does not complete, the other tasks in the session should not have been attempted.

What was the solution? (How)

Add an E2E test that verifies that when a task is canceled during its run, the subsequent tasks are in NEVER_ATTEMPTED status.

Also edited an existing test for CANCELED actions so that it verifies that subsequent actions are NEVER_ATTEMPTED.
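As a rough illustration of the check described here, the sketch below works on plain dictionaries rather than real service responses. The helper name `subsequent_actions_never_attempted`, the `"status"` key, and the `SUCCEEDED` status used in the usage lines are assumptions made for illustration; this is not the code added in this PR.

```python
from typing import Dict, List


def subsequent_actions_never_attempted(actions: List[Dict[str, str]]) -> bool:
    """Return True if every action after the first CANCELED one is NEVER_ATTEMPTED.

    `actions` is assumed to be ordered the way the scheduler queued them, and
    each item is a plain dict with a hypothetical "status" key.
    """
    statuses = [action["status"] for action in actions]
    if "CANCELED" not in statuses:
        # Nothing was canceled, so the property under test does not apply.
        return False
    first_canceled = statuses.index("CANCELED")
    return all(s == "NEVER_ATTEMPTED" for s in statuses[first_canceled + 1:])


# Usage with hand-written data (hypothetical shapes, not real API output):
ok = subsequent_actions_never_attempted(
    [{"status": "SUCCEEDED"}, {"status": "CANCELED"}, {"status": "NEVER_ATTEMPTED"}]
)
```

Returning False when nothing was canceled keeps a test from passing by accident if the cancel request never took effect.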

What is the impact of this change?

Better test coverage: verifies that when a task is canceled, the worker behaves as expected and the following tasks are NEVER_ATTEMPTED.

How was this change tested?

# Linux
source .e2e_linux_infra.sh
hatch run e2e-test

# Windows
source .e2e_windows_infra.sh
hatch run e2e-test

Was this change documented?

No

Is this a breaking change?

No

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

hindleym · 2024-10-24T22:09:38Z

test/e2e/test_job_submissions.py

+        assert is_job_started_with_sessions(job)
+
+        # Wait some time for the second step (which sleeps for 2 minutes) to start
+        time.sleep(20)


Instead of sleeping here, it may be worth just waiting on the second step to switch to RUNNING? That way if it starts sooner, the test can complete faster.

YutongLi291 · 2024-10-25

That's true, but polling also means more API calls. There's a tradeoff, and I believe a 20-second wait is not long enough to justify the extra API calls.

hindleym · 2024-10-27

I see the sleep command more as a last resort. It is guaranteed to slow down all testing, especially when testing new changes before a PR. Between these two sleeps, you're adding thirty seconds to every full test attempt, which already takes an extremely long time to complete. If I'm not mistaken, most people run the full suite more than once before creating a PR, so that's at least a minute added to every future change people try to merge.
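For reference, the polling alternative discussed in this thread could look roughly like the sketch below. `wait_until` is a hypothetical helper, not something taken from this repository, and the default timeout and poll interval are assumptions chosen to mirror the 20-second sleep.

```python
import time
from typing import Callable


def wait_until(
    condition: Callable[[], bool],
    timeout_seconds: float = 20.0,
    poll_interval_seconds: float = 2.0,
) -> bool:
    """Poll `condition` until it returns True or the timeout elapses.

    Returns True as soon as the condition holds, False on timeout.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        if condition():
            return True
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            return False
        # Never sleep past the deadline.
        time.sleep(min(poll_interval_seconds, remaining))
```

In a test, `condition` would wrap whatever call reports the second step's status (for example a lambda comparing it to RUNNING). With a 2-second interval, the 20-second budget costs roughly ten status calls in the worst case, and the test moves on as soon as the step starts.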

hindleym · 2024-10-24T22:19:43Z

test/e2e/test_job_submissions.py

@@ -406,6 +406,9 @@ def sessions_exist(current_job: Job) -> bool:

        LOG.info(f"Job result: {job}")

+        # Wait until the envExit runs as well
+        time.sleep(10)


Is anything actually happening on exit that we need to sleep this long? I see that it waits for the job to complete above. Should that function not already make sure that the job has finished its calls before returning?

YutongLi291 · 2024-10-24

The job is considered complete even if envExit has not run; envExit is not required to finish for the job to finish.
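Since job completion does not imply that envExit has finished, one option is to check the envExit action directly instead of sleeping for a fixed time. The predicate below is only a sketch over plain dictionaries: the `"definition"` and `"status"` keys, the `"envExit"` and `"taskRun"` markers, and the set of terminal statuses are all assumptions about the data shape, not confirmed details of the test helpers in this repository.

```python
from typing import Any, Dict, Iterable

# Statuses treated as "finished" in this sketch; the exact set is an assumption.
TERMINAL_STATUSES = frozenset(
    {"SUCCEEDED", "FAILED", "CANCELED", "NEVER_ATTEMPTED", "INTERRUPTED"}
)


def env_exit_finished(session_actions: Iterable[Dict[str, Any]]) -> bool:
    """Return True once at least one envExit action exists and all are terminal."""
    env_exits = [
        action
        for action in session_actions
        if "envExit" in action.get("definition", {})
    ]
    return bool(env_exits) and all(
        action["status"] in TERMINAL_STATUSES for action in env_exits
    )
```

A test could evaluate this predicate inside any bounded wait, so it proceeds as soon as envExit has actually run rather than after a fixed delay.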

hindleym

I would avoid forcing tests to sleep as much as possible.

sonarqubecloud · 2024-10-28T21:23:01Z

Quality Gate failed

Failed conditions
11.0% Duplication on New Code (required ≤ 3%)

See analysis details on SonarCloud

sonarqubecloud · 2024-10-29T02:54:53Z

Quality Gate failed

Failed conditions
11.0% Duplication on New Code (required ≤ 3%)

See analysis details on SonarCloud

YutongLi291 requested a review from a team as a code owner October 10, 2024 21:41

YutongLi291 force-pushed the mainline branch from 95f5d96 to dd01af0 Compare October 11, 2024 00:58

YutongLi291 changed the title ~~test: add test that verifies subsequent tasks are NEVER_ATTEMPTED when task is canceled~~ test: add test that verifies subsequent tasks are NEVER_ATTEMPTED when previous action is canceled Oct 11, 2024

github-advanced-security bot found potential problems Oct 11, 2024

test/e2e/test_job_submissions.py (fixed)

YutongLi291 force-pushed the mainline branch from dd01af0 to f9cb84f Compare October 11, 2024 22:20

YutongLi291 enabled auto-merge (squash) October 14, 2024 21:23

YutongLi291 disabled auto-merge October 16, 2024 18:28

leongdl reviewed Oct 16, 2024

test/e2e/test_job_submissions.py

YutongLi291 requested a review from leongdl October 21, 2024 16:52

YutongLi291 enabled auto-merge (squash) October 24, 2024 00:31

hindleym reviewed Oct 24, 2024

YutongLi291 requested a review from hindleym October 26, 2024 20:54

YutongLi291 force-pushed the mainline branch from 943710e to 2e622a4 Compare October 28, 2024 21:20

YutongLi291 closed this Oct 28, 2024

auto-merge was automatically disabled October 28, 2024 21:22
Pull request was closed

YutongLi291 force-pushed the mainline branch from 077b5ef to f95a6ce Compare October 28, 2024 21:22

test: add test that verifies subsequent tasks are NEVER_ATTEMPTED whe…

6ae95bc

…n previous action is canceled Signed-off-by: Yutong Li <[email protected]>

YutongLi291 reopened this Oct 28, 2024

hindleym approved these changes Oct 28, 2024

leongdl approved these changes Oct 29, 2024

Merge branch 'mainline' into mainline

4ca61ad

YutongLi291 enabled auto-merge (squash) October 29, 2024 02:54

YutongLi291 merged commit add5dce into aws-deadline:mainline Oct 29, 2024
14 of 15 checks passed
