Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: Fix flakey test #202

Merged
merged 47 commits into from
Aug 19, 2024
Merged

chore: Fix flakey test #202

merged 47 commits into from
Aug 19, 2024

Conversation

juliev0
Copy link
Collaborator

@juliev0 juliev0 commented Aug 15, 2024

Fixes #199

Modifications

  1. this fixes the flakey e2e test. Root cause was because the cluster for the e2e test testing "dataLossPrevention=true" wasn't clean, and still had a lingering Job running from the Pipeline of the previous test. Have described the issue that occurs here.
  2. Refactored a bit to subdivide the test which was previously in one big long file into multiple files. (would prefer to later maybe move these new files to a util subdirectory, and then they could be used by multiple tests)

Verification

Have run the e2e test 8 times in a row with no failures.

…s left paused until new spec is reconciled

Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
@juliev0 juliev0 changed the title Test e2e pending chore: Test e2e pending Aug 15, 2024
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
@juliev0 juliev0 changed the title chore: Test e2e pending chore: Troubleshoot flakey test Aug 15, 2024
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
Signed-off-by: Julie Vogelman <[email protected]>
isbServiceRolloutClient planepkg.ISBServiceRolloutInterface
numaflowControllerRolloutClient planepkg.NumaflowControllerRolloutInterface
kubeClient clientgo.Interface
)
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

all these variables need to be in a file not including "_test.go" in the name in order to be referenced from the other files

Signed-off-by: Julie Vogelman <[email protected]>
echo '127.0.0.1 localhost' | sudo tee -a /etc/hosts
echo 'Waiting for the cluster to be ready...'
until kubectl cluster-info; do sleep 1; done

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this resolves the CI flakey test by creating a clean cluster before the second e2e test runs

@juliev0 juliev0 marked this pull request as ready for review August 18, 2024 06:11
@juliev0 juliev0 changed the title chore: Troubleshoot flakey test chore: Fix flakey test Aug 19, 2024
Copy link
Contributor

@dpadhiar dpadhiar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@xdevxy xdevxy merged commit 4fa0054 into main Aug 19, 2024
8 checks passed
@xdevxy xdevxy deleted the test-e2e-pending branch August 19, 2024 17:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Flakey e2e test - "dataLossPrevention=true" - Numaflow Vertex Pods stuck in Pending state
3 participants