Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

test-flake: integration_tests::updates::test_update_uninitialized #4949

Open
rcgoodfellow opened this issue Feb 1, 2024 · 14 comments
Open
Assignees
Labels
Test Flake Tests that work. Wait, no. Actually yes. Hang on. Something is broken.

Comments

@rcgoodfellow
Copy link
Contributor

This test passed in a PR, then failed on main once the PR landed.

See

@rcgoodfellow rcgoodfellow added the Test Flake Tests that work. Wait, no. Actually yes. Hang on. Something is broken. label Feb 1, 2024
@iliana
Copy link
Contributor

iliana commented Feb 1, 2024

Error: repository fetch should have failed with 500 error

Caused by:
    0: making request to server
    1: connection closed before message completed

Yet the server seemed to do the right thing?

@rcgoodfellow
Copy link
Contributor Author

It was the Linux job that had this failure. The helios job passed on the first go. The test passed when re-running the Linux CI job.

@smklein
Copy link
Collaborator

smklein commented Feb 2, 2024

@sunshowers sunshowers self-assigned this Feb 2, 2024
@sunshowers
Copy link
Contributor

I wrote this test, will look tomorrow.

@smklein
Copy link
Collaborator

smklein commented Mar 12, 2024

One more data point: #5246 , still flaking

@smklein
Copy link
Collaborator

smklein commented Mar 12, 2024

This was on buildomat / build-and-test (helios) , which is not a Linux job, fwiw

@sunshowers
Copy link
Contributor

Started looking at this a couple weeks ago but got preempted :(

Wondering if we should just throw a retry on the test for now.

@elaine-oxide
Copy link
Contributor

@elaine-oxide
Copy link
Contributor

The previously failing test passed on re-run.
https://github.com/oxidecomputer/omicron/runs/29808706363

andrewjstone added a commit that referenced this issue Sep 19, 2024
While looking into #4949,
I noticed that the test name was wrong. While I haven't solved this
issue, this will at least make the failure a bit less confusing since
the logs will be named correctly.
andrewjstone added a commit that referenced this issue Sep 19, 2024
While looking into #4949,
I noticed that the test name was wrong. While I haven't solved this
issue, this will at least make the failure a bit less confusing since
the logs will be named correctly.
@FelixMcFelix
Copy link
Contributor

@andrewjstone
Copy link
Contributor

andrewjstone commented Oct 18, 2024

This one's come up again today for me: https://buildomat.eng.oxide.computer/wg/0/details/01JADDKWGJPE0W6GWXXKVACPCS/tntBJzr7RPDoVr0k1n2H2XbfwB7E1I31lHt7USHEwUNUteb7/01JADDM60TPPKHJFF6GYQ0E9QW.

I think you actually hit this error: #6771 but in a test.

@iliana
Copy link
Contributor

iliana commented Oct 29, 2024

This test is part of a set of tests that we're almost certainly going to delete in the short-term; everything that module tests is going to be replaced with the TUF Repo Depot and the Reconfigurator. It might be nice to figure out the root cause but if it's causing pain today I'm happy to delete the update system we've never used! No wait this is wrong, I forgot we had done a bunch of work in Nexus to flesh out some of the original code I wrote.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Test Flake Tests that work. Wait, no. Actually yes. Hang on. Something is broken.
Projects
None yet
Development

No branches or pull requests

7 participants