-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
teamcity: failed test: TestImportCockroachDump #38242
Comments
Hmm. Gossip is stalled, and lots of things are failing, but this stands out:
That's brutal. Perhaps similar to the failure mode in #38246? Or even #37163 (comment) where nothing happens on one node for a full six minutes. |
Ok in the log we actually see long stalls like that. Look at these runtime stats (that's all of them, though the test ran for minutes and they ought to be logged every 10s, by each node)
I wonder what we're dealing with here. Taking into account these other issues I'm starting to suspect something fundamental. Perhaps the go1.12 bump or some Rocks patch that we applied? cc @ajkr / @petermattis for ideas on what to look for here and in the issues linked above. |
Nothing immediate comes to mind. Certainly, a Go version bump is risky, which is why we did it before the end of the cycle, but minutes of inactivity on a node is not likely to be a Go bug. Perhaps we're pushing up against a memory limit. 2.5 GiB RSS seems pretty sizable for a test. |
We're talking minutes of inactivity across all goroutines in the running test binary. PS the 2.5GiB is process-wide. Have you taken a look at #37163 (comment)? |
The following tests appear to have failed on master (testrace): TestImportCockroachDump
You may want to check for open issues.
#1344165:
Please assign, take a look and update the issue accordingly.
The text was updated successfully, but these errors were encountered: