-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sql: nightly benchmark broken #8081
Comments
I stopped the build but left the instance running in case someone wants to do forensics. |
Seems like an issue shutting down the server after a test. Not much else to go on. |
@WillHaack seems to have more details on this. Could you chime in? |
I believe @cuongdo is correct. Some tests may timeout, and when they do I'm not sure they close down the servers properly so the subsequent tests may have trouble binding to ports. (Also, I'm not even sure that they close their connections properly when they successfully run.) I'll try to work on fixing this today. If all else fails it may be worth making it so that multinode tests only run with the -multinode flag until further notice. (IMO it's not a big deal to keep multinode tests that are failing, but it is a big deal if they are causing other tests to fail.) |
#8075 likely fixed the issue. The benchmark tests have been running for 7+ hours now. They were failing consistently in about 2.5hrs before. |
@WillHaack I think they're broken in a different way now. Can you please investigate? They've now been running for 30 hours -- seems like they might have stalled again. https://teamcity.cockroachdb.com/viewLog.html?buildId=7571&tab=buildResultsDiv&buildTypeId=Cockroach_BenchmarkTests |
Do they now pass, with #10237 in? |
Looks like they are broken by the move to azure builders? Tonight's run failed with "terraform is not in your path": https://teamcity.cockroachdb.com/viewLog.html?tab=buildLog&logTab=tree&filter=debug&expand=all&buildId=37243#_focus=724 Probably we should run this orchestration from inside the builder container instead of adding terraform to the TC builders. cc @jordanlewis |
The benchmark tests still fail. It seems that a bunch of tests fail that require Postgres to be installed. Did we used to ignore these somehow? The move to Azure didn't affect what is installed and running on the benchmark machines. |
Current issue is that |
Currently blocking cockroachdb#8081
The
|
I spoke too soon: in the
|
The Benchmark tests panic due to `kv_test` passing a `nil` clock to `rpc.NewContext`. Fix this. See cockroachdb#8081
Okay, the nil clock panic is fixed but the above failures remain. |
Tracking the above failures in #10551. |
After ignoring the problems in #10551, a new set of problems turned up. The sql benchmarks now pass, but the storage benchmarks seem to hang indefinitely on the very first iteration of
These errors repeat every ten minutes. |
The nightly benchmarks pass most of the time now. Closing this in favor of more targeted issues for occasional failures. |
The nightly benchmark seems to have stalled during the
sql.test
portion. Thesql.test.stderr
log is growing forever with the following log lines:The test invocation on TeamCity is here: https://teamcity.cockroachdb.com/viewLog.html?buildId=5068&buildTypeId=Cockroach_BenchmarkTests&tab=buildLog
he nightly benchmark seems to have stalled during the
sql.test
portion. Thesql.test.stderr
log is growing forever with the following log lines:The test invocation on TeamCity is here: https://teamcity.cockroachdb.com/viewLog.html?buildId=5068&buildTypeId=Cockroach_BenchmarkTests&tab=buildLog
UPDATE: current issue is that
sql.BenchmarkPgbenchExec_{Cockroach,Postgres}
fail in the absence ofpgbench
, which we do not install anywhere.The text was updated successfully, but these errors were encountered: