-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: ycsb/E/nodes=3/cpu=32 failed #135163
Comments
Disk stall (node died)
|
@cockroachdb/kv Not entirely sure what went wrong here, if someone could help take a look? |
I see that for node 3 at
There are a bunch off error log lines like:
10 seconds later, the node crashes with:
Is there a way to check if there was actually a problem with the disk in GCE? I took a quick look at the metrics, but I wasn't sure which one do we usually use for that type of problems. Also, since node 3 crashed, some important metrics might not have been collected. |
@iskettaneh given this is a disk stall, we typically close it out as an infra flake (assuming it isn't a test induced disk stall, which isn't the case for these YCSB tests). |
Tracking issue for reference: #97968 |
roachtest.ycsb/E/nodes=3/cpu=32 failed with artifacts on release-24.2 @ 287e165b88ff5aa1aa3c9b9ff1303d859c1f960e:
Parameters:
ROACHTEST_arch=amd64
ROACHTEST_cloud=gce
ROACHTEST_coverageBuild=false
ROACHTEST_cpu=32
ROACHTEST_encrypted=false
ROACHTEST_fs=ext4
ROACHTEST_localSSD=true
ROACHTEST_runtimeAssertionsBuild=false
ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
See: Grafana
This test on roachdash | Improve this report!
Jira issue: CRDB-44384
The text was updated successfully, but these errors were encountered: