-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: tpccbench/nodes=9/cpu=4/chaos/partition failed #79568
Comments
The test fails with a 255 error code from
On node 1 itself (0 here), we see some strange behavior in the logs, which seem to have been corrupted at the end:
There's nothing in the dmesg or journalctl output. I'm not going to close this as a test flake because it's an interesting failure more, but this isn't an issue in KV and it's also not a release blocker, so transferring to test eng to let them decide whether to dig deeper or track going forward. |
This looks like a potential race condition inside
It's unclear as to why |
We should probably remove this |
This was originally introduced when CRDB could push the VM to brown out (sometimes forever), which lost us lost of signal from `tpccbench` (on top of causing flakes). However, for a long time now we've deployed CRDB in a cgroup that leaves enough headroom for the OS to reliably oomkill CRDB if necessary, so `c.Reset` is no longer necessary. Fixes cockroachdb#79568. Release note: None
Without [1] https://cockroachlabs.atlassian.net/browse/CRDB-13799 |
[2] seems wholly unconnected to the problem of building up garbage that you linked. |
Ah, I see; that's because all roachtests use a persistent drive (
Yep, although VM reset may alter the performance of the OS, e.g., different working set of SSTs in memory. [1] https://github.com/cockroachdb/cockroach/blob/master/pkg/cmd/roachtest/spec/cluster_spec.go#L206 |
roachtest.tpccbench/nodes=9/cpu=4/chaos/partition failed with artifacts on release-22.1 @ 5d1063406d48c2713e0a7a8421f1946349b3db65:
Help
See: roachtest README
See: How To Investigate (internal)
This test on roachdash | Improve this report!
Jira issue: CRDB-15945
The text was updated successfully, but these errors were encountered: