-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: tpcc/mixed-headroom/n5cpu16 failed #79765
Comments
[ 1985.186448] Out of memory: Killed process 14684 (cockroach) total-vm:18180576kB, anon-rss:10837820kB, file-rss:54896kB, shmem-rss:0kB, UID:1000 pgtables:32736kB oom_score_adj:0 |
cockroach exited with code 137: Mon Apr 11 13:21:03 UTC 2022 Note that this OOM happened on the predecessor, i.e. v21.2, and |
cc @cockroachdb/bulk-io |
#78957 merged last week which I think knocks out the big piece of this profile, but I don't know if that really explains the OOM; that big piece, while big, was still pretty much fixed in size based on the workload (row size) and gomaxprocs, so in most of these, it was already as big as it was going to get. I suppose you could blame the oom on it, since being so big it meant we had little headroom remaining for something else (SSTs in raft?) to spike, but I don't think the workload generator mem would itself be the oom-causing spike here. |
Hi @shermanCRL - with the T-bulkio label removed, is there another team that is still looking into this (to remove the GA-blocker)? thanks! |
No, as far as KV is concerned this issue can be closed (which I will do now). The heap profile points at the workload causing the OOM and David says that was fixed. We don't understand everything here but there also isn't anything else to look at. |
roachtest.tpcc/mixed-headroom/n5cpu16 failed with artifacts on release-22.1 @ 12fac19acc2f05a1ec7c60e9a50e0c694c491657:
Help
See: roachtest README
See: How To Investigate (internal)
Same failure on other branches
This test on roachdash | Improve this report!
Jira issue: CRDB-15924
The text was updated successfully, but these errors were encountered: