-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: gossip/chaos/nodes=9 failed #51721
Comments
The error seems legitimate:
|
cc @nvanbenschoten @tbg for triage |
(roachtest).gossip/chaos/nodes=9 failed on master@b8a50cc4d062293915969cdc83e3ec4d057cede5:
More
Artifacts: /gossip/chaos/nodes=9
See this test on roachdash |
Yes, this looks like an issue. I have a hunch I know what's going on. In the first failure, we see errors like:
in the second, errors like:
Notice that in both cases, the "dead" node is part of the gossip network but there is another node missing. Incidentally, I ran a large TPC-E cluster last Friday and noticed that the node IDs were all out of order, even with |
To back this up, in the second failure, we can see from node 7's logs that it was assigned node ID 3. That's the node missing from gossip. |
#51497 (comment), fixed by #51790. |
(roachtest).gossip/chaos/nodes=9 failed on master@bfa6307c292ef4dfed4a53cb99f506e6dab26533:
More
Artifacts: /gossip/chaos/nodes=9
See this test on roachdash |
..and the setting of cluster settings for single node clusters. `roachprod start --sequential` was broken in cockroachdb#51329, and the broken-ness outlined in TODOs in cockroachdb#51790. This PR just addresses those TODOs. Fixes cockroachdb#51497 Fixes cockroachdb#51721 Fixes cockroachdb#51738 Fixes cockroachdb#51768 Fixes cockroachdb#51769 Fixes cockroachdb#51776 Release note: None
51893: roachprod: fixup `roachprod --sequential` r=irfansharif a=irfansharif ..and the setting of cluster settings for single node clusters. `roachprod start --sequential` was broken in #51329, and the broken-ness outlined in TODOs in #51790. This PR just addresses those TODOs. Fixes #51497 Fixes #51721 Fixes #51738 Fixes #51768 Fixes #51769 Fixes #51776 Release note: None Co-authored-by: irfan sharif <[email protected]>
(roachtest).gossip/chaos/nodes=9 failed on master@e9a4f83e3eee59510f97db2c6e0df9b57cf6b944:
More
Artifacts: /gossip/chaos/nodes=9
Related:
roachtest: gossip/chaos/nodes=9 failed #51707 roachtest: gossip/chaos/nodes=9 failed C-test-failure O-roachtest O-robot branch-provisional_202007220233_v20.2.0-alpha.2 release-blocker
roachtest: gossip/chaos/nodes=9 failed #49493 roachtest: gossip/chaos/nodes=9 failed C-test-failure O-roachtest O-robot branch-release-19.1 release-blocker
See this test on roachdash
powered by pkg/cmd/internal/issues
The text was updated successfully, but these errors were encountered: