-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: replicate/wide failed #92915
Comments
roachtest.replicate/wide failed with artifacts on master @ 9c21578450e395c83a1dc0df7090296fef06e006:
Parameters: |
Looks like either a flake because we don't handle transient restart errors or some underlying behaviour change. Test fails because query for unavailable ranges fails after cluster is restarted with 6 nodes out of 9:
N7 is the node removed from a cluster. |
roachtest.replicate/wide failed with artifacts on master @ 8a5cb5109d6b7e122e996a44f940bfd61698cd81:
Parameters: |
Seem broken by #92787 |
That PR effectively reverts the changes made in #92293 to introduce a new end-point, instead of using Gossip for Looking at cockroach/pkg/sql/crdb_internal.go Lines 3669 to 3677 in 8f8cc70
Note that However, it seems like we inadvertently changed the behaviour around here: cockroach/pkg/sql/crdb_internal.go Lines 3732 to 3735 in 8f8cc70
Before these two PRs went in, we would include an empty locality if the node descriptor wasn't found in It should be easy enough to bring back the old behaviour. cc @ecwall, assigning this to you given you authored the 2 patches linked above. I think all we need to do here is swallow the error from One thing that isn't clear to me is why we're not able to retrieve this descriptor from Gossip, given our TTL of 2 hours. Might be worth trying to understand this a bit more. |
roachtest.replicate/wide failed with artifacts on master @ 146556e19f5e4fdc8c3e6a623b280cc33aee4d18:
Parameters: |
roachtest.replicate/wide failed with artifacts on master @ 8165e3974c10e88b6ae11c6255872ea16f3a67e3:
Parameters: |
roachtest.replicate/wide failed with artifacts on master @ ec095bc2fdbe4e518b076db20e4920fab67222bf:
Parameters: |
93127: multitenant: handle missing NodeDescriptor in crdb_internal.ranges_no_leases r=arulajmani a=ecwall Fixes #92915 This change matches the previous behavior of using "" for locality if the NodeDescriptor is not found instead of returning an error when generating crdb_internal.ranges_no_leases. Release note: None Co-authored-by: Evan Wall <[email protected]>
roachtest.replicate/wide failed with artifacts on master @ e7b15ebaed9c14668ade0a7827a5525aedef1ab0:
Parameters:
ROACHTEST_cloud=gce
,ROACHTEST_cpu=1
,ROACHTEST_encrypted=false
,ROACHTEST_fs=ext4
,ROACHTEST_localSSD=true
,ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
This test on roachdash | Improve this report!
Jira issue: CRDB-22048
Epic CRDB-16746
The text was updated successfully, but these errors were encountered: