-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
roachtest: always use active node in backup schedule injection #97581
Conversation
@renatolabs I've verified that |
Once we are confident this fixes all the issues that we know of, I think we should kick off a nightly build with this patch to double check we don't see on-start backup related failures anymore. A number of tests in current nightly run (which is still running) have failed (examples: #97582, #97562, #97548). I also noticed there are at least two error messages: file not found (e.g., #97558), and failed to connect (e.g., #97562). |
Never mind, that test actually fails on master as well (see #97016). |
21db718
to
6e1ddc4
Compare
for reasons that i do not completely understand, I've gotten |
rats. it's failed on my branch. To prevent other tests from flaking, I will disable scheduled backups on the restart in this test for now. |
Did if fail with a |
Yup, it is failing with |
6e1ddc4
to
94d4876
Compare
@renatolabs is it worth merging this tonight to further limit the blast radius of failures caused by #97495 ? |
it also seems the
|
Previously, the schedule backup cmd injection on Start() would always run on the first node on the cluster, but if that node were not available, the cmd would fail. This patch ensures the injection runs on a node that was just started. Fixes cockroachdb#97558 Release note: None
94d4876
to
c5ef384
Compare
This is probably because of the location of the machine you ran this test on (see [1]).
Yes, let's merge. Could you go over the test failures on yesterday's build and double check we're closing all issues related to this, please? [1] cockroach/pkg/cmd/roachtest/tests/mixed_version_cdc.go Lines 53 to 59 in f3796ef
|
thanks for the help here! bors r=renatolabs |
Build succeeded: |
Previously, the schedule backup cmd injection on Start() would always run on the first node on the cluster, but if that node were not available, the cmd would fail. This patch ensures the injection runs on a node that was just started.
Fixes #97558, #97582, #97548, #97562 #97565 #97561
Release note: None