roachtest: kv/gracefuldraining/nodes=3 failed [qps dropped] #59094

Closed
cockroach-teamcity opened this issue Jan 17, 2021 · 41 comments
Labels: branch-master, C-test-failure, O-roachtest, O-robot, S-1, skipped-test, T-kv

Comments

@cockroach-teamcity
Member

cockroach-teamcity commented Jan 17, 2021

(roachtest).kv/gracefuldraining/nodes=3 failed on master@7b0ccdda99b81613e70f421c9374483c3feddff3:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2663,errgroup.go:57: QPS of 884.50 at time 2021-01-17 07:08:50 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1610867280000000000 Value:998.7} {TimestampNanos:1610867290000000000 Value:969.3} {TimestampNanos:1610867300000000000 Value:969.2} {TimestampNanos:1610867310000000000 Value:988.2} {TimestampNanos:1610867320000000000 Value:932.7} {TimestampNanos:1610867330000000000 Value:884.5} {TimestampNanos:1610867340000000000 Value:943.9000000000001} {TimestampNanos:1610867350000000000 Value:975.5} {TimestampNanos:1610867360000000000 Value:996.4000000000001} {TimestampNanos:1610867370000000000 Value:997.9000000000001} {TimestampNanos:1610867380000000000 Value:999.5} {TimestampNanos:1610867390000000000 Value:999.3} {TimestampNanos:1610867400000000000 Value:999.9000000000001} {TimestampNanos:1610867410000000000 Value:993.8} {TimestampNanos:1610867420000000000 Value:991.7} {TimestampNanos:1610867430000000000 Value:994.2} {TimestampNanos:1610867440000000000 Value:995.7} {TimestampNanos:1610867450000000000 Value:997.8} {TimestampNanos:1610867460000000000 Value:997.7}]

	cluster.go:2685,kv.go:536,test_runner.go:760: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2673
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2681
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:760
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2729
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2643
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3

See this test on roachdash
powered by pkg/cmd/internal/issues

Jira issue: CRDB-3333

Epic CRDB-18656

@cockroach-teamcity added the branch-master, C-test-failure, O-roachtest, O-robot, and release-blocker labels Jan 17, 2021
@knz self-assigned this Jan 19, 2021
@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@fbf596c3e17fbb9ec0935b732f4b84469a5399e8:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2665,errgroup.go:57: QPS of 511.90 at time 2021-01-21 07:22:40 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1611213700000000000 Value:997.4000000000001} {TimestampNanos:1611213710000000000 Value:996.1} {TimestampNanos:1611213720000000000 Value:994.3} {TimestampNanos:1611213730000000000 Value:995.8000000000001} {TimestampNanos:1611213740000000000 Value:995.1} {TimestampNanos:1611213750000000000 Value:978.3} {TimestampNanos:1611213760000000000 Value:511.90000000000003} {TimestampNanos:1611213770000000000 Value:717.5} {TimestampNanos:1611213780000000000 Value:519.3000000000001} {TimestampNanos:1611213790000000000 Value:585.8000000000001} {TimestampNanos:1611213800000000000 Value:954} {TimestampNanos:1611213810000000000 Value:927.5} {TimestampNanos:1611213820000000000 Value:906.2} {TimestampNanos:1611213830000000000 Value:923.4000000000001} {TimestampNanos:1611213840000000000 Value:996.7} {TimestampNanos:1611213850000000000 Value:983.6000000000001} {TimestampNanos:1611213860000000000 Value:726.4000000000001} {TimestampNanos:1611213870000000000 Value:924.3000000000001} {TimestampNanos:1611213880000000000 Value:996.4000000000001}]

	cluster.go:2687,kv.go:536,test_runner.go:760: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2675
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2683
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:760
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2731
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2645
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3

@asubiotto
Contributor

cc @knz does this qualify as an alpha release blocker?

@knz
Contributor

knz commented Jan 21, 2021

No, it does not qualify.

@knz removed the release-blocker label Jan 21, 2021
@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@100c09f4f6eb3f5b18a67ec4bbfdfe989e0d6ce2:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2665,errgroup.go:57: QPS of 883.30 at time 2021-02-04 07:06:40 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1612422400000000000 Value:883.3000000000001} {TimestampNanos:1612422410000000000 Value:517.9} {TimestampNanos:1612422420000000000 Value:284} {TimestampNanos:1612422430000000000 Value:406.30000000000007} {TimestampNanos:1612422440000000000 Value:768.4000000000001} {TimestampNanos:1612422450000000000 Value:992.1} {TimestampNanos:1612422460000000000 Value:985.7} {TimestampNanos:1612422470000000000 Value:995.9000000000001} {TimestampNanos:1612422480000000000 Value:994.2} {TimestampNanos:1612422490000000000 Value:999.4000000000001} {TimestampNanos:1612422500000000000 Value:995.3000000000001} {TimestampNanos:1612422510000000000 Value:999.5} {TimestampNanos:1612422520000000000 Value:999.6} {TimestampNanos:1612422530000000000 Value:997.2} {TimestampNanos:1612422540000000000 Value:996.5} {TimestampNanos:1612422550000000000 Value:995.3000000000001} {TimestampNanos:1612422560000000000 Value:996.1000000000001} {TimestampNanos:1612422570000000000 Value:996.8} {TimestampNanos:1612422580000000000 Value:997.1}]

	cluster.go:2687,kv.go:536,test_runner.go:767: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2675
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2683
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2731
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2645
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@c584f62067a45aa540c26fc9081a83e460bfe37a:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2665,errgroup.go:57: QPS of 864.00 at time 2021-02-05 07:07:20 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1612508770000000000 Value:998.8} {TimestampNanos:1612508780000000000 Value:995.7} {TimestampNanos:1612508790000000000 Value:995.3} {TimestampNanos:1612508800000000000 Value:996.1} {TimestampNanos:1612508810000000000 Value:996.7} {TimestampNanos:1612508820000000000 Value:995.1} {TimestampNanos:1612508830000000000 Value:987.9000000000001} {TimestampNanos:1612508840000000000 Value:864} {TimestampNanos:1612508850000000000 Value:998.1} {TimestampNanos:1612508860000000000 Value:998.7} {TimestampNanos:1612508870000000000 Value:998.8000000000001} {TimestampNanos:1612508880000000000 Value:999.6000000000001} {TimestampNanos:1612508890000000000 Value:999} {TimestampNanos:1612508900000000000 Value:998} {TimestampNanos:1612508910000000000 Value:996.3} {TimestampNanos:1612508920000000000 Value:996.1} {TimestampNanos:1612508930000000000 Value:996.5} {TimestampNanos:1612508940000000000 Value:995.7} {TimestampNanos:1612508950000000000 Value:995.9000000000001}]

	cluster.go:2687,kv.go:536,test_runner.go:767: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2675
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2683
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2731
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2645
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@81a2c26a104fa8cc7e8b530b837ffb6ff85ddc5a:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	cluster.go:2253,kv.go:373,test_runner.go:767: output in run_070825.078_n4_workload_run_kv: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2650259-1612594948-08-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned: exit status 20
		(1) attached stack trace
		  -- stack trace:
		  | main.(*cluster).RunE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2331
		  | main.(*cluster).Run
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2251
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:373
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (2) output in run_070825.078_n4_workload_run_kv
		Wraps: (3) /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2650259-1612594948-08-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned
		  | stderr:
		  | ./workload: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.29' not found (required by ./workload)
		  | Error: COMMAND_PROBLEM: exit status 1
		  | (1) COMMAND_PROBLEM
		  | Wraps: (2) Node 4. Command with error:
		  |   | ```
		  |   | ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1}
		  |   | ```
		  | Wraps: (3) exit status 1
		  | Error types: (1) errors.Cmd (2) *hintdetail.withDetail (3) *exec.ExitError
		  |
		  | stdout:
		Wraps: (4) exit status 20
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *main.withCommandDetails (4) *exec.ExitError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@0e98670c8fcec566937e899fdf77d2a68c702d62:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	cluster.go:2253,kv.go:373,test_runner.go:767: output in run_070503.680_n4_workload_run_kv: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2651620-1612681270-06-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned: exit status 20
		(1) attached stack trace
		  -- stack trace:
		  | main.(*cluster).RunE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2331
		  | main.(*cluster).Run
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2251
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:373
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (2) output in run_070503.680_n4_workload_run_kv
		Wraps: (3) /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2651620-1612681270-06-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned
		  | stderr:
		  | ./workload: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.29' not found (required by ./workload)
		  | Error: COMMAND_PROBLEM: exit status 1
		  | (1) COMMAND_PROBLEM
		  | Wraps: (2) Node 4. Command with error:
		  |   | ```
		  |   | ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1}
		  |   | ```
		  | Wraps: (3) exit status 1
		  | Error types: (1) errors.Cmd (2) *hintdetail.withDetail (3) *exec.ExitError
		  |
		  | stdout:
		Wraps: (4) exit status 20
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *main.withCommandDetails (4) *exec.ExitError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@5b33e6dfc47000de831745a851e3bf9e2cf7fd95:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	cluster.go:2253,kv.go:373,test_runner.go:767: output in run_065603.071_n4_workload_run_kv: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2653401-1612767119-15-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned: exit status 20
		(1) attached stack trace
		  -- stack trace:
		  | main.(*cluster).RunE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2331
		  | main.(*cluster).Run
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2251
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:373
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (2) output in run_065603.071_n4_workload_run_kv
		Wraps: (3) /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2653401-1612767119-15-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned
		  | stderr:
		  | ./workload: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.29' not found (required by ./workload)
		  | Error: COMMAND_PROBLEM: exit status 1
		  | (1) COMMAND_PROBLEM
		  | Wraps: (2) Node 4. Command with error:
		  |   | ```
		  |   | ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1}
		  |   | ```
		  | Wraps: (3) exit status 1
		  | Error types: (1) errors.Cmd (2) *hintdetail.withDetail (3) *exec.ExitError
		  |
		  | stdout:
		Wraps: (4) exit status 20
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *main.withCommandDetails (4) *exec.ExitError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@7853fd32de8b6dea869f2a2a92dcd7506f4a8998:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	cluster.go:2253,kv.go:373,test_runner.go:767: output in run_070744.237_n4_workload_run_kv: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2657140-1612854228-02-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned: exit status 20
		(1) attached stack trace
		  -- stack trace:
		  | main.(*cluster).RunE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2331
		  | main.(*cluster).Run
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2251
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:373
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (2) output in run_070744.237_n4_workload_run_kv
		Wraps: (3) /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod run teamcity-2657140-1612854228-02-n4cpu4:4 -- ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1} returned
		  | stderr:
		  | ./workload: /lib/x86_64-linux-gnu/libm.so.6: version `GLIBC_2.29' not found (required by ./workload)
		  | Error: COMMAND_PROBLEM: exit status 1
		  | (1) COMMAND_PROBLEM
		  | Wraps: (2) Node 4. Command with error:
		  |   | ```
		  |   | ./workload run kv --init --max-ops=1 --splits 100 {pgurl:1}
		  |   | ```
		  | Wraps: (3) exit status 1
		  | Error types: (1) errors.Cmd (2) *hintdetail.withDetail (3) *exec.ExitError
		  |
		  | stdout:
		Wraps: (4) exit status 20
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *main.withCommandDetails (4) *exec.ExitError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@e9e372122a2e3db7090b5705da07128f828e2441:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2665,errgroup.go:57: QPS of 754.80 at time 2021-02-12 07:28:30 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1613114770000000000 Value:993.2} {TimestampNanos:1613114780000000000 Value:989.9000000000001} {TimestampNanos:1613114790000000000 Value:989.1000000000001} {TimestampNanos:1613114800000000000 Value:983.7} {TimestampNanos:1613114810000000000 Value:989.1} {TimestampNanos:1613114820000000000 Value:995.2} {TimestampNanos:1613114830000000000 Value:995.4000000000001} {TimestampNanos:1613114840000000000 Value:997.7} {TimestampNanos:1613114850000000000 Value:998.9000000000001} {TimestampNanos:1613114860000000000 Value:999.3000000000001} {TimestampNanos:1613114870000000000 Value:999.8} {TimestampNanos:1613114880000000000 Value:999.5} {TimestampNanos:1613114890000000000 Value:999.5} {TimestampNanos:1613114900000000000 Value:904.2} {TimestampNanos:1613114910000000000 Value:754.8} {TimestampNanos:1613114920000000000 Value:980} {TimestampNanos:1613114930000000000 Value:993.7} {TimestampNanos:1613114940000000000 Value:994.6} {TimestampNanos:1613114950000000000 Value:996.7}]

	cluster.go:2687,kv.go:536,test_runner.go:767: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2675
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2683
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2731
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2645
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@83e70ce84b740e27e721c3b73c38a4b8b515094a:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2665,errgroup.go:57: QPS of 877.30 at time 2021-02-19 07:09:20 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1613718550000000000 Value:998.2} {TimestampNanos:1613718560000000000 Value:877.3000000000001} {TimestampNanos:1613718570000000000 Value:625.2} {TimestampNanos:1613718580000000000 Value:866.8} {TimestampNanos:1613718590000000000 Value:986.1000000000001} {TimestampNanos:1613718600000000000 Value:987.7} {TimestampNanos:1613718610000000000 Value:986.4000000000001} {TimestampNanos:1613718620000000000 Value:992.2} {TimestampNanos:1613718630000000000 Value:997.2} {TimestampNanos:1613718640000000000 Value:998.1} {TimestampNanos:1613718650000000000 Value:998.8000000000001} {TimestampNanos:1613718660000000000 Value:1000.6} {TimestampNanos:1613718670000000000 Value:997.5} {TimestampNanos:1613718680000000000 Value:997.6} {TimestampNanos:1613718690000000000 Value:994.6} {TimestampNanos:1613718700000000000 Value:994.5} {TimestampNanos:1613718710000000000 Value:996} {TimestampNanos:1613718720000000000 Value:990.5} {TimestampNanos:1613718730000000000 Value:988}]

	cluster.go:2687,kv.go:536,test_runner.go:767: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2675
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2683
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2731
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2645
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@bdff5338ca725bf1cfddf7e3f648bbf02ab42999:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2666,errgroup.go:57: QPS of 848.50 at time 2021-03-14 07:06:30 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1615705470000000000 Value:996.4000000000001} {TimestampNanos:1615705480000000000 Value:995.5} {TimestampNanos:1615705490000000000 Value:994.8} {TimestampNanos:1615705500000000000 Value:993.9000000000001} {TimestampNanos:1615705510000000000 Value:994.5} {TimestampNanos:1615705520000000000 Value:994.5} {TimestampNanos:1615705530000000000 Value:994.6000000000001} {TimestampNanos:1615705540000000000 Value:998.1000000000001} {TimestampNanos:1615705550000000000 Value:999.2} {TimestampNanos:1615705560000000000 Value:999.9000000000001} {TimestampNanos:1615705570000000000 Value:994.9000000000001} {TimestampNanos:1615705580000000000 Value:997.1000000000001} {TimestampNanos:1615705590000000000 Value:848.5} {TimestampNanos:1615705600000000000 Value:857.2} {TimestampNanos:1615705610000000000 Value:974.1000000000001} {TimestampNanos:1615705620000000000 Value:625.6} {TimestampNanos:1615705630000000000 Value:659.7} {TimestampNanos:1615705640000000000 Value:995.6000000000001} {TimestampNanos:1615705650000000000 Value:990.4000000000001}]

	cluster.go:2688,kv.go:536,test_runner.go:767: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2676
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2684
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:767
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2732
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2646
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3
Related:

See this test on roachdash
powered by pkg/cmd/internal/issues

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@ee9f47b9ec9476a693464e2dcd09a01bf9d39ad2:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2666,errgroup.go:57: QPS of 625.80 at time 2021-03-19 06:03:00 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1616133730000000000 Value:998.6} {TimestampNanos:1616133740000000000 Value:994} {TimestampNanos:1616133750000000000 Value:994.5} {TimestampNanos:1616133760000000000 Value:995.4000000000001} {TimestampNanos:1616133770000000000 Value:991.6000000000001} {TimestampNanos:1616133780000000000 Value:625.8000000000001} {TimestampNanos:1616133790000000000 Value:995.2} {TimestampNanos:1616133800000000000 Value:996.6} {TimestampNanos:1616133810000000000 Value:998.5} {TimestampNanos:1616133820000000000 Value:998.6} {TimestampNanos:1616133830000000000 Value:998.3000000000001} {TimestampNanos:1616133840000000000 Value:998.9000000000001} {TimestampNanos:1616133850000000000 Value:999.8000000000001} {TimestampNanos:1616133860000000000 Value:997.6} {TimestampNanos:1616133870000000000 Value:995.8} {TimestampNanos:1616133880000000000 Value:994.7} {TimestampNanos:1616133890000000000 Value:996.9000000000001} {TimestampNanos:1616133900000000000 Value:997.4000000000001} {TimestampNanos:1616133910000000000 Value:996}]

	cluster.go:2688,kv.go:536,test_runner.go:768: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2676
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2684
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:768
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2732
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2646
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3
Related:

See this test on roachdash
powered by pkg/cmd/internal/issues

@cockroach-teamcity
Member Author

(roachtest).kv/gracefuldraining/nodes=3 failed on master@3d19b2cf6b290a152b23722fc32e995eed3b437b:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:526,cluster.go:2666,errgroup.go:57: QPS of 884.50 at time 2021-03-20 06:05:40 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1616220270000000000 Value:999.9000000000001} {TimestampNanos:1616220280000000000 Value:996.5} {TimestampNanos:1616220290000000000 Value:995.7} {TimestampNanos:1616220300000000000 Value:995.4000000000001} {TimestampNanos:1616220310000000000 Value:996.2} {TimestampNanos:1616220320000000000 Value:996.8} {TimestampNanos:1616220330000000000 Value:996.5} {TimestampNanos:1616220340000000000 Value:884.5} {TimestampNanos:1616220350000000000 Value:998.6} {TimestampNanos:1616220360000000000 Value:999.6} {TimestampNanos:1616220370000000000 Value:999.6} {TimestampNanos:1616220380000000000 Value:999.7} {TimestampNanos:1616220390000000000 Value:999.6000000000001} {TimestampNanos:1616220400000000000 Value:999.6} {TimestampNanos:1616220410000000000 Value:997} {TimestampNanos:1616220420000000000 Value:995.7} {TimestampNanos:1616220430000000000 Value:997.6} {TimestampNanos:1616220440000000000 Value:997.9000000000001} {TimestampNanos:1616220450000000000 Value:997.3000000000001}]

	cluster.go:2688,kv.go:536,test_runner.go:768: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2676
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2684
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:536
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:768
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2732
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2646
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError

Artifacts: /kv/gracefuldraining/nodes=3
Related:

See this test on roachdash
powered by pkg/cmd/internal/issues

@tbg
Member

tbg commented Apr 8, 2021

Last failure:

F210408 06:07:57.302899 63 1@server/server.go:322 ⋮ [n2] 2668 clock synchronization error: this node is more than 500ms away from at least half of the known nodes (2 of 4 are within the offset)

cc #62946

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ 69308cce3bb7e660908cb3e2724eedd271ce5585:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:575,cluster.go:2462,errgroup.go:57: QPS of 792.70 at time 2021-06-19 06:17:40 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1624083400000000000 Value:999.5} {TimestampNanos:1624083410000000000 Value:997.5} {TimestampNanos:1624083420000000000 Value:997.5} {TimestampNanos:1624083430000000000 Value:997.5} {TimestampNanos:1624083440000000000 Value:997.3000000000001} {TimestampNanos:1624083450000000000 Value:997.4000000000001} {TimestampNanos:1624083460000000000 Value:792.7} {TimestampNanos:1624083470000000000 Value:819.5} {TimestampNanos:1624083480000000000 Value:999.9000000000001} {TimestampNanos:1624083490000000000 Value:999.6000000000001} {TimestampNanos:1624083500000000000 Value:999.6} {TimestampNanos:1624083510000000000 Value:999.9000000000001} {TimestampNanos:1624083520000000000 Value:999.5} {TimestampNanos:1624083530000000000 Value:998.3} {TimestampNanos:1624083540000000000 Value:996.4000000000001} {TimestampNanos:1624083550000000000 Value:997.4000000000001} {TimestampNanos:1624083560000000000 Value:996.6000000000001} {TimestampNanos:1624083570000000000 Value:997.8000000000001} {TimestampNanos:1624083580000000000 Value:996.2}]

	cluster.go:2484,kv.go:585,test_runner.go:757: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2472
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2480
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:585
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:757
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2528
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2442
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ 6b9d2a15f0c223c8dda04c5b2a39abe784b58bdd:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:576,cluster.go:2464,errgroup.go:57: QPS of 769.90 at time 2021-06-26 06:26:20 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1624688710000000000 Value:999.3} {TimestampNanos:1624688720000000000 Value:995.2} {TimestampNanos:1624688730000000000 Value:997} {TimestampNanos:1624688740000000000 Value:996.4000000000001} {TimestampNanos:1624688750000000000 Value:994.3000000000001} {TimestampNanos:1624688760000000000 Value:996.7} {TimestampNanos:1624688770000000000 Value:995.6} {TimestampNanos:1624688780000000000 Value:769.9000000000001} {TimestampNanos:1624688790000000000 Value:999.3000000000001} {TimestampNanos:1624688800000000000 Value:999.8} {TimestampNanos:1624688810000000000 Value:999.4000000000001} {TimestampNanos:1624688820000000000 Value:999.4000000000001} {TimestampNanos:1624688830000000000 Value:999.7} {TimestampNanos:1624688840000000000 Value:999.1000000000001} {TimestampNanos:1624688850000000000 Value:997.3000000000001} {TimestampNanos:1624688860000000000 Value:995.7} {TimestampNanos:1624688870000000000 Value:996.7} {TimestampNanos:1624688880000000000 Value:997.6} {TimestampNanos:1624688890000000000 Value:995.5}]

	cluster.go:2486,kv.go:586,test_runner.go:757: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2474
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2482
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:586
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:757
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2530
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2444
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:6309
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:208
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ d43d9fddbebac7eff03804a7d86f7b6af119f24f:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:577,cluster.go:2456,errgroup.go:57: QPS of 69.30 at time 2021-06-28 06:17:20 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1624861020000000000 Value:999.6000000000001} {TimestampNanos:1624861030000000000 Value:959.6000000000001} {TimestampNanos:1624861040000000000 Value:69.30000000000001} {TimestampNanos:1624861050000000000 Value:561.8000000000001} {TimestampNanos:1624861060000000000 Value:993.5} {TimestampNanos:1624861070000000000 Value:993} {TimestampNanos:1624861080000000000 Value:993.1000000000001} {TimestampNanos:1624861090000000000 Value:995.7} {TimestampNanos:1624861100000000000 Value:999} {TimestampNanos:1624861110000000000 Value:433.80000000000007} {TimestampNanos:1624861120000000000 Value:399.80000000000007} {TimestampNanos:1624861130000000000 Value:811.3} {TimestampNanos:1624861140000000000 Value:999.5} {TimestampNanos:1624861150000000000 Value:997.2} {TimestampNanos:1624861160000000000 Value:994.7} {TimestampNanos:1624861170000000000 Value:991.9000000000001} {TimestampNanos:1624861180000000000 Value:991.7} {TimestampNanos:1624861190000000000 Value:995.1} {TimestampNanos:1624861200000000000 Value:994.8}]

	cluster.go:2478,kv.go:587,test_runner.go:758: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2466
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2474
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:587
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:758
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2522
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2436
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:6309
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:208
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ 84ec89c77841016da0b9c4c71772a4304bad45a5:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:574,z_cluster.go:2465,errgroup.go:57: QPS of 828.30 at time 2021-07-06 06:17:40 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1625552180000000000 Value:999.7} {TimestampNanos:1625552190000000000 Value:997.4000000000001} {TimestampNanos:1625552200000000000 Value:997.3000000000001} {TimestampNanos:1625552210000000000 Value:997.3000000000001} {TimestampNanos:1625552220000000000 Value:995.7} {TimestampNanos:1625552230000000000 Value:994.3000000000001} {TimestampNanos:1625552240000000000 Value:996.9000000000001} {TimestampNanos:1625552250000000000 Value:995.9000000000001} {TimestampNanos:1625552260000000000 Value:828.3} {TimestampNanos:1625552270000000000 Value:924.8000000000001} {TimestampNanos:1625552280000000000 Value:999.7} {TimestampNanos:1625552290000000000 Value:999.3} {TimestampNanos:1625552300000000000 Value:999.3000000000001} {TimestampNanos:1625552310000000000 Value:916.4000000000001} {TimestampNanos:1625552320000000000 Value:789.8000000000001} {TimestampNanos:1625552330000000000 Value:997.3} {TimestampNanos:1625552340000000000 Value:997.7} {TimestampNanos:1625552350000000000 Value:996.1} {TimestampNanos:1625552360000000000 Value:996.6}]

	z_cluster.go:2487,kv.go:584,z_test_runner.go:765: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_cluster.go:2475
		  | main.(*monitorImpl).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_cluster.go:2483
		  | main.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/kv.go:584
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_test_runner.go:765
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_cluster.go:2531
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_cluster.go:2440
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:6309
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:208
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv-triage

This test on roachdash | Improve this report!

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ 7d0fd136a538b22cbf9bfff03b2885b7783711aa:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:574,z_monitor.go:106,errgroup.go:57: QPS of 777.80 at time 2021-07-08 06:14:50 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1625724760000000000 Value:995.7} {TimestampNanos:1625724770000000000 Value:977.4000000000001} {TimestampNanos:1625724780000000000 Value:975.1000000000001} {TimestampNanos:1625724790000000000 Value:974.2} {TimestampNanos:1625724800000000000 Value:978.4000000000001} {TimestampNanos:1625724810000000000 Value:980.5} {TimestampNanos:1625724820000000000 Value:981.6} {TimestampNanos:1625724830000000000 Value:983.9000000000001} {TimestampNanos:1625724840000000000 Value:993.6000000000001} {TimestampNanos:1625724850000000000 Value:997.3} {TimestampNanos:1625724860000000000 Value:999.3000000000001} {TimestampNanos:1625724870000000000 Value:999.7} {TimestampNanos:1625724880000000000 Value:981.8} {TimestampNanos:1625724890000000000 Value:777.8000000000001} {TimestampNanos:1625724900000000000 Value:961} {TimestampNanos:1625724910000000000 Value:995.5} {TimestampNanos:1625724920000000000 Value:996.5} {TimestampNanos:1625724930000000000 Value:996.5} {TimestampNanos:1625724940000000000 Value:996.8000000000001}]

	z_monitor.go:128,kv.go:584,z_test_runner.go:765: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:116
		  | main.(*monitorImpl).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:124
		  | github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tests.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tests/kv.go:584
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_test_runner.go:765
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:172
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:81
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:6309
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:208
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv-triage

This test on roachdash | Improve this report!

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ 0a48b0b74b0a6057f1d418875b97830359a52ec6:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:574,z_monitor.go:106,errgroup.go:57: QPS of 817.50 at time 2021-07-10 06:13:10 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1625897550000000000 Value:995.2} {TimestampNanos:1625897560000000000 Value:991.9000000000001} {TimestampNanos:1625897570000000000 Value:995.9000000000001} {TimestampNanos:1625897580000000000 Value:995.3} {TimestampNanos:1625897590000000000 Value:817.5} {TimestampNanos:1625897600000000000 Value:465.20000000000005} {TimestampNanos:1625897610000000000 Value:935} {TimestampNanos:1625897620000000000 Value:996.1000000000001} {TimestampNanos:1625897630000000000 Value:997.6000000000001} {TimestampNanos:1625897640000000000 Value:998.9000000000001} {TimestampNanos:1625897650000000000 Value:998.6} {TimestampNanos:1625897660000000000 Value:998.5} {TimestampNanos:1625897670000000000 Value:999.4000000000001} {TimestampNanos:1625897680000000000 Value:998.2} {TimestampNanos:1625897690000000000 Value:996} {TimestampNanos:1625897700000000000 Value:995.3000000000001} {TimestampNanos:1625897710000000000 Value:997} {TimestampNanos:1625897720000000000 Value:997.4000000000001} {TimestampNanos:1625897730000000000 Value:995.1}]

	z_monitor.go:128,kv.go:584,z_test_runner.go:765: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:116
		  | main.(*monitorImpl).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:124
		  | github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tests.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tests/kv.go:584
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_test_runner.go:765
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:172
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/z_monitor.go:81
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:6309
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:208
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv-triage

This test on roachdash | Improve this report!

@cockroach-teamcity
Member Author

roachtest.kv/gracefuldraining/nodes=3 failed with artifacts on master @ 5a5b3dc446fcfc2d3e28b6775ae9bb1a63376210:

The test failed on branch=master, cloud=gce:
test artifacts and logs in: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/artifacts/kv/gracefuldraining/nodes=3/run_1
	kv.go:574,monitor.go:106,errgroup.go:57: QPS of 773.50 at time 2021-07-19 14:31:10 +0000 UTC is below minimum allowable QPS of 900.00; entire timeseries: [{TimestampNanos:1626704980000000000 Value:994.4000000000001} {TimestampNanos:1626704990000000000 Value:994.4000000000001} {TimestampNanos:1626705000000000000 Value:994.7} {TimestampNanos:1626705010000000000 Value:994.6} {TimestampNanos:1626705020000000000 Value:993.7} {TimestampNanos:1626705030000000000 Value:991.5} {TimestampNanos:1626705040000000000 Value:991.8000000000001} {TimestampNanos:1626705050000000000 Value:996.3000000000001} {TimestampNanos:1626705060000000000 Value:997.1000000000001} {TimestampNanos:1626705070000000000 Value:773.5} {TimestampNanos:1626705080000000000 Value:959.7} {TimestampNanos:1626705090000000000 Value:999.4000000000001} {TimestampNanos:1626705100000000000 Value:999.7} {TimestampNanos:1626705110000000000 Value:995.9000000000001} {TimestampNanos:1626705120000000000 Value:994.3000000000001} {TimestampNanos:1626705130000000000 Value:995.6} {TimestampNanos:1626705140000000000 Value:995.3000000000001} {TimestampNanos:1626705150000000000 Value:996.1000000000001} {TimestampNanos:1626705160000000000 Value:993}]

	monitor.go:128,kv.go:584,test_runner.go:765: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/monitor.go:116
		  | main.(*monitorImpl).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/monitor.go:124
		  | github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tests.registerKVGracefulDraining.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tests/kv.go:584
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:765
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitorImpl).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/monitor.go:172
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/monitor.go:81
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:6309
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:208
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh kv/gracefuldraining/nodes=3

Same failure on other branches

/cc @cockroachdb/kv-triage

This test on roachdash | Improve this report!

@tbg tbg added the S-1 High impact: many users impacted, serious risk of high unavailability or data loss label Feb 1, 2022
@kvoli
Collaborator

kvoli commented Mar 13, 2023

Ran this on master and it is passing. I'm going to stress overnight and see if there are any failures to debug. If none pop up, I'll unskip this test as it seems useful to have.

@kvoli
Collaborator

kvoli commented Mar 14, 2023

Fails 4/57 runs on master. The test itself seems valuable, so I'll see if there are any quick changes that would make it less flaky.

@kvoli
Collaborator

kvoli commented Mar 15, 2023

Looking at one of the test failures.

23:41:38 test_impl.go:344: test failure #1: full stack retained in failure_1.log: (kv.go:644).2: QPS of 653.20 at time 2023-03-13 23:41:00 +0000 UTC is below minimum allowable QPS of 900.00;

However, when I look at Prometheus I see that the QPS didn't drop. Note though that the Prometheus query uses a rate over 30 seconds, which could be why.

EDIT: The time period on the graph was incorrect. I forgot that my Grafana is configured in local time (-3hr relative to the roachprod timestamps).

The actual graph is very similar, but the workload runner output below shows that we do hit an issue. The graphs in Prometheus use a rate over 30s, which could explain the difference:

image

kvoli added a commit to kvoli/cockroach that referenced this issue Mar 15, 2023
Enable `kv/gracefuldraining/nodes=3`. The test was skipped in cockroachdb#67798 due
to flakes. The test is updated slightly to prevent future flakes. Before
the changes in this commit, the test failed about 7% of sampled test
runs (/50). The failures were caused by the QPS metric dropping below
the target threshold during drain/restarts. The QPS metric the test used
(from the internal time series) did not match up the scraped metric from
Prometheus or the workload when QPS dropped.

This commit updates the test to gather the QPS metric from Prometheus
rather than internal time series. There were 0 failures over 20 runs.

Resolves: cockroachdb#59094

Release note: None
@kvoli
Collaborator

kvoli commented Mar 16, 2023

It seems that the workload rate did indeed drop for 5 seconds from the baseline 1k ops/s in one of the test failures. So the metrics are correct; the test could use the workload runner's instantaneous ops/s for a better signal.

I think the value must have been smoothed out over 30s, but there definitely was an impact, which probably resulted from draining/restarting.

I230313 23:37:57.979754 1 workload/cli/run.go:460  [T1] 3  creating load generator... done (took 4.73502ms)
.... 23:41:00 is at 183s
_elapsed___errors__ops/sec(inst)___ops/sec(cum)__p50(ms)__p95(ms)__p99(ms)_pMax(ms)
  181.0s        0          508.8          993.2      2.6     50.3     58.7     65.0 write
  182.0s        0          192.1          988.8     44.0     50.3     52.4     52.4 write
  183.0s        0          183.9          984.4     46.1     52.4     52.4     54.5 write
  184.0s        0          208.0          980.1     37.7     50.3     52.4     52.4 write
  185.0s        0          184.0          975.8     46.1     52.4     52.4     52.4 write
  186.0s        0          889.1          975.4      2.5      8.4     37.7     50.3 write
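
To illustrate the smoothing effect, here is a rough sketch that parses the rows above, flags the seconds below the 900 QPS threshold, and averages them over windows of different lengths. It assumes the rate sat at the 1k ops/s baseline for the rest of each window and that the window fully contains the dip; real scrape windows will not line up this neatly. The helper names are hypothetical.

```python
MIN_QPS = 900.0    # threshold quoted in the failure message
BASELINE = 1000.0  # assumption: steady-state rate outside the logged rows

# Rows copied from the workload output above.
ROWS = """\
_elapsed___errors__ops/sec(inst)___ops/sec(cum)__p50(ms)__p95(ms)__p99(ms)_pMax(ms)
  181.0s        0          508.8          993.2      2.6     50.3     58.7     65.0 write
  182.0s        0          192.1          988.8     44.0     50.3     52.4     52.4 write
  183.0s        0          183.9          984.4     46.1     52.4     52.4     54.5 write
  184.0s        0          208.0          980.1     37.7     50.3     52.4     52.4 write
  185.0s        0          184.0          975.8     46.1     52.4     52.4     52.4 write
  186.0s        0          889.1          975.4      2.5      8.4     37.7     50.3 write
"""


def parse_inst_rates(text):
    """Return [(elapsed_seconds, ops_per_sec_inst)] for each data row."""
    out = []
    for line in text.splitlines():
        fields = line.split()
        # Data rows look like: <elapsed>s <errors> <ops/sec(inst)> <ops/sec(cum)> ...
        if len(fields) >= 4 and fields[0].endswith("s"):
            try:
                out.append((float(fields[0][:-1]), float(fields[2])))
            except ValueError:
                continue  # not a data row (e.g. a header)
    return out


def smoothed(rates, window_s):
    """Mean rate over a window_s-second window that contains every sample in
    `rates` (one per second), with the remaining seconds at BASELINE."""
    if window_s < len(rates):
        raise ValueError("window is shorter than the sampled dip")
    return (sum(rates) + BASELINE * (window_s - len(rates))) / window_s


samples = parse_inst_rates(ROWS)
below = [(t, r) for t, r in samples if r < MIN_QPS]
rates = [r for _, r in samples]

print("seconds below threshold:", [t for t, _ in below])
for w in (10, 30, 60):
    print(f"{w:>2}s window mean: {smoothed(rates, w):.1f} ops/s")
```

The longer the averaging window, the closer the dip is pulled back toward the baseline, which is consistent with a graph rated over 30s looking much flatter than the per-second workload output.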

@kvoli
Collaborator

kvoli commented Mar 17, 2023

This appears to be a legitimate issue, though I'm not certain of the cause. The cluster does show a very undesirable symptom, however: leases are thrashing between n1 and n2 due to stale/incorrect data and racing between the replicate queue and the store rebalancer.

image

This could be due to the more frequent gossiping that occurs when many lease transfers happen, which is what we do during a drain. If gossip is frequently untimely and overwrites the storepool estimates with descriptors from target stores that recently received a lease, the lease load won't be included for 5 seconds. This will cause thrashing. I believe I've seen this elsewhere in the ycsb test, and lowered the frequency of gossip due to capacity changes as a result.
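
As a rough illustration of why stale load data can cause thrashing, here is a toy model. It is not CockroachDB's store rebalancer or replicate queue: two stores hold leases, and on every tick a single hypothetical balancer moves a fixed batch of leases from the store it believes is more loaded to the other. With `staleness=0` it sees current counts; with `staleness=3` it sees counts from three ticks ago, standing in for estimates that omit recently transferred leases.

```python
def simulate(ticks: int, staleness: int, start=(30, 10), step=5, threshold=2) -> int:
    """Return how many times the transfer direction reversed (a proxy for thrashing)."""
    actual = list(start)
    history = [tuple(actual)]  # history[t] = actual lease counts before tick t
    flips = 0
    last_dir = 0
    for t in range(ticks):
        # The balancer decides using counts that are `staleness` ticks old.
        view = history[max(0, t - staleness)]
        diff = view[0] - view[1]
        if abs(diff) > threshold:
            src, dst = (0, 1) if diff > 0 else (1, 0)
            moved = min(step, actual[src])
            actual[src] -= moved
            actual[dst] += moved
            direction = 1 if src == 0 else -1
            if last_dir and direction != last_dir:
                flips += 1
            last_dir = direction
        history.append(tuple(actual))
    return flips


print("direction reversals with fresh data:", simulate(ticks=20, staleness=0))
print("direction reversals with stale data:", simulate(ticks=20, staleness=3))
```

With fresh data the toy balancer converges and stops. With stale data it keeps acting on an imbalance that has already been corrected, overshoots, and then reverses, which is the general shape of the thrashing described above.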

In any case, the failure is legitimate and I'm investigating some updates to the store rebalancer to prevent the thrashing, which could possibly be the culprit.

@kvoli kvoli removed the GA-blocker label Mar 17, 2023
@kvoli
Collaborator

kvoli commented Mar 17, 2023

Removing the GA blocker. The failure rate appears lower than when this test was skipped. We do want to re-enable the test, as it is useful. However, we need to fix the cause of the failures.

kvoli added a commit to kvoli/cockroach that referenced this issue Mar 30, 2023
This commit re-enables the `kv/gracefuldraining/nodes=3` roachtest. The
test is still likely to fail occasionally; however, it has already
produced interesting findings during testing to re-enable it.

Informs: cockroachdb#59094

Release note: None
craig bot pushed a commit that referenced this issue May 9, 2023
98720: roachtest: enable kv/gracefuldraining/nodes=3 r=andrewbaptist a=kvoli

This commit re-enables the `kv/gracefuldraining/nodes=3` roachtest. The
test is still likely to fail occasionally; however, it has already
produced interesting findings during testing to re-enable it.

Informs: #59094

Release note: None

101729: streamingccl: don't require TLS certificates r=dt a=stevendanna

Users may want to use password auth to simplify their replication setup. While we may recommend TLS certificate auth, I don't see a strong reason to _require_ it.

Epic: none

Release note: None

102825: kvserver,storepool: misc rebalance logging improvements r=andrewbaptist a=kvoli

The store list string returned the mean leases, ranges and
queries-per-second float values without limiting the number of decimal
places. This led to log lines with needlessly long decimals:

`avg-ranges=40.66666666666667... avg-leases=10.166666666666666...`

This PR updates the store list string formatting to 2 decimal
places for float values.

Previously, the easiest method of determining the current rebalance
objective from logs was to view the cluster setting and check for
logging indicating a mixed version cluster - this was cumbersome.

This PR annotates the ctx in the store rebalancer loop with an
additional tag: `obj`. The `obj` tag indicates the current rebalance
objective, either `cpu` or `qps` currently.

resolves: #102812


Release note: None

Co-authored-by: Austen McClernon <[email protected]>
Co-authored-by: Steven Danna <[email protected]>
@kvoli
Collaborator

kvoli commented Jun 1, 2023

Going to consolidate test discussion on #103270, since any future failures will show up there due to the test naming format changing.

@kvoli kvoli closed this as completed Jun 1, 2023