ccl/sqlproxyccl: include DRAINING pods in the directory cache #79368

jaylim-crl · 2022-04-04T19:38:16Z

Previously, #67452 removed DRAINING pods from the directory cache. This commit
adds that back. The connector will now need to filter for RUNNING pods manually
before invoking the balancer. This is needed so that we could track DRAINING
pods, and wait until 60 seconds has elapsed before transferring connections
away from them. To support that, we also update the Pod's proto definition to
include a StateTimestamp field to reprevent that timestamp that the state field
was last updated.

The plan is to have a polling mechanism every X seconds to check DRAINING pods,
and use that information to start migrating connections.

Release note: None

Jira issue: CRDB-14759

cockroach-teamcity · 2022-04-04T19:38:27Z

This change is

andy-kimball

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @jaylim-crl and @jeffswenson)

pkg/ccl/sqlproxyccl/tenant/directory.proto, line 48 at r1 (raw file):

  // TenantID is the tenant that owns the pod.
  uint64 tenant_id = 2 [(gogoproto.customname) = "TenantID"];
  // addr is the ip and port combination identifying the tenant pod, (e.g.

NIT: The first letter of these comments should be capitalized, b/c this comment gets written to the generated Go file, where the field is capitalized. Our convention is to make the comment match the generated Go definition, not the protobuf definition.

pkg/ccl/sqlproxyccl/tenant/directory_cache.go, line 197 at r1 (raw file):

	// Trigger resumption if there are no RUNNING pods.
	runningPods := make([]*Pod, 0, len(tenantPods))

You're not actually using runningPods here. Instead, you can iterate through the tenantPods list and keep a foundRunning boolean that you set if you find a RUNNING pod.

Previously, cockroachdb#67452 removed DRAINING pods from the directory cache. This commit adds that back. The connector will now need to filter for RUNNING pods manually before invoking the balancer. This is needed so that we could track DRAINING pods, and wait until 60 seconds has elapsed before transferring connections away from them. To support that, we also update the Pod's proto definition to include a StateTimestamp field to reprevent that timestamp that the state field was last updated. The plan is to have a polling mechanism every X seconds to check DRAINING pods, and use that information to start migrating connections. Release note: None

jaylim-crl

TFTR!

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @andy-kimball and @jeffswenson)

pkg/ccl/sqlproxyccl/tenant/directory.proto, line 48 at r1 (raw file):

Previously, andy-kimball (Andy Kimball) wrote…

NIT: The first letter of these comments should be capitalized, b/c this comment gets written to the generated Go file, where the field is capitalized. Our convention is to make the comment match the generated Go definition, not the protobuf definition.

Done.

pkg/ccl/sqlproxyccl/tenant/directory_cache.go, line 197 at r1 (raw file):

Previously, andy-kimball (Andy Kimball) wrote…

You're not actually using runningPods here. Instead, you can iterate through the tenantPods list and keep a foundRunning boolean that you set if you find a RUNNING pod.

Whoops - done.

jeffswenson

LGTM

jaylim-crl · 2022-04-05T19:37:00Z

TFTR!

bors r=JeffSwenson

craig · 2022-04-05T19:37:04Z

Already running a review

craig · 2022-04-05T21:09:50Z

Build succeeded:

GitHub CI (Cockroach)

jaylim-crl marked this pull request as ready for review April 4, 2022 19:38

jaylim-crl requested review from a team as code owners April 4, 2022 19:38

jaylim-crl requested review from jeffswenson and andy-kimball and removed request for a team April 4, 2022 19:39

andy-kimball reviewed Apr 5, 2022

View reviewed changes

jaylim-crl force-pushed the jay/220404-cache-draining-pods branch from 5c78e85 to 7586717 Compare April 5, 2022 05:07

jaylim-crl requested a review from andy-kimball April 5, 2022 05:08

jaylim-crl commented Apr 5, 2022

View reviewed changes

jeffswenson approved these changes Apr 5, 2022

View reviewed changes

jaylim-crl added backport-22.1.x labels Apr 5, 2022

craig bot merged commit 985344a into cockroachdb:master Apr 5, 2022

blathers-crl bot mentioned this pull request Apr 5, 2022

release-22.1: ccl/sqlproxyccl: include DRAINING pods in the directory cache #79457

Merged

tbg mentioned this pull request Jun 22, 2022

roachperf: regression around 2022-04-06 #82136

Closed

jaylim-crl deleted the jay/220404-cache-draining-pods branch November 30, 2022 16:55

jaylim-crl restored the jay/220404-cache-draining-pods branch November 30, 2022 16:55

jaylim-crl deleted the jay/220404-cache-draining-pods branch November 30, 2022 16:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ccl/sqlproxyccl: include DRAINING pods in the directory cache #79368

ccl/sqlproxyccl: include DRAINING pods in the directory cache #79368

jaylim-crl commented Apr 4, 2022 •

edited by cockroach-jira-scripts

Loading

cockroach-teamcity commented Apr 4, 2022

andy-kimball left a comment

jaylim-crl left a comment

jeffswenson left a comment

jaylim-crl commented Apr 5, 2022

craig bot commented Apr 5, 2022

craig bot commented Apr 5, 2022

ccl/sqlproxyccl: include DRAINING pods in the directory cache #79368

ccl/sqlproxyccl: include DRAINING pods in the directory cache #79368

Conversation

jaylim-crl commented Apr 4, 2022 • edited by cockroach-jira-scripts Loading

cockroach-teamcity commented Apr 4, 2022

andy-kimball left a comment

Choose a reason for hiding this comment

jaylim-crl left a comment

Choose a reason for hiding this comment

jeffswenson left a comment

Choose a reason for hiding this comment

jaylim-crl commented Apr 5, 2022

craig bot commented Apr 5, 2022

craig bot commented Apr 5, 2022

jaylim-crl commented Apr 4, 2022 •

edited by cockroach-jira-scripts

Loading