
streamingccl: split destination based on source #111178

Merged (1 commit) on Sep 30, 2023

Conversation

stevendanna (Collaborator)

This adds initial splits and scatters to the stream ingestion job. We split based on the topology delivered by the source to align our initial splits with the splits that are likely coming in from the source cluster.

These splits substantially improve the throughput of a stream's initial scan.

Careful reviewers may note that we are not calling EnsureSafeSplitKey here. That is because the Spans in the source topology should already be safe split keys. Since EnsureSafeSplitKey isn't idempotent, if we were to call it, we would end up creating splits at rather poor locations, since any key that ends in an integer would have components erroneously trimmed from it.

An alternative here would be to do what the buffering adder does and create an initial set of splits by distributing them over the keyspace of the first buffer. That has the advantage of allowing us to call EnsureSafeSplitKey at the cost of less effective splits.

Note that we face this same problem during bulk operations. Perhaps in the future the source cluster should send some metadata about manual splits that are issued, or something else that lets the destination know that a large amount of data is expected in a particular span.
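
For illustration, the destination-side pass described above might look roughly like the sketch below. splitAndScatter and splitAndScatterer are the helpers added in this patch; the Topology/Partitions/Spans field names are assumptions rather than the exact types used here.

// Sketch only: walk the source topology and issue one split and scatter
// per partition span start key. The keys are used as-is, without
// EnsureSafeSplitKey, for the reasons described above.
func splitDestinationFromTopology(
    ctx context.Context, topology streamclient.Topology, s splitAndScatterer,
) error {
    for _, partition := range topology.Partitions {
        for _, sp := range partition.Spans {
            // Span boundaries come from PartitionSpans on the source and are
            // assumed to already be safe split keys.
            if err := splitAndScatter(ctx, sp.Key, s); err != nil {
                return err
            }
        }
    }
    return nil
}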

Epic: none

Release note: None

@stevendanna stevendanna requested review from a team as code owners September 24, 2023 16:55
@stevendanna stevendanna requested review from lidorcarmel and removed request for a team September 24, 2023 16:55

if err := ingestionJob.NoTxn().Update(ctx, func(txn isql.Txn, md jobs.JobMetadata, ju *jobs.JobUpdater) error {
md.Progress.GetStreamIngest().ReplicationStatus = jobspb.Replicating
md.Progress.GetStreamIngest().InitialSplitComplete = true
md.Progress.RunningStatus = "physical replication running"
Contributor:
Could the splits take a while? Does it make sense to update the progress to indicate that "replication is running", and then update it again once the splits have concluded?

stevendanna (Author):

Seems reasonable
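
A sketch of what that could look like, reusing the update pattern from the hunk above; the intermediate status string and the exact points at which the updates happen are illustrative assumptions, not code from this patch:

// Sketch: surface the split phase in the job's running status first,
// then flip to the replicating status once the splits are done.
if err := ingestionJob.NoTxn().Update(ctx, func(txn isql.Txn, md jobs.JobMetadata, ju *jobs.JobUpdater) error {
    md.Progress.RunningStatus = "creating initial splits"
    ju.UpdateProgress(md.Progress)
    return nil
}); err != nil {
    return err
}

// ... perform the initial splits and scatters ...

if err := ingestionJob.NoTxn().Update(ctx, func(txn isql.Txn, md jobs.JobMetadata, ju *jobs.JobUpdater) error {
    md.Progress.GetStreamIngest().ReplicationStatus = jobspb.Replicating
    md.Progress.GetStreamIngest().InitialSplitComplete = true
    md.Progress.RunningStatus = "physical replication running"
    ju.UpdateProgress(md.Progress)
    return nil
}); err != nil {
    return err
}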

// get erroneously treated as the column family length.
//
// Since the partitions are generated from a call to
// PartitionSpans on the source cluster, they should be
Contributor:
I wonder if it makes sense for the source cluster to create an iterator and read the key (SeekGE) that's at or after the split point? This could guarantee that the explicit split keys are valid.

stevendanna (Author):
Just to make sure I'm following.

This would look like taking the output of PartitionSpans and iterating over the results: for each start key of the spans returned by PartitionSpans, we SeekGE to that key and return the first key found rather than the split point generated by PartitionSpans.

The pro here is that the key returned is definitely going to be a SQL key that we can run back through EnsureSafeSplitKey.

The cons here would be a bit of added complexity on the source planning side, and also that, at least for some time after the initial scan, the destination would have splits that are just a little off from the source. So if a lot of data gets written "in between" the start according to PartitionSpans and the actual start key, that data would now go to the LHS range. During the initial scan this shouldn't matter, because we scan at a fixed timestamp that should be rather close to when we produced the original plan.

Contributor:
Yup; that about sums it up. I was just trying to come up with a way to avoid the "scary" warning you wrote about not calling EnsureSafeSplitKey.
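
For concreteness, the alternative being discussed might look roughly like the sketch below. seekFirstKey is a hypothetical helper standing in for a SeekGE against the source's storage engine; none of these names come from this patch.

// Sketch: before handing spans to the destination, nudge each start key
// forward to the first key that actually exists in the span, so that it
// would round-trip safely through EnsureSafeSplitKey.
func adjustPartitionSpans(
    ctx context.Context,
    spans []roachpb.Span,
    seekFirstKey func(context.Context, roachpb.Span) (roachpb.Key, error),
) ([]roachpb.Span, error) {
    out := make([]roachpb.Span, 0, len(spans))
    for _, sp := range spans {
        k, err := seekFirstKey(ctx, sp)
        if err != nil {
            return nil, err
        }
        if len(k) == 0 {
            // Empty span: keep the planned split point unchanged.
            k = sp.Key
        }
        out = append(out, roachpb.Span{Key: k, EndKey: sp.EndKey})
    }
    return out, nil
}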

func splitAndScatter(
ctx context.Context, splitAndScatterKey roachpb.Key, s splitAndScatterer,
) error {
log.Infof(ctx, "splitting and scattering at %s", splitAndScatterKey)
Contributor:
do we want to vmodule this?
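
For reference, a minimal sketch of gating that line behind vmodule (the verbosity level chosen here is arbitrary):

// Sketch: only emit the per-key log line when verbose logging is enabled,
// e.g. via --vmodule=<file>=2.
if log.V(2) {
    log.Infof(ctx, "splitting and scattering at %s", splitAndScatterKey)
}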

ctx context.Context, splitAndScatterKey roachpb.Key, s splitAndScatterer,
) error {
log.Infof(ctx, "splitting and scattering at %s", splitAndScatterKey)
expirationTime := s.now().AddDuration(splitAndScatterSitckBitDuration)
Contributor:
nit: misspelling in StickyBit

pkg/jobs/jobspb/jobs.proto (resolved)
@stevendanna (Author):

bors r=adityamaru

craig bot commented Sep 30, 2023

Build succeeded.
