c2c: pick a uniform start timestamp across partitions when creating a tenant replication stream #92742

adityamaru · 2022-11-30T15:02:03Z

Previously, each partition would reach out to the
source cluster and pick its own timestamp from which it would start ingesting MVCC versions. This timestamp was used by the rangefeed setup by the partition, to run its initial scan. Eventually, all the partitions would replicate up until a certain timestamp and cause the frontier to be bumped but it was possible for different partitions to begin ingesting at different timestamps.

This change makes it such that during replication planning when we create the producer job on the source cluster, we return a timestamp along with the StreamID. This becomes the timestamp at which each ingestion partition sets up the initial scan of the rangefeed, and consequently becomes the initial timestamp at which all data is ingested. We stash this timestamp in the replication job details and never update its value.

The motivation for this change was to know the lower bound on both the source and destination cluster for MVCC versions that have been streamed. This is necessary to bound the fingerprinting on both clusters to ensure a match.

Epic: CRDB-18749

Jira issue: CRDB-21946

blathers-crl · 2022-11-30T15:02:08Z

cc @cockroachdb/disaster-recovery

blathers-crl · 2022-11-30T15:21:08Z

cc @cockroachdb/disaster-recovery

Previously, each partition would reach out to the source cluster and pick its own timestamp from which it would start ingesting MVCC versions. This timestamp was used by the rangefeed setup by the partition, to run its initial scan. Eventually, all the partitions would replicate up until a certain timestamp and cause the frontier to be bumped but it was possible for different partitions to begin ingesting at different timestamps. This change makes it such that during replication planning when we create the producer job on the source cluster, we return a timestamp alongwith the StreamID. This becomes the timestamp at which each ingestion partition sets up the inital scan of the rangefeed, and consequently become the inital timestamp at which all data is ingested. We stash this timestamp in the replication job details and never update its value. On future resumptions of the replication job, if there is a progress high water, we will not run an initial rangefeed scan but instead start the rangefeed from the previous progress highwater. The motivation for this change was to know the lower bound on both the source and destination cluster for MVCC versions that have been streamed. This is necessary to bound the fingerprinting on both clusters to ensure a match. Release note: None Fixes: cockroachdb#92742

92788: streamingccl: tighten replication timestamp semantics r=lidorcarmel a=adityamaru Previously, each partition would reach out to the source cluster and pick its own timestamp from which it would start ingesting MVCC versions. This timestamp was used by the rangefeed setup by the partition, to run its initial scan. Eventually, all the partitions would replicate up until a certain timestamp and cause the frontier to be bumped but it was possible for different partitions to begin ingesting at different timestamps. This change makes it such that during replication planning when we create the producer job on the source cluster, we return a timestamp alongwith the StreamID. This becomes the timestamp at which each ingestion partition sets up the inital scan of the rangefeed, and consequently become the inital timestamp at which all data is ingested. We stash this timestamp in the replication job details and never update its value. On future resumptions of the replication job, if there is a progress high water, we will not run an initial rangefeed scan but instead start the rangefeed from the previous progress highwater. The motivation for this change was to know the lower bound on both the source and destination cluster for MVCC versions that have been streamed. This is necessary to bound the fingerprinting on both clusters to ensure a match. Release note: None Fixes: #92742 Co-authored-by: adityamaru <[email protected]>

adityamaru added C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. A-disaster-recovery labels Nov 30, 2022

blathers-crl bot added the T-disaster-recovery label Nov 30, 2022

adityamaru self-assigned this Nov 30, 2022

exalate-issue-sync bot unassigned adityamaru Nov 30, 2022

exalate-issue-sync bot removed the T-disaster-recovery label Nov 30, 2022

exalate-issue-sync bot assigned adityamaru Nov 30, 2022

exalate-issue-sync bot added the T-disaster-recovery label Nov 30, 2022

adityamaru mentioned this issue Dec 1, 2022

streamingccl: tighten replication timestamp semantics #92788

Merged

craig bot closed this as completed in 523b79d Dec 13, 2022

github-project-automation bot added this to Disaster Recovery Backlog Aug 28, 2024

github-project-automation bot moved this to Done in Disaster Recovery Backlog Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

c2c: pick a uniform start timestamp across partitions when creating a tenant replication stream #92742

c2c: pick a uniform start timestamp across partitions when creating a tenant replication stream #92742

adityamaru commented Nov 30, 2022 •

edited by cockroach-jira-scripts

Loading

blathers-crl bot commented Nov 30, 2022

blathers-crl bot commented Nov 30, 2022

c2c: pick a uniform start timestamp across partitions when creating a tenant replication stream #92742

c2c: pick a uniform start timestamp across partitions when creating a tenant replication stream #92742

Comments

adityamaru commented Nov 30, 2022 • edited by cockroach-jira-scripts Loading

blathers-crl bot commented Nov 30, 2022

blathers-crl bot commented Nov 30, 2022

adityamaru commented Nov 30, 2022 •

edited by cockroach-jira-scripts

Loading