Resume CHANGEFEED from last committed cursor timestamp #65573

GlennFawcett-doordash · 2021-05-21T20:23:59Z

Is your feature request related to a problem? Please describe.

A CHANGEFEED can fail for multiple reasons internal to CRDB, the Kafka endpoint, etc... After the CHANGEFEED fails, you have to figure out exactly when it failed from the logs and restart from that point after converting to the cluster_logical_timestamp epoch format.

Describe the solution you'd like

I would like for CRDB to store the last committed timestamp and have some way to simply RESUME from that point. You can easily PAUSE and RESUME jobs, but if the job fails you must resubmit. If the changefeed was created as an object in the database instead of a job, we could do something like:

RESUME CHANGEFEED <mychangefeed>

And it would simple pickup from where it left off.

Epic CRDB-2397

The text was updated successfully, but these errors were encountered:

blathers-crl · 2021-05-21T20:24:02Z

Hello, I am Blathers. I am here to help you get the issue triaged.

I have CC'd a few people who may be able to assist you:

@cockroachdb/cdc (found keywords: CHANGEFEED,Kafka)

If we have not gotten back to your issue within a few business days, you can try the following:

Join our community slack channel and ask on #cockroachdb.
Try find someone from here if you know they worked closely on the area and CC them.

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is otan.}

amruss · 2021-05-24T14:08:54Z

Note: Bulk IO also considering something similar, we probably want to do something more generic like RESUME FAILED JOB _job_id_ (RESUME JOB FROM FAILURE, RESUME FAILED JOB, RESURECT JOB, etc.) and spawn a new job id - with the same parameters / settings

We would want to error out for jobs that aren't changefeed jobs right now

amruss · 2021-07-23T00:26:16Z

After talking with the team, we will likely want to do this instead

miretskiy · 2021-07-27T20:55:13Z

@amruss this can probably be closed? We have #36887 issue and few others that we're working on.

spiffyy99 · 2021-08-03T18:58:01Z

going to go ahead and close this, have a fix here: #68176

GlennFawcett-doordash added the C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) label May 21, 2021

blathers-crl bot added O-community Originated from the community X-blathers-triaged blathers was able to find an owner labels May 21, 2021

shermanCRL added the A-cdc Change Data Capture label May 24, 2021

blathers-crl bot added the T-cdc label May 24, 2021

shermanCRL added A-cdc Change Data Capture and removed A-cdc Change Data Capture labels May 24, 2021

amruss assigned stevendanna May 24, 2021

amruss assigned spiffyy99 Jul 14, 2021

exalate-issue-sync bot unassigned spiffyy99 Jul 15, 2021

amruss assigned spiffyy99 and unassigned stevendanna Jul 21, 2021

spiffyy99 closed this as completed Aug 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resume CHANGEFEED from last committed cursor timestamp #65573

Resume CHANGEFEED from last committed cursor timestamp #65573

GlennFawcett-doordash commented May 21, 2021 •

edited by exalate-issue-sync bot

Loading

blathers-crl bot commented May 21, 2021

amruss commented May 24, 2021 •

edited

Loading

amruss commented Jul 23, 2021

miretskiy commented Jul 27, 2021

spiffyy99 commented Aug 3, 2021

Resume CHANGEFEED from last committed cursor timestamp #65573

Resume CHANGEFEED from last committed cursor timestamp #65573

Comments

GlennFawcett-doordash commented May 21, 2021 • edited by exalate-issue-sync bot Loading

blathers-crl bot commented May 21, 2021

amruss commented May 24, 2021 • edited Loading

amruss commented Jul 23, 2021

miretskiy commented Jul 27, 2021

spiffyy99 commented Aug 3, 2021

GlennFawcett-doordash commented May 21, 2021 •

edited by exalate-issue-sync bot

Loading

amruss commented May 24, 2021 •

edited

Loading