sql: fail schema changes on GC threshold errors #24293

maddyblue · 2018-03-28T18:07:16Z

Index backfills use a single readAsOf time for their entire job. If
the backfill took longer than the GC TTL (which defaults to 25h but
could easily be set to a few minutes) then the schema change would
retry forever. This happened because the schema changer didn't treat
that error as a fatal schema change error, even though it would recur
on every attempt after the first occurrence. This would render the
table schema unchangeable, and probably undroppable as well, and the
cluster would forever be retrying the schema change.

Add a specific type for this error so it can be detected.

Release note (bug fix): prevent index backfills from failing in a
loop after exceeding the GC TTL of their source table.

jordanlewis · 2018-03-29T03:03:44Z

This doesn't seem right to me. Many things in the system are classified as internalError, right? I can't figure out by looking at the code what is castable to an internalError and what is not... we'd have to see a list before understanding whether this change is correct or not.

maddyblue · 2018-03-29T05:23:05Z

Thanks for prodding me to do better. I've added a new error type for this and it seems to be ok now.

maddyblue · 2018-04-02T17:38:15Z

ping @jordanlewis

jordanlewis · 2018-04-02T17:41:07Z

This looks good with regard to the schema change logic, but I can't say with certainty whether the condition for failure is accurate (the code in replica.go). Can someone else vet that part?

Reviewed 7 of 7 files at r1.
Review status: all files reviewed at latest revision, all discussions resolved, all commit checks successful.

maddyblue · 2018-04-02T17:55:47Z

@nvanbenschoten can you review the replica.go change?

nvanbenschoten · 2018-04-02T19:26:56Z

I'm happy to see that error become structured.

Review status: all files reviewed at latest revision, all discussions resolved, all commit checks successful.

maddyblue requested review from jordanlewis, vivekmenezes and a team March 28, 2018 18:07

maddyblue changed the title ~~sql: fail schema changes on internal errors~~ sql: fail schema changes on GC threshold errors Mar 29, 2018

maddyblue requested a review from nvanbenschoten April 2, 2018 17:55

maddyblue merged commit 9ba6a6b into cockroachdb:master Apr 2, 2018

maddyblue deleted the schema-gc branch April 2, 2018 19:29

maddyblue mentioned this pull request Apr 2, 2018

backport-2.0: sql: fail schema changes on GC threshold errors #24427

Merged
