sql/schemachanger: implement `DROP COLUMN` #84563

ajwerner · 2022-07-18T12:24:34Z

See individual commits.

Fixes #84072

cockroach-teamcity · 2022-07-18T12:24:47Z

This change is

ajwerner · 2022-07-18T12:25:07Z

The last commit should be broken up into a few. I'll do that later after we see how CI is doing.

ajwerner · 2022-07-18T14:32:11Z

Sent out the changefeed changes in #84571 and the testing fix in #84575.

postamar

I did a review pass on just the last commit. Nice work. This is certainly coming along! Very exciting.

Reviewed 74 of 74 files at r4, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner and @postamar)

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 68 at r4 (raw file):

) {
	var rowLevelTTL *scpb.RowLevelTTL
	// TODO(ajwerner): Does this need to look at status or target status?

I would look at status if only because target status may be changed by the builder. I have no idea how consistent we are about this. Looking ahead, we need a better way to express queries on current state.

pkg/sql/schemachanger/scplan/internal/opgen/opgen_primary_index.go line 68 at r4 (raw file):

					return &scop.MakeMergedIndexWriteOnly{
						TableID: this.TableID,
						IndexID: this.IndexID,

I assume this extra op-edge was necessary for the rollbacks to work? What of the writes occurring between MERGED and WRITE_ONLY, won't they be lost?

pkg/sql/schemachanger/scplan/internal/rules/dep_index_and_column.go line 830 at r4 (raw file):

			}
		},
	)

I don't understand this rule. Shouldn't this be: the column can only reach WRITE_ONLY if the indexes can all be written to, in other words when the temp indexes are all WRITE_ONLY and the other indexes are also WRITE_ONLY?

ajwerner

I've pushed again and addressed the concrete feedback. After the changefeed changes make their way through bors and CI is happy (which I think it should be), I'll take a stab at breaking this out into a few more commits. I do think it's quite close.

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @postamar)

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 68 at r4 (raw file):

Previously, postamar (Marius Posta) wrote…

I would look at status if only because target status may be changed by the builder. I have no idea how consistent we are about this. Looking ahead, we need a better way to express queries on current state.

Done.

pkg/sql/schemachanger/scplan/internal/opgen/opgen_primary_index.go line 68 at r4 (raw file):

Previously, postamar (Marius Posta) wrote…

I assume this extra op-edge was necessary for the rollbacks to work? What of the writes occurring between MERGED and WRITE_ONLY, won't they be lost?

This is necessary for very subtle reasons I remembered about only recently. We've had bugs we've been carrying around. I need to split this off into more commits to give some more motivation. This comment is revelatory:

cockroach/pkg/sql/backfill.go

Lines 1962 to 2039 in 116c5aa

    
           // backfillIndexes fills the missing columns in the indexes of the 
        
           // leased tables. 
        
           // 
        
           // 
        
           // If temporaryIndexes is non-empty, we assume that we are using the 
        
           // MVCC-compatible backfilling process. This mutation has already been 
        
           // checked to ensure all newly added indexes are using one type of 
        
           // index backfill. 
        
           // 
        
           // The MVCC-compatible index backfilling process has a goal of not 
        
           // having to issue AddSStable requests with backdated timestamps. 
        
           // 
        
           // To do this, we backfill new indexes while they are in a BACKFILLING 
        
           // state in which they do not see writes or deletes. While the 
        
           // backfill is running a temporary index captures all inflight rights. 
        
           // 
        
           // When the backfill is completed, the backfilling index is stepped up 
        
           // to MERGING and then writes and deletes missed during 
        
           // the backfill are merged from the temporary index. 
        
           // 
        
           // Finally, the new index is brought into the DELETE_AND_WRITE_ONLY 
        
           // state for validation. 
        
           // 
        
           //           ┌─────────────────┐         ┌─────────────────┐             ┌─────────────────┐ 
        
           //           │                 │         │                 │             │                 │ 
        
           //           │   PrimaryIndex  │         │     NewIndex    │             │    TempIndex    │ 
        
           // t0        │     (PUBLIC)    │         │  (BACKFILLING)  │             │  (DELETE_ONLY)  │ 
        
           //           │                 │         │                 │             │                 │ 
        
           //           └─────────────────┘         └─────────────────┘             └────────┬────────┘ 
        
           //                                                                                │ 
        
           //                                                                       ┌────────▼────────┐ 
        
           //                                                                       │                 │ 
        
           //                                                                       │    TempIndex    │ 
        
           // t1                                                                    │(DELETE_AND_WRITE)   │ 
        
           //                                                                       │                 │   │ 
        
           //                                                                       └────────┬────────┘   │ 
        
           //                                                                                │            │ 
        
           //           ┌─────────────────┐         ┌─────────────────┐             ┌────────▼────────┐   │ TempIndex receiving writes 
        
           //           │                 │         │                 │             │                 │   │ 
        
           //           │  PrimaryIndex   ├────────►│     NewIndex    │             │    TempIndex    │   │ 
        
           // t2        │   (PUBLIC)      │ Backfill│  (BACKFILLING)  │             │(DELETE_AND_WRITE│   │ 
        
           //           │                 │         │                 │             │                 │   │ 
        
           //           └─────────────────┘         └────────┬────────┘             └─────────────────┘   │ 
        
           //                                                │                                            │ 
        
           //                                       ┌────────▼────────┐                                   │ 
        
           //                                       │                 │                                   │ 
        
           //                                       │     NewIndex    │                                   │ 
        
           // t3                                    │  (DELETE_ONLY)  │                                   │ 
        
           //                                       │                 │                                   │ 
        
           //                                       └────────┬────────┘                                   │ 
        
           //                                                │                                            │ 
        
           //                                       ┌────────▼────────┐                                   │ 
        
           //                                       │                 │                                   │ 
        
           //                                       │     NewIndex    │                                   │   │ 
        
           //                                       │    (MERGING)    │                                   │   │ 
        
           // t4                                    │                 │                                   │   │ NewIndex receiving writes 
        
           //                                       └─────────────────┘                                   │   │ 
        
           //                                                                                             │   │ 
        
           //                                       ┌─────────────────┐             ┌─────────────────┐   │   │ 
        
           //                                       │                 │             │                 │   │   │ 
        
           //                                       │     NewIndex    │◄────────────┤    TempIndex    │   │   │ 
        
           // t5                                    │    (MERGING)    │  BatchMerge │(DELETE_AND_WRITE│   │   │ 
        
           //                                       │                 │             │                 │   │   │ 
        
           //                                       └────────┬────────┘             └───────┬─────────┘   │   │ 
        
           //                                                │                              │             │   │ 
        
           //                                       ┌────────▼────────┐             ┌───────▼─────────┐   │   │ 
        
           //                                       │                 │             │                 │   │   │ 
        
           //                                       │     NewIndex    │             │    TempIndex    │       │ 
        
           // t6                                    │(DELETE_AND_WRITE)             │  (DELETE_ONLY)  │       │ 
        
           //                                       │                 │             │                 │       │ 
        
           //                                       └───────┬─────────┘             └───────┬─────────┘       │ 
        
           //                                               │                               │ 
        
           //                                               │                               │ 
        
           //                                               ▼                               ▼ 
        
           //                                    [validate and make public]             [ dropped ] 
        
           // 
        
           // This operates over multiple goroutines concurrently and is thus not 
        
           // able to reuse the original kv.Txn safely.

The issue is that in Merging we do not enforce uniqueness constraints. We need to be enforcing uniqueness constraints before we go on to validating the uniqueness constraint. This enforcement happens through the use of a CPut, which we do not use in Merging here:

cockroach/pkg/sql/catalog/table_elements.go

Lines 204 to 221 in 116c5aa

    
           // ForcePut, if true, forces all writes to use Put rather than CPut or InitPut. 
        
           // 
        
           // Users of this options should take great care as it 
        
           // effectively mean unique constraints are not respected. 
        
           // 
        
           // Currently (2022-01-19) this two users: delete preserving 
        
           // indexes and merging indexes. 
        
           // 
        
           // Delete preserving encoding indexes are used only as a log of 
        
           // index writes during backfill, thus we can blindly put values into 
        
           // them. 
        
           // 
        
           // New indexes may miss updates during the backfilling process 
        
           // that would lead to CPut failures until the missed updates 
        
           // are merged into the index. Uniqueness for such indexes is 
        
           // checked by the schema changer before they are brought back 
        
           // online. 
        
           ForcePut() bool

pkg/sql/schemachanger/scplan/internal/rules/dep_index_and_column.go line 830 at r4 (raw file):

Previously, postamar (Marius Posta) wrote…

I don't understand this rule. Shouldn't this be: the column can only reach WRITE_ONLY if the indexes can all be written to, in other words when the temp indexes are all WRITE_ONLY and the other indexes are also WRITE_ONLY?

I think the temp indexes condition would be sufficient. This one is also sufficient. The deal is that we need to make sure that we are writing values of the column somewhere before we start doing the backfill. I have rewritten the rule to be about the temporary index and added commentary.

84306: asim: add load splits r=kvoli a=kvoli This patch adds load based splitting to the allocation simulator. It uses the production code path, `pkg/kv/kvserver/split`, to decide when and which key to split on. To enable split recommendations from this package, load events are recorded to the splitter and split suggestions enqueued into the simulator split queue. Split keys are likewise found via consulting the split decider first and when not found, the split queue wil instead split evenly (50/50) on the number of keys instead. resolves #82630 Release note: None 84451: dev,genbzl: add support for generating syntax diagrams r=ajwerner a=ajwerner Fixes #84443. Release note: None 84571: changefeedccl: prerequisite changes for `DROP COLUMN` r=ajwerner a=ajwerner This is the first two commits from #84563. They are needed to ensure that we don't change the behavior of `DROP COLUMN` when we support it in the declarative schema changer. The issue is that there the protocol is to create a new primary index and swap to it. The column becomes a non-public before the index swap, so the primary index swap is no longer a logical schema change of any kind. With this change, we can now detect that and properly restart as opposed to stop. Also, the newly added testing uncovers some badness in how we classify some other schema changes, and should generally be useful. Co-authored-by: Austen McClernon <[email protected]> Co-authored-by: Andrew Werner <[email protected]>

ajwerner · 2022-07-18T23:36:32Z

This is RFAL. I broke this apart into individual commits. I think that was a good exercise. Only really the fourth commit (
sql/rowenc: compute primary index encoding columns correctly) is of real interest to outside teams. I was disappointed to discover the lack of testing in that area, but haven't done anything to make it better.

mgartner

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @livlobo, and @postamar)

pkg/sql/rowenc/index_encoding.go line 1115 at r8 (raw file):

// getStoredColumnsForPrimaryIndex computes the set of columns stored in this
// primary index's value for encoding which are not composite columns. The

So what do we do with composite columns then?

ajwerner

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @livlobo, @mgartner, and @postamar)

pkg/sql/rowenc/index_encoding.go line 1115 at r8 (raw file):

Previously, mgartner (Marcus Gartner) wrote…

So what do we do with composite columns then?

Have a look at where the output of this thing is used. The composite logic remains intact.

ajwerner

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @livlobo, @mgartner, and @postamar)

pkg/sql/rowenc/index_encoding.go line 1115 at r8 (raw file):

Previously, ajwerner wrote…

Have a look at where the output of this thing is used. The composite logic remains intact.

Oh, I see, the comment is bad. I'll update it.

ajwerner

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @livlobo, @mgartner, and @postamar)

pkg/sql/rowenc/index_encoding.go line 1115 at r8 (raw file):

Previously, ajwerner wrote…

Oh, I see, the comment is bad. I'll update it.

Alright, I've reworded the comment. Hopefully it's clearer now.

mgartner

Fourth and fifth commit LGTM.

Reviewed 1 of 1 files at r9, 2 of 2 files at r10.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @livlobo and @postamar)

pkg/sql/rowenc/index_encoding.go line 1115 at r8 (raw file):

Previously, ajwerner wrote…

Alright, I've reworded the comment. Hopefully it's clearer now.

Brilliant, thank you.

fqazi

For end-to-end testing are we comfortable adding multistatement transactions? i.e. Two drop columns, I know we can't do a mix of ADD/DROP yet?

Looks good, but didn't give enough time to the builder and some test stuff. I'll do that tomorrow

Reviewed 2 of 74 files at r4, 82 of 82 files at r5, 28 of 28 files at r6, 5 of 5 files at r7, 1 of 1 files at r9, 2 of 2 files at r10, 35 of 35 files at r11, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @livlobo, and @postamar)

pkg/sql/rowenc/index_encoding.go line 1131 at r9 (raw file):

	// columns set is not populated and instead we defer to the colMap to compute
	// the complete set before subtracting the key columns.
	if index.GetVersion() < descpb.PrimaryIndexWithStoredColumnsVersion {

I guess we will infinitely support older encoding types? Until indexes get recreated.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 164 at r11 (raw file):

		b.Drop(cc)
	}
	handleDropColumnDefaultExpression(b, col, colElts, behavior)

This pattern is much cleaner than what we have in add column, we should do the same there in the future.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/helpers.go line 307 at r11 (raw file):

}

func toPublicNotCurrentlyPublicFilter(

Would it make sense to just have a routine that returns a filter closure?

statusAndTargetFilter( scpb.Status_PUBLIC, scpb.ToPublic)
statusFilter(...)

Just a thought more cosmetic I think.

pkg/sql/schemachanger/scexec/scmutationexec/column.go line 123 at r6 (raw file):

		*(protoutil.Clone(mut.GetColumn())).(*descpb.ColumnDescriptor))

	// Ensure that the column is added in the right location. This is important

Is there a reason we couldn't push this ordering into the dep rules? I guess removing or adding stuff in the middle?

pkg/sql/schemachanger/scplan/internal/rules/dep_index_and_column.go line 503 at r6 (raw file):

	// a column which does not exist. This would lead to panics inside the
	// optimizer and an invalid table descriptor.
	registerDepRule("indexes containing columns reach absent before column",

I guess validation would also catch this? Or it might be something should check for?

ajwerner

For end-to-end testing are we comfortable adding multistatement transactions? i.e. Two drop columns, I know we can't do a mix of ADD/DROP yet?

I added a few of these types of tests. They were good. Caught a bug with improperly sorting the statements. Also figured out I had not handled dropping the OnUpdate expression. It was good to flex the new data-driven explains as well.

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @fqazi, @livlobo, and @postamar)

pkg/sql/rowenc/index_encoding.go line 1131 at r9 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

I guess we will infinitely support older encoding types? Until indexes get recreated.

Yep. At least we know they aren't created by the declarative schema changer.

pkg/sql/schemachanger/scexec/scmutationexec/column.go line 123 at r6 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

Is there a reason we couldn't push this ordering into the dep rules? I guess removing or adding stuff in the middle?

We won't necessarily have elements in the plan for the other columns which aren't being dropped. Consider the case of rolling back the drop column of the first column in the table.

pkg/sql/schemachanger/scplan/internal/rules/dep_index_and_column.go line 503 at r6 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

I guess validation would also catch this? Or it might be something should check for?

I'll add the validation in a separate PR.

Release note: None

This commit ensures that the correct primary index entry is encoded when building an entry for a primary index encoded index which does not include every column in the table. This is the case when dropping a column using the declarative schema changer. Release note: None

These hard-coded index IDs embed assumptions which will be violated by the new DROP COLUMN protocol. Release note: None

This is the initial implementation of `ALTER TABLE ... DROP COLUMN` in the declarative schema changer. It explicitly does not have the case of adding and dropping columns from the same table in the same transaction. It also omits: 1) Dropping columns related to multi-region tables 2) Dropping columns with row-level TTLs 3) Dropping columns involved in UNIQUE WITHOUT INDEX constraints 4) Dropping columns involved in CHECK constraints 5) Dropping columns involved in FOREIGN KEY constraints Fixes cockroachdb#84072. Release note: None

ajwerner

This is RFAL

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @fqazi, @livlobo, @mgartner, and @postamar)

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/helpers.go line 307 at r11 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

Would it make sense to just have a routine that returns a filter closure?

statusAndTargetFilter( scpb.Status_PUBLIC, scpb.ToPublic)
statusFilter(...)

Just a thought more cosmetic I think.

I didn't do anything in this commit but I'm interested in improvements here in future commits.

Xiang-Gu · 2022-07-20T14:28:28Z

pkg/sql/schemachanger/scplan/internal/opgen/opgen_primary_index.go

+			// The transition from MERGED to WRITE_ONLY must precede index validation.
+			// In MERGE_ONLY and MERGED, the index receives writes, but writes do not
+			// enforce uniqueness (they don't use CPut, see ForcePut). In WRITE_ONLY,
+			// the index receives normal writes. Only once writes are enforcing the
+			// uniqueness constraint for new can we validate that the index does
+			// indeed represent a valid uniqueness constraint over all rows.
+			to(scpb.Status_WRITE_ONLY,
+				emit(func(this *scpb.PrimaryIndex) *scop.MakeMergedIndexWriteOnly {
+					return &scop.MakeMergedIndexWriteOnly{
+						TableID: this.TableID,
+						IndexID: this.IndexID,
+					}
+				}),
+			),


This is a big change! I previously don't know that an index that is in MERGE_ONLY or MERGED can still receive writes. What is the concern of checking uniqness of writes during merging previously (when we transition the primary index to WRITE_ONLY before merging with the temporary index)?

While the index is in MERGING in the table descriptor, write to it use Put instead of CPut. See ForcePut on catalog.Index.

Consider the following timeline:

create table t (i int primary key, j int);
insert into t values (1, 1);
create unique index idx on t(j);

t1: index merging concludes
at this point the unique index contains:
1: 1

t2: validation begins by reading a snapshot at t2 to count that the number of rows in the unique index is equal to the number of rows in the primary index
t3: insert into t values (2, 1) -- this will lead to the unique index now only containing 1: 2 instead of 1: 1 and the force from MERGING means the write will succeed.
t4: Validation completes successfully and the index is moved to public

At this point we have a corrupt unique secondary index.

If, however, we moved from MERGING to WRITE_ONLY before proceeding with validation, the insert operation would fail and the index will remain consistent.

Xiang-Gu · 2022-07-20T14:28:32Z

pkg/sql/schemachanger/scplan/internal/opgen/opgen_secondary_index.go

@@ -46,14 +46,13 @@ func init() {
 				}),
 			),
 			to(scpb.Status_MERGE_ONLY,


This is a learning question: everytime I think of merging (with a temporary index), it's always been on a (new) primary index which is intended to be swapped with the old primary index, and that happens quite often (ADD COLUMN, DROP COLUMN, ALTER PRIMARY KEY, etc. As long as the primary index needs to be modified, we use this primary index swap technique). But here, I'm surprised to learn that secondary indexes also go through the same status, so I start to wonder in what case will we ever do this backfill-then-merge on a secondary index?

We always need to merge concurrent writes into an index when backfilling as of 22.1. The reasons for this are somewhat surprising and relate to history rewriting. See this rfc: https://github.com/cockroachdb/cockroach/blob/master/docs/RFCS/20211004_incremental_index_backfiller.md

Xiang-Gu · 2022-07-20T14:28:36Z

pkg/sql/schemachanger/scplan/internal/rules/dep_index_and_column.go

+				rel.Filter("columnTypeIsNotBeingDropped", ct)(func(
+					ct *scpb.ColumnType,
+				) bool {
+					return !ct.IsRelationBeingDropped
+				}),


I don't quite understand the purpose of ct here, which is a ColumnType element?

Why do we need this clause that uses the isRelationBeingDropped field, which I thought was deemed "hacky"?

I think this is needed to gracefully handle DROP TABLE. IIRC we wanted to replace this filter (and thus remove this field) by using better rule constructs, something along the lines of "this rule only applies on this column if the parent table isn't present in the rel-database with an ABSENT target".

Xiang-Gu · 2022-07-20T14:28:39Z

pkg/sql/schemachanger/scplan/internal/rules/dep_index_and_column.go

+
+	// We need to ensure that the temporary index has all the relevant writes
+	// to any columns it contains. We ensure elsewhere that any index which
+	// will later be merged with the temporary index is not backfilled until
+	// that temporary index is receiving writes. This rule ensures that those
+	// write operations contain data for all columns.
+	registerDepRule(
+		"column is WRITE_ONLY before temporary index is WRITE_ONLY",
+		scgraph.Precedence,
+		"column", "index",
+		func(from, to nodeVars) rel.Clauses {
+			return rel.Clauses{
+				from.el.Type((*scpb.Column)(nil)),
+				to.el.Type((*scpb.TemporaryIndex)(nil)),
+				indexContainsColumn(to.el, from.el, "index-column", "table-id", "column-id", "index-id"),
+				targetStatus(from.target, scpb.ToPublic),
+				targetStatus(to.target, scpb.Transient),
+				currentStatus(from.node, scpb.Status_WRITE_ONLY),
+				currentStatus(to.node, scpb.Status_WRITE_ONLY),
+			}
+		},
+	)
+


This rule seems to be relevant when adding a column (so we insist this newly added column reaches WRITE_ONLY before the temporary reaches WRITE_ONLY). I'm curious how you discover the need to add such a dep rule when working on DROP COLUMN

You're right that this is unrelated to drop column. I don't remember where this came up, but there was a case I ran into where this mattered. I think it was that I removed the WRITE_ONLY state from "column depends on primary index" because it broke rollbacks of drop column, at which point in the add column case now it was valid to defer moving the column to WRITE_ONLY until later which caused some other problem.

fqazi

Looks good just a few minor questions from me. I think most of those are non-issues.

Reviewed 71 of 71 files at r12, 28 of 28 files at r13, 5 of 5 files at r14, 1 of 1 files at r15, 2 of 2 files at r16, 49 of 49 files at r17, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @fqazi, @livlobo, @postamar, and @Xiang-Gu)

pkg/sql/schemachanger/BUILD.bazel line 12 at r17 (raw file):

go_test(
    name = "schemachanger_test",
    size = "large",

I guess the drop stuff ballooned the length of this test, hopefully we can get this down with parallelism.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 282 at r17 (raw file):

		}
		if ic.Kind != scpb.IndexColumn_STORED {
			panic(errors.AssertionFailedf("can only drop columns which are stored in the primary index, this one is %v ",

Are we surfacing the right error here? Or should this be "column %q is referenced by the primary key". Just sanity checking

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 353 at r17 (raw file):

	for i := n + 1; i < len(storedColumns); i++ {
		storedColumns[i].OrdinalInKind--
		b.Add(storedColumns[i])

I think this is fine because it will just replace the existing element based on the remaining keys. I'm guessing the ops will do the heavy lifting here since the target state will stay the same

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 374 at r17 (raw file):

		return
	}
	// TODO(ajwerner): Support dropping UNIQUE WITHOUT INDEX constraints.

We might just want a checklist-type Github issue for these. As we are doing this type of stuff over time some of these cases will take longer to address.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 458 at r17 (raw file):

func handleDropColumnExpressions(b BuildCtx, colElts ElementResultSet, behavior tree.DropBehavior) {
	publicTargets := colElts.Filter(publicTargetFilter)

Does it make sense for the caller to give a filtered set of elements?

pkg/sql/schemachanger/scexec/scmutationexec/column.go line 123 at r6 (raw file):

Previously, ajwerner wrote…

We won't necessarily have elements in the plan for the other columns which aren't being dropped. Consider the case of rolling back the drop column of the first column in the table.

Ooh right

ajwerner

@fqazi other than the passing of the public elements are there any changes requested? Can I defer that?

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @fqazi, @livlobo, @postamar, and @Xiang-Gu)

pkg/sql/schemachanger/BUILD.bazel line 12 at r17 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

I guess the drop stuff ballooned the length of this test, hopefully we can get this down with parallelism.

Indeed. I'm waiting on Ricky's logictest change before doing something.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 282 at r17 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

Are we surfacing the right error here? Or should this be "column %q is referenced by the primary key". Just sanity checking

This is an assertion failure because we already checked whether it was a primary key column earlier in the dropColumn function in checkColumnNotInPrimaryKey. I don't expect to see this error.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 353 at r17 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

I think this is fine because it will just replace the existing element based on the remaining keys. I'm guessing the ops will do the heavy lifting here since the target state will stay the same

Yes, we'll overwrite the existing element being added.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 374 at r17 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

We might just want a checklist-type Github issue for these. As we are doing this type of stuff over time some of these cases will take longer to address.

Yes, I'll create one.

pkg/sql/schemachanger/scbuild/internal/scbuildstmt/alter_table_drop_column.go line 458 at r17 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

Does it make sense for the caller to give a filtered set of elements?

I suppose, but I'm not eager to wait for another CI cycle if you're amenable.

fqazi

Yes you can defer that

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @ajwerner, @fqazi, @livlobo, @postamar, and @Xiang-Gu)

ajwerner · 2022-07-20T17:20:50Z

bors r+

craig · 2022-07-20T18:47:54Z

Build succeeded:

GitHub CI (Cockroach)

ajwerner force-pushed the ajwerner/declarative-drop-column branch from 2ccef8e to 9f59c14 Compare July 18, 2022 14:22

ajwerner mentioned this pull request Jul 18, 2022

changefeedccl: prerequisite changes for DROP COLUMN #84571

Merged

postamar reviewed Jul 18, 2022

View reviewed changes

ajwerner force-pushed the ajwerner/declarative-drop-column branch 4 times, most recently from 98c6c12 to d63e101 Compare July 18, 2022 18:28

ajwerner commented Jul 18, 2022

View reviewed changes

ajwerner force-pushed the ajwerner/declarative-drop-column branch 2 times, most recently from 1a32dee to 0897dc5 Compare July 18, 2022 23:33

ajwerner marked this pull request as ready for review July 18, 2022 23:33

ajwerner requested a review from a team July 18, 2022 23:33

ajwerner requested review from a team as code owners July 18, 2022 23:33

ajwerner requested review from livlobo and removed request for a team July 18, 2022 23:34

mgartner reviewed Jul 19, 2022

View reviewed changes

ajwerner commented Jul 19, 2022

View reviewed changes

ajwerner force-pushed the ajwerner/declarative-drop-column branch from 0897dc5 to 5785ab8 Compare July 19, 2022 00:18

ajwerner commented Jul 19, 2022

View reviewed changes

mgartner reviewed Jul 19, 2022

View reviewed changes

fqazi requested changes Jul 19, 2022

View reviewed changes

ajwerner force-pushed the ajwerner/declarative-drop-column branch from 5785ab8 to b51f38a Compare July 19, 2022 13:55

ajwerner commented Jul 19, 2022

View reviewed changes

ajwerner force-pushed the ajwerner/declarative-drop-column branch from b51f38a to f755e6a Compare July 19, 2022 16:26

ajwerner added 6 commits July 19, 2022 14:56

sql/row: pretty print CPut expectation for kv trace

2a48ce9

Release note: None

sql/schemachanger: properly transition indexes to WRITE_ONLY, add rules

a3da7b0

Release note: None

sql,sqlerrors: extract some errors for reuse

b20acd8

Release note: None

sql/logictest,execbuilder: update logic tests to avoid index IDs

89422e8

These hard-coded index IDs embed assumptions which will be violated by the new DROP COLUMN protocol. Release note: None

ajwerner force-pushed the ajwerner/declarative-drop-column branch from f755e6a to 07ce7b1 Compare July 19, 2022 18:57

ajwerner commented Jul 19, 2022

View reviewed changes

Xiang-Gu reviewed Jul 20, 2022

View reviewed changes

fqazi requested changes Jul 20, 2022

View reviewed changes

ajwerner commented Jul 20, 2022

View reviewed changes

fqazi approved these changes Jul 20, 2022

View reviewed changes

craig bot merged commit f650c14 into cockroachdb:master Jul 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sql/schemachanger: implement `DROP COLUMN` #84563

sql/schemachanger: implement `DROP COLUMN` #84563

ajwerner commented Jul 18, 2022 •

edited

Loading

cockroach-teamcity commented Jul 18, 2022

ajwerner commented Jul 18, 2022

ajwerner commented Jul 18, 2022

postamar left a comment •

edited

Loading

ajwerner left a comment

ajwerner commented Jul 18, 2022

mgartner left a comment

ajwerner left a comment

ajwerner left a comment

ajwerner left a comment

mgartner left a comment

fqazi left a comment

ajwerner left a comment

ajwerner left a comment

Xiang-Gu Jul 20, 2022

ajwerner Jul 20, 2022

Xiang-Gu Jul 20, 2022

ajwerner Jul 20, 2022

Xiang-Gu Jul 20, 2022

postamar Jul 20, 2022

Xiang-Gu Jul 20, 2022

ajwerner Jul 20, 2022

fqazi left a comment

ajwerner left a comment

fqazi left a comment

ajwerner commented Jul 20, 2022

craig bot commented Jul 20, 2022

	// backfillIndexes fills the missing columns in the indexes of the
	// leased tables.
	//
	//
	// If temporaryIndexes is non-empty, we assume that we are using the
	// MVCC-compatible backfilling process. This mutation has already been
	// checked to ensure all newly added indexes are using one type of
	// index backfill.
	//
	// The MVCC-compatible index backfilling process has a goal of not
	// having to issue AddSStable requests with backdated timestamps.
	//
	// To do this, we backfill new indexes while they are in a BACKFILLING
	// state in which they do not see writes or deletes. While the
	// backfill is running a temporary index captures all inflight rights.
	//
	// When the backfill is completed, the backfilling index is stepped up
	// to MERGING and then writes and deletes missed during
	// the backfill are merged from the temporary index.
	//
	// Finally, the new index is brought into the DELETE_AND_WRITE_ONLY
	// state for validation.
	//
	// ┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
	// │ │ │ │ │ │
	// │ PrimaryIndex │ │ NewIndex │ │ TempIndex │
	// t0 │ (PUBLIC) │ │ (BACKFILLING) │ │ (DELETE_ONLY) │
	// │ │ │ │ │ │
	// └─────────────────┘ └─────────────────┘ └────────┬────────┘
	// │
	// ┌────────▼────────┐
	// │ │
	// │ TempIndex │
	// t1 │(DELETE_AND_WRITE) │
	// │ │ │
	// └────────┬────────┘ │
	// │ │
	// ┌─────────────────┐ ┌─────────────────┐ ┌────────▼────────┐ │ TempIndex receiving writes
	// │ │ │ │ │ │ │
	// │ PrimaryIndex ├────────►│ NewIndex │ │ TempIndex │ │
	// t2 │ (PUBLIC) │ Backfill│ (BACKFILLING) │ │(DELETE_AND_WRITE│ │
	// │ │ │ │ │ │ │
	// └─────────────────┘ └────────┬────────┘ └─────────────────┘ │
	// │ │
	// ┌────────▼────────┐ │
	// │ │ │
	// │ NewIndex │ │
	// t3 │ (DELETE_ONLY) │ │
	// │ │ │
	// └────────┬────────┘ │
	// │ │
	// ┌────────▼────────┐ │
	// │ │ │
	// │ NewIndex │ │ │
	// │ (MERGING) │ │ │
	// t4 │ │ │ │ NewIndex receiving writes
	// └─────────────────┘ │ │
	// │ │
	// ┌─────────────────┐ ┌─────────────────┐ │ │
	// │ │ │ │ │ │
	// │ NewIndex │◄────────────┤ TempIndex │ │ │
	// t5 │ (MERGING) │ BatchMerge │(DELETE_AND_WRITE│ │ │
	// │ │ │ │ │ │
	// └────────┬────────┘ └───────┬─────────┘ │ │
	// │ │ │ │
	// ┌────────▼────────┐ ┌───────▼─────────┐ │ │
	// │ │ │ │ │ │
	// │ NewIndex │ │ TempIndex │ │
	// t6 │(DELETE_AND_WRITE) │ (DELETE_ONLY) │ │
	// │ │ │ │ │
	// └───────┬─────────┘ └───────┬─────────┘ │
	// │ │
	// │ │
	// ▼ ▼
	// [validate and make public] [ dropped ]
	//
	// This operates over multiple goroutines concurrently and is thus not
	// able to reuse the original kv.Txn safely.

	// ForcePut, if true, forces all writes to use Put rather than CPut or InitPut.
	//
	// Users of this options should take great care as it
	// effectively mean unique constraints are not respected.
	//
	// Currently (2022-01-19) this two users: delete preserving
	// indexes and merging indexes.
	//
	// Delete preserving encoding indexes are used only as a log of
	// index writes during backfill, thus we can blindly put values into
	// them.
	//
	// New indexes may miss updates during the backfilling process
	// that would lead to CPut failures until the missed updates
	// are merged into the index. Uniqueness for such indexes is
	// checked by the schema changer before they are brought back
	// online.
	ForcePut() bool

sql/schemachanger: implement DROP COLUMN #84563

sql/schemachanger: implement DROP COLUMN #84563

Conversation

ajwerner commented Jul 18, 2022 • edited Loading

cockroach-teamcity commented Jul 18, 2022

ajwerner commented Jul 18, 2022

ajwerner commented Jul 18, 2022

postamar left a comment • edited Loading

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

ajwerner commented Jul 18, 2022

mgartner left a comment

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

mgartner left a comment

Choose a reason for hiding this comment

fqazi left a comment

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fqazi left a comment

Choose a reason for hiding this comment

ajwerner left a comment

Choose a reason for hiding this comment

fqazi left a comment

Choose a reason for hiding this comment

ajwerner commented Jul 20, 2022

craig bot commented Jul 20, 2022

sql/schemachanger: implement `DROP COLUMN` #84563

sql/schemachanger: implement `DROP COLUMN` #84563

ajwerner commented Jul 18, 2022 •

edited

Loading

postamar left a comment •

edited

Loading