upgrade: improve delete dropped udf upgrade job efficiency #104512

chengxiong-ruan · 2023-06-07T15:00:19Z

Informs: https://github.com/cockroachlabs/support/issues/2364

Release note (performance improvement): this commit makes the delete descriptors of dropped functions upgrade job more efficient. It used to look at every single id until the max descriptor id which was very inefficient when the max descriptor id really large, in which case the upgrade job took very long even there was no function descriptor at all. This commit changed it to actually query upper bound id of each batch.

blathers-crl · 2023-06-07T15:00:23Z

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.}

cockroach-teamcity · 2023-06-07T15:00:31Z

This change is

andyyang890

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @chengxiong-ruan)

pkg/upgrade/upgrades/delete_descriptors_of_dropped_functions.go line 45 at r1 (raw file):

    FROM 
      system.descriptor
    WHERE id > $1

Just wondering, would it be a lot more inefficient to instead have the to_json CTE not have a WHERE clause and have to_delete maybe have something like LIMIT 50 on it? (Or maybe have the LIMIT 50 on the main SELECT itself.) And then we'd loop the query until no more rows are affected?

I still worry that looping over the descriptor IDs might be too slow and if these kind of descriptors are rare to begin with, somewhat inefficient still?

chengxiong-ruan · 2023-06-07T16:45:18Z

pkg/upgrade/upgrades/delete_descriptors_of_dropped_functions.go line 45 at r1 (raw file):

Just wondering, would it be a lot more inefficient to instead have the to_json CTE not have a WHERE clause and have to_delete maybe have something like LIMIT 50 on it? (Or maybe have the LIMIT 50 on the main SELECT itself.) And then we'd loop the query until no more rows are affected?

I think the optimizer would push the where clause down to to_json if we put it in the main SELECT (which do the delete).

And then we'd loop the query until no more rows are affected?

I think with this approach, we still need to track the progress somehow. Otherwise it would start from the very first descriptor if there're actually some function descriptors. Maybe we could have the delete query to return the IDs as well then still filter by IDs. But I think by explicitly looping through IDs, each small batch is light and won't scan too much data.

I still worry that looping over the descriptor IDs might be too slow and if these kind of descriptors are rare to begin with, somewhat inefficient still?

yeah, the thing is that we have to look at every single descriptor to tell if it's a function descriptor and then to tell if it's dropped. But I won't worry about the IDs looping as long as it only look at actual existing IDs. From what I saw by testing this new. upgrade with the 4 thousand descriptor debug zip, it's much much faster now.

andyyang890

just one small nit

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @chengxiong-ruan)

pkg/upgrade/upgrades/delete_descriptors_of_dropped_functions.go line 45 at r1 (raw file):

I think with this approach, we still need to track the progress somehow. Otherwise it would start from the very first descriptor if there're actually some function descriptors.

Ah I see, I guess because it's not as straightforward as DELETE FROM ..., you can't just use a LIMIT directly.

pkg/upgrade/upgrades/delete_descriptors_of_dropped_functions.go line 94 at r2 (raw file):

				batchSize,
			)
			batchMaxID := int64(tree.MustBeDInt(row[0]))

nit: this line should probably go after the err != nil check

Release note (performance improvement): this commit makes the `delete descriptors of dropped functions` upgrade job more efficient. It used to look at every single id until the max descriptor id which was very inefficient when the max descriptor id really large, in which case the upgrade job took very long even there was no function descriptor at all. This commit changed it to actually query upper bound id of each batch.

chengxiong-ruan · 2023-06-07T21:32:44Z

pkg/upgrade/upgrades/delete_descriptors_of_dropped_functions.go line 94 at r2 (raw file):

Previously, andyyang890 (Andy Yang) wrote…

nit: this line should probably go after the err != nil check

done. good catch!

chengxiong-ruan · 2023-06-08T01:21:23Z

tftr!
bors r+

craig · 2023-06-08T02:33:38Z

Build failed:

Bazel Essential CI (Cockroach)

chengxiong-ruan · 2023-06-08T13:25:28Z

bors r+

craig · 2023-06-08T13:54:30Z

Build succeeded:

Bazel Essential CI (Cockroach)

chengxiong-ruan requested a review from a team as a code owner June 7, 2023 15:00

chengxiong-ruan force-pushed the 20230607-more-efficient-udf-upgrade branch 2 times, most recently from 7a8e809 to f9adf43 Compare June 7, 2023 15:17

andyyang890 reviewed Jun 7, 2023

View reviewed changes

chengxiong-ruan force-pushed the 20230607-more-efficient-udf-upgrade branch from f9adf43 to c757411 Compare June 7, 2023 16:56

chengxiong-ruan added the backport-23.1.x Flags PRs that need to be backported to 23.1 label Jun 7, 2023

andyyang890 approved these changes Jun 7, 2023

View reviewed changes

chengxiong-ruan force-pushed the 20230607-more-efficient-udf-upgrade branch from c757411 to b20d5ac Compare June 7, 2023 21:32

craig bot merged commit fdf04ff into cockroachdb:master Jun 8, 2023

blathers-crl bot mentioned this pull request Jun 8, 2023

release-23.1: upgrade: improve delete dropped udf upgrade job efficiency #104590

Merged

cockroach-teamcity mentioned this pull request Jun 9, 2023

PR #104512 - upgrade: improve delete dropped udf upgrade job efficiency cockroachdb/docs#17224

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

upgrade: improve delete dropped udf upgrade job efficiency #104512

upgrade: improve delete dropped udf upgrade job efficiency #104512

chengxiong-ruan commented Jun 7, 2023

blathers-crl bot commented Jun 7, 2023

cockroach-teamcity commented Jun 7, 2023

andyyang890 left a comment

chengxiong-ruan commented Jun 7, 2023

andyyang890 left a comment

chengxiong-ruan commented Jun 7, 2023

chengxiong-ruan commented Jun 8, 2023

craig bot commented Jun 8, 2023

chengxiong-ruan commented Jun 8, 2023

craig bot commented Jun 8, 2023

upgrade: improve delete dropped udf upgrade job efficiency #104512

upgrade: improve delete dropped udf upgrade job efficiency #104512

Conversation

chengxiong-ruan commented Jun 7, 2023

blathers-crl bot commented Jun 7, 2023

cockroach-teamcity commented Jun 7, 2023

andyyang890 left a comment

Choose a reason for hiding this comment

chengxiong-ruan commented Jun 7, 2023

andyyang890 left a comment

Choose a reason for hiding this comment

chengxiong-ruan commented Jun 7, 2023

chengxiong-ruan commented Jun 8, 2023

craig bot commented Jun 8, 2023

chengxiong-ruan commented Jun 8, 2023

craig bot commented Jun 8, 2023