Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

release-23.1: colexec: fix type schema corruption in an edge case #133758

Merged
merged 1 commit into from
Oct 30, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
19 changes: 19 additions & 0 deletions pkg/sql/colexec/colbuilder/execplan.go
Original file line number Diff line number Diff line change
Expand Up @@ -709,6 +709,25 @@ func NewColOperator(
result := opResult{NewColOperatorResult: colexecargs.GetNewColOperatorResult()}
r := result.NewColOperatorResult
spec := args.Spec
// Throughout this method we often use the type slice from the input spec to
// create the type schema of an operator. However, it is possible that the
// same type slice is shared by multiple stages of processors. If it just so
// happens that there is free capacity in the slice, and we append to it
// when planning operators for both stages, we might corrupt the type schema
// captured by the operators for the earlier stage. In order to prevent such
// type schema corruption we cap the slice to force creation of a fresh copy
// on the first append.
if flowCtx.Gateway {
// Sharing of the same type slice is only possible on the gateway node
// because we don't serialize the specs created during the physical
// planning. On the remote nodes each stage of processors gets their own
// allocation, so there is no aliasing that can lead to the type schema
// corruption.
for i := range spec.Input {
inputSpec := &spec.Input[i]
inputSpec.ColumnTypes = inputSpec.ColumnTypes[:len(inputSpec.ColumnTypes):len(inputSpec.ColumnTypes)]
}
}
inputs := args.Inputs
if args.Factory == nil {
// This code path is only used in tests.
Expand Down
44 changes: 44 additions & 0 deletions pkg/sql/logictest/testdata/logic_test/vectorize_types
Original file line number Diff line number Diff line change
Expand Up @@ -178,3 +178,47 @@ FROM
t107615
WHERE
(_bool OR (NOT _bool));

# Regression test for corrupting the column type schema captured by a vectorized
# operator due to sharing the same type slice by multiple stages of processors
# during physical planning (#130402).
statement ok
CREATE TABLE abcdef (
a STRING,
b INT4,
c INT4,
d INT4,
e STRING,
f STRING
);

statement ok
INSERT INTO abcdef (a, b, c, d, e, f) VALUES ('a', 0, 0, 0, 'e', 'f');

statement ok
CREATE TABLE ghijkl (
g INT4,
h INT4,
i BOOL,
j INT4,
k STRING,
l STRING
);

statement ok
INSERT INTO ghijkl (g, h, i, j, k, l) VALUES (0, 0, true, -1, 'k', 'l');

statement ok
SELECT c7, c1 >= (SELECT c FROM abcdef WHERE e != c3), c5
FROM (
SELECT
generate_series(g, d) AS c0,
h AS c1,
l AS c3,
k AS c5,
c AS c7
FROM ghijkl
LEFT JOIN abcdef ON b > j OR i NOT IN (e NOT IN (f, a),)
LIMIT 2
)
WHERE c0 NOT IN (SELECT NULL FROM ghijkl);