[Replicated] release-23.1: colexec: fix type schema corruption in an edge case #119
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Replicated from original PR cockroachdb#133758
Original author: blathers-crl[bot]
Original creation date: 2024-10-30T00:26:10Z
Original reviewers: mgartner, yuzefovich
Original description:
Backport 1/1 commits from cockroachdb#133624 on behalf of @yuzefovich.
/cc @cockroachdb/release
This commit fixes type schema corruption in the vectorized engine in an edge case. In particular, consider the following circumstances:
InputSyncSpec.ColumnTypes
) that we get from the previous stage. In other words, we might have memory aliasing, but only on the gateway node because the remote nodes get their specs deserialized and each has its own memory allocation.BatchSchemaSubsetEnforcer
).InputSyncSpec.ColumnTypes
.The bug is effectively the same as the comment deleted by 85fd4fb outlined:
The only differences are:
InputSyncSpec.ColumnTypes
that is often used as the starting points for populatingNewColOperatorResult.ColumnTypes
which is used throughout the vectorized operator planningThis commit addresses this issue by capping the slice to its length right before we get into the vectorized planning. This will make it so that if we need to append a type, then we'll make a fresh allocation, and any possible memory aliasing with a different stage of processors will be gone.
I haven't quite figured out the exact conditions that are needed for this bug to occur, but my intuition says that it should be quite rare in practice (otherwise we'd have seen this much sooner given that the offending commit was merged more than a year ago and was backported to older branches).
Fixes: cockroachdb#130402.
Release note (bug fix): Previously, CockroachDB could encounter an internal error of the form
interface conversion: coldata.Column is
in an edge case and this is now fixed. The bug is present in versions 22.2.13+, 23.1.9+, 23.2+.Release justification: bug fix.