Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix NullFilter getDimensionRangeSet. #15500

Merged
merged 1 commit into from
Dec 6, 2023

Conversation

gianm
Copy link
Contributor

@gianm gianm commented Dec 6, 2023

It wasn't checking the column name, so it would return a domain regardless of the input column. This means that null filters on datasources with range partitioning could lead to excessive pruning of segments, and therefore missing results.

The null filter was introduced in #14542, meaning this query correctness bug has existed since 27.0. It will likely be more commonly hit in 28.0 because the null handling mode is on by default (and therefore this filter is generated by SQL planning by default) since #14792.

It wasn't checking the column name, so it would return a domain regardless
of the input column. This means that null filters on datasources with range
partitioning would lead to excessive pruning of segments, and therefore
missing results.
@gianm gianm added the Bug label Dec 6, 2023
@gianm gianm added this to the 28.0.1 milestone Dec 6, 2023
@LakshSingla LakshSingla merged commit 6f51155 into apache:master Dec 6, 2023
83 checks passed
@gianm gianm deleted the fix-null-filter-drs branch December 6, 2023 14:55
LakshSingla pushed a commit to LakshSingla/druid that referenced this pull request Dec 7, 2023
It wasn't checking the column name, so it would return a domain regardless
of the input column. This means that null filters on data sources with range
partitioning would lead to excessive pruning of segments, and therefore
missing results.
vogievetsky pushed a commit that referenced this pull request Dec 7, 2023
It wasn't checking the column name, so it would return a domain regardless
of the input column. This means that null filters on data sources with range
partitioning would lead to excessive pruning of segments, and therefore
missing results.

Co-authored-by: Gian Merlino <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants