[native] Support system tables #21285

arhimondr · 2023-10-31T17:22:39Z

Description

Querying system tables with native execution enabled may result in incorrect result.

Motivation and Context

Native and Java based executions have different partitioning functions implementation hence queries such the one below may return incorrect result.

SELECT *
FROM  (SELECT DISTINCT regionkey FROM table) t 
INNER JOIN  (SELECT regionkey FROM table$partitions) p 
ON t.regionkey = p.regionkey

Impact

Incorrect results returned when querying system tables with native execution enabled

Test Plan

Integration test

Contributor checklist

Please make sure your submission complies with our development, formatting, commit message, and attribution guidelines.
PR description addresses the issue accurately and concisely. If the change is non-trivial, a GitHub Issue is referenced.
Documented new properties (with its default value), SQL syntax, functions, or other functionality.
If release notes are required, they follow the release notes guidelines.
Adequate tests were added if applicable.
CI passed.

Release Notes

Please follow release notes guidelines and fill in the release notes below.

If release note is NOT required, use:

== NO RELEASE NOTE ==

Currently not supported by native execution

arhimondr · 2023-10-31T17:23:51Z

The idea is to insert an extra "GATHER" exchange right on top of a TableScan of a system table. This will ensure partitioning function is applied by native worker consistently.

mbasmanova

@arhimondr Thank you, Andrii, for enabling queries that use system tables on Prestissimo and Presto-on-Spark on Velox.

arhimondr · 2023-11-01T13:59:48Z

@mbasmanova Happy to help

Presto on Spark (even with Java execution) is not supported though for a different reason. In Presto on Spark a distributed stage cannot read from a coordinator stage. Queries that require a system table to be read by a distributed stage (such as join) are not supported.

Follow up of prestodb#21285 Partial aggregation output might not be compatible between Java and C++ implementations

Follow up of #21285 Partial aggregation output might not be compatible between Java and C++ implementations

Follow up of prestodb#21285 Partial aggregation output might not be compatible between Java and C++ implementations

arhimondr added 3 commits October 31, 2023 11:28

[native] Include native workers in the total number of cluster nodes

f741042

[native] Mitigate partitioning incompatibility for system tables

08605f0

[native pos] Disable optimized-partition-update-serialization

efdd998

Currently not supported by native execution

arhimondr requested a review from mbasmanova October 31, 2023 17:22

arhimondr requested review from shrinidhijoshi and a team as code owners October 31, 2023 17:22

arhimondr requested review from presto-oss, zacw7 and amitkdutta October 31, 2023 17:22

mbasmanova approved these changes Oct 31, 2023

View reviewed changes

amitkdutta merged commit 2172b36 into prestodb:master Oct 31, 2023
59 checks passed

arhimondr mentioned this pull request Jan 18, 2024

[native] Disable partial aggregation over system table scan #21725

Merged

6 tasks

arhimondr added a commit to arhimondr/presto that referenced this pull request Jan 19, 2024

[native] Disable partial agg over system table scan

bf5b3f5

Follow up of prestodb#21285 Partial aggregation output might not be compatible between Java and C++ implementations

arhimondr added a commit that referenced this pull request Jan 19, 2024

[native] Disable partial agg over system table scan

a6f8091

Follow up of #21285 Partial aggregation output might not be compatible between Java and C++ implementations

mbasmanova mentioned this pull request Feb 28, 2024

[native] SystemConnector to query system.runtime.tasks table #21416

Merged

kaikalur pushed a commit to kaikalur/presto that referenced this pull request Mar 14, 2024

[native] Disable partial agg over system table scan

2bc9901

Follow up of prestodb#21285 Partial aggregation output might not be compatible between Java and C++ implementations

wypb pushed a commit to wypb/presto that referenced this pull request Apr 25, 2024

[native] Disable partial agg over system table scan

1e16dc1

Follow up of prestodb#21285 Partial aggregation output might not be compatible between Java and C++ implementations

tdcmeehan mentioned this pull request Oct 23, 2024

[native] Add native plan checker and native endpoint for Velox plan conversion #23596

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[native] Support system tables #21285

[native] Support system tables #21285

arhimondr commented Oct 31, 2023

arhimondr commented Oct 31, 2023

mbasmanova left a comment

arhimondr commented Nov 1, 2023

[native] Support system tables #21285

[native] Support system tables #21285

Conversation

arhimondr commented Oct 31, 2023

Description

Motivation and Context

Impact

Test Plan

Contributor checklist

Release Notes

arhimondr commented Oct 31, 2023

mbasmanova left a comment

Choose a reason for hiding this comment

arhimondr commented Nov 1, 2023