You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Sep 18, 2023. It is now read-only.
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
The plan is generated from Spark when doing "count distinct" with more than once in a query
select
count(distinct l_linestatus) as dist_l_linestatus,
count(distinct l_returnflag) as dist_l_returnflag
from
lineitem
where
l_shipdate <= date '1998-12-01'
in each Count distinct aggregation Spark will append a filter (WHERE gid = 1).
the second use case is like below (Aggregation with FILTER)
Describe the solution you'd like
Currently Gazelle will fallback to Vanilla Spark to execute such quereis. The overhead is big when doing C2R/R2C. Should better to support these cases natively.
Describe alternatives you've considered
N/A
Additional context
N/a
The text was updated successfully, but these errors were encountered:
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
The plan is generated from Spark when doing "count distinct" with more than once in a query
in each Count distinct aggregation Spark will append a filter (WHERE gid = 1).
the second use case is like below (Aggregation with FILTER)
Describe the solution you'd like
Currently Gazelle will fallback to Vanilla Spark to execute such quereis. The overhead is big when doing C2R/R2C. Should better to support these cases natively.
Describe alternatives you've considered
N/A
Additional context
N/a
The text was updated successfully, but these errors were encountered: