Query parallel execution #89

Closed
wants to merge 3 commits into from

Conversation

galsalomon66

@galsalomon66 galsalomon66 commented Sep 2, 2021

This PR introduces two main flows:
(1) Non-aggregation flow: the input object is split into N ranges, and each range gets its own context.
(2) Aggregation flow (for aggregation queries): requires two phases.
When the first phase (running the query on a specific range) completes,
the AST is traversed for aggregation nodes, and the result of each node is pushed into a dedicated scratch area.
The second phase uses the results of the first phase.
The AST for the second phase will be refactored.
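
To make the two flows above concrete, here is a minimal Python sketch. It is not the code in this PR and not the s3select API: `split_ranges`, `run_filter`, `run_partial_sum`, and `parallel_query` are hypothetical names, the "query" is hard-coded (filter on column 1 > 10, and a sum/count over column 1), and a plain list stands in for the scratch area. It only illustrates the shape of the idea: split on row boundaries, evaluate each range independently, then combine partial aggregates in a second phase.

```python
import csv
import io
from concurrent.futures import ThreadPoolExecutor


def split_ranges(data: str, n: int) -> list[str]:
    """Split CSV text into up to n ranges, cutting only on row boundaries."""
    rows = data.splitlines(keepends=True)
    size = max(1, -(-len(rows) // n))  # ceiling division, at least 1 row per range
    return ["".join(rows[i:i + size]) for i in range(0, len(rows), size)]


def run_filter(chunk: str) -> list[list[str]]:
    """Non-aggregation flow: evaluate one range in its own context."""
    return [row for row in csv.reader(io.StringIO(chunk)) if int(row[1]) > 10]


def run_partial_sum(chunk: str) -> tuple[int, int]:
    """Aggregation phase 1: partial (sum, count) of column 1 for one range."""
    values = [int(row[1]) for row in csv.reader(io.StringIO(chunk))]
    return sum(values), len(values)


def parallel_query(data: str, n: int):
    chunks = split_ranges(data, n)
    with ThreadPoolExecutor(max_workers=n) as pool:
        # Non-aggregation flow: per-range results are simply concatenated in order.
        filtered = [row for part in pool.map(run_filter, chunks) for row in part]
        # Aggregation phase 1: per-range partials go into a stand-in "scratch area".
        scratch = list(pool.map(run_partial_sum, chunks))
    # Aggregation phase 2: combine the partial results from phase 1.
    total = sum(s for s, _ in scratch)
    count = sum(c for _, c in scratch)
    return filtered, total, count
```

Note that an aggregate such as an average cannot be combined by averaging per-range averages; carrying `(sum, count)` through phase 1 is one way to keep phase 2 correct, which is presumably why the second phase needs its own (refactored) AST.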

…on, result returns as a set of values. Missing memory management for variable and value.

Signed-off-by: gal salomon <[email protected]>
Two main flows:
(1) Non-aggregation flow: the input object is split into N ranges, and each range gets its own context.
(2) Aggregation flow (for aggregation queries): requires two phases.
When the first phase (running the query on a specific range) completes,
the AST is traversed for aggregation nodes, and the result of each node is pushed into a dedicated scratch area.

The current commit handles the CSV use case / non-aggregation flow.
@galsalomon66
Author

#103 handles parallel execution. It takes a different approach: the caller either splits the input or defines many input streams as a single one.
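
As a rough illustration of the "many input streams as a single one" idea, the sketch below chains several line-oriented inputs into one iterator on the caller side. This is an assumption about the general technique, not the interface introduced in #103; `as_single_stream` and `count_rows` are hypothetical helpers, and each input is assumed to end on a row boundary.

```python
import csv
import io
from itertools import chain
from typing import Iterable, Iterator, TextIO


def as_single_stream(streams: Iterable[TextIO]) -> Iterator[str]:
    """Present several line-oriented inputs as one continuous iterator of lines."""
    return chain.from_iterable(streams)


def count_rows(lines: Iterable[str]) -> int:
    """A trivial consumer: count CSV rows without caring how many inputs there were."""
    return sum(1 for _ in csv.reader(lines))


streams = [io.StringIO("a,1\nb,2\n"), io.StringIO("c,3\n")]
row_count = count_rows(as_single_stream(streams))
```

The consumer sees one logical input, so the splitting decision stays entirely with the caller.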
