Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Metrics Computing: DataFrame API #7

Open
brayanjuls opened this issue May 25, 2024 · 2 comments
Open

Metrics Computing: DataFrame API #7

brayanjuls opened this issue May 25, 2024 · 2 comments
Assignees

Comments

@brayanjuls
Copy link
Contributor

Design an API that help us support multiple DataFrame(polars,spark, pandas,etc) and convert them to the choosen processing engine native DataFrame API.

@brayanjuls
Copy link
Contributor Author

The initial objective was to support multiple Dataframe APIs and a single backend for the execution of the query but from a UX point of view it doesn't make sense because if a user is using a different execution engine to process their data we would be forcing that user to use two backends just to use our library. Additionally, to support conversion between i.e polars and DataFusion we would need that both implement substrait format which is not the case and given recent conversation(see issue 7404) in the polars project it seems not be planned for the near future, nor Apache Spark(stuck pr) or Apache Flink support this format yet.

One alternative idea is to support multiple backends along with the frontend, meaning that if the user uses polars we would express and compute the metrics in polars and the like for each query engine or tool. This would require more work but will provide a better UX to the end user and reduce the complexity of implementation.

@brayanjuls
Copy link
Contributor Author

This is an image of how it would look like,
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants