chore: move infer_schema and infer_data_type into vortex-dtype #1081

Merged

Conversation


@danking danking commented Oct 17, 2024

I want to use these in PyVortex without depending on vortex-datafusion.

@danking danking force-pushed the dk/pull-arrow-dtype-conversion-into-arrow-dtype-pacakge branch from 2632167 to edf9b52 Compare October 17, 2024 19:43
@danking danking force-pushed the dk/pull-arrow-dtype-conversion-into-arrow-dtype-pacakge branch from edf9b52 to b0ab48f Compare October 17, 2024 19:43
@danking danking marked this pull request as ready for review October 17, 2024 19:44
AdamGS previously approved these changes Oct 17, 2024

@robert3005 robert3005 left a comment

We should not standardise this code; we should come up with a proper solution instead. This method is wrong but necessary for DataFusion. I'm not sure what the flow is in PyArrow, but I would like to avoid making this method part of the API. At most, we should copy-paste it into PyVortex as an implementation detail.


danking commented Oct 18, 2024

@robert3005 we now bail on lists

// There are four kinds of lists: List (32-bit offsets), Large List (64-bit), List View
// (32-bit), Large List View (64-bit). We cannot both guarantee zero-copy and commit to an
// Arrow dtype because we do not know how large our offsets are.
DType::List(..) => vortex_bail!("Unsupported dtype: {}", dtype),
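
To illustrate the behaviour described above, here is a minimal, self-contained sketch of a dtype-to-Arrow mapping that refuses list types instead of guessing an offset width. The `DType` and `ArrowType` enums and the `String` error are simplified, hypothetical stand-ins; they are not the real `vortex-dtype` or Arrow types, and the real code uses `vortex_bail!` rather than a plain `Err`.

```rust
// Hypothetical, simplified stand-ins for illustration only.
#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum DType {
    Bool,
    Int64,
    Utf8,
    List(Box<DType>),
}

#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum ArrowType {
    Boolean,
    Int64,
    Utf8,
}

/// Map a logical dtype to an Arrow type, failing on lists.
fn infer_data_type(dtype: &DType) -> Result<ArrowType, String> {
    match dtype {
        DType::Bool => Ok(ArrowType::Boolean),
        DType::Int64 => Ok(ArrowType::Int64),
        DType::Utf8 => Ok(ArrowType::Utf8),
        // The logical dtype does not record the offset width, so any choice
        // among the four Arrow list kinds would be a guess. Refuse instead.
        DType::List(..) => Err(format!("Unsupported dtype: {:?}", dtype)),
    }
}
```

With this sketch, `infer_data_type(&DType::Int64)` succeeds, while any `DType::List(..)` returns an error that the caller has to handle explicitly.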

For posterity: we will canonicalize to List View. We still need to figure out what to do about fixed-size lists, though.
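
As a rough illustration of what a List View layout involves, the sketch below converts List-style offsets (`n + 1` entries, where list `i` spans `offsets[i]..offsets[i + 1]`) into List View-style offsets and sizes (`n` entries each). The function name `list_to_list_view` is hypothetical, and this is not the Vortex canonicalization code, which is not shown in this thread.

```rust
/// Convert List-style offsets (n + 1 entries) into List View-style
/// (offsets, sizes), each with n entries. Assumes offsets are non-decreasing.
fn list_to_list_view(offsets: &[i32]) -> (Vec<i32>, Vec<i32>) {
    if offsets.is_empty() {
        return (Vec::new(), Vec::new());
    }
    // Each list starts where the List offsets say it does; drop the final end marker.
    let starts = offsets[..offsets.len() - 1].to_vec();
    // Each size is the difference between consecutive offsets.
    let sizes = offsets.windows(2).map(|w| w[1] - w[0]).collect();
    (starts, sizes)
}
```

For example, the offsets `[0, 2, 2, 5]` describe three lists of lengths 2, 0, and 3, which become offsets `[0, 2, 2]` and sizes `[2, 0, 3]`.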

@danking danking merged commit 5b1eee8 into develop Oct 18, 2024
5 checks passed
@danking danking deleted the dk/pull-arrow-dtype-conversion-into-arrow-dtype-pacakge branch October 18, 2024 15:57
4 participants