Create objects from iterables that contain cudf.NA #8442

brandon-b-miller
Contributor

Closes #8287

PyArrow knows how to handle the pd.NA singleton and treats it as null when from_pandas=True is passed during array construction. However, there is no option to choose which sentinel or value represents null, and the check for whether a value is exactly this object is implemented at the C++ level in PyArrow, which limits our options for 'tricking' PyArrow into seeing cudf.NA as null.

As such, it is probably best that our NA be the very same object as the pandas NA. This also makes `cudf.NA is pd.NA` return True, which is probably what we want as well.

@brandon-b-miller added the feature request, 2 - In Progress, Python, and non-breaking labels on Jun 4, 2021
@brandon-b-miller self-assigned this on Jun 4, 2021
@brandon-b-miller requested a review from a team as a code owner on June 4, 2021 16:07

codecov bot commented Jun 8, 2021

Codecov Report

❗ No coverage uploaded for pull request base (branch-21.08@90e29d9).
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-21.08    #8442   +/-   ##
===============================================
  Coverage                ?   83.89%           
===============================================
  Files                   ?      109           
  Lines                   ?    19636           
  Branches                ?        0           
===============================================
  Hits                    ?    16473           
  Misses                  ?     3163           
  Partials                ?        0           

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 90e29d9...2899d79.

@brandon-b-miller added the 3 - Ready for Review label and removed the 2 - In Progress label on Jun 8, 2021
Contributor

@marlenezw marlenezw left a comment

Great! Looks good to me!

@brandon-b-miller
Contributor Author

rerun tests

@brandon-b-miller added the 5 - Ready to Merge label and removed the 3 - Ready for Review label on Jun 11, 2021
@brandon-b-miller
Contributor Author

@gpucibot merge

Labels
5 - Ready to Merge, feature request, non-breaking, Python
Development

Successfully merging this pull request may close these issues.

[FEA] Instantiate cuDF objects from python objects containing cudf.NA
3 participants