Fix (DataFrame|Series).isin to pass numpy array #2103

itholic · 2021-03-15T05:47:05Z

(Series|DataFrame).isin don't work properly when passing numpy array as a parameter.

>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [2, 3, 4]})
>>> numpy_arr = np.array([3, 4, 5])
>>> df['a'].isin(numpy_arr)
0    False
1    False
2     True
Name: a, dtype: bool

>>> kdf = ks.from_pandas(df)
>>> kdf[kdf['a'].isin(numpy_arr)]
Traceback (most recent call last):
...
AttributeError: 'numpy.int64' object has no attribute '_get_object_id'

This should resolve #2098

codecov-io · 2021-03-15T06:18:01Z

Codecov Report

Merging #2103 (716e996) into master (2fe8796) will decrease coverage by 5.43%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #2103      +/-   ##
==========================================
- Coverage   95.21%   89.78%   -5.44%     
==========================================
  Files          60       60              
  Lines       13460    13347     -113     
==========================================
- Hits        12816    11983     -833     
- Misses        644     1364     +720

Impacted Files	Coverage Δ
databricks/koalas/base.py	`93.39% <100.00%> (-3.51%)`	⬇️
databricks/koalas/frame.py	`93.42% <100.00%> (-3.11%)`	⬇️
databricks/koalas/plot/plotly.py	`15.78% <0.00%> (-81.06%)`	⬇️
...bricks/koalas/tests/plot/test_frame_plot_plotly.py	`23.33% <0.00%> (-76.67%)`	⬇️
...ricks/koalas/tests/plot/test_series_plot_plotly.py	`26.92% <0.00%> (-71.26%)`	⬇️
databricks/koalas/usage_logging/__init__.py	`28.20% <0.00%> (-64.36%)`	⬇️
databricks/koalas/usage_logging/usage_logger.py	`47.82% <0.00%> (-52.18%)`	⬇️
databricks/koalas/typedef/typehints.py	`66.84% <0.00%> (-27.72%)`	⬇️
databricks/koalas/__init__.py	`77.63% <0.00%> (-14.48%)`	⬇️
databricks/conftest.py	`87.27% <0.00%> (-12.73%)`	⬇️
... and 29 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2fe8796...716e996. Read the comment docs.

lfdversluis · 2021-03-15T09:13:11Z

databricks/koalas/tests/test_dataframe.py

@@ -1848,10 +1848,16 @@ def test_isin(self):
        kdf = ks.from_pandas(pdf)

        self.assert_eq(kdf.isin([4, "six"]), pdf.isin([4, "six"]))
+        # Seems like pandas has a bug when passing `np.array` as parameter


pandas should be koalas :)

It must be caused by type-coercion rule. We can leave it as-is. cc @HyukjinKwon

Thanks, @ueshin :)

databricks/koalas/base.py

databricks/koalas/frame.py

itholic · 2021-03-24T02:39:55Z

I'm pretty sure for this fix. Please feel free to leave comment if any!

Fix isin to pass numpy array

716e996

itholic changed the title ~~Fix isin to pass numpy array~~ Fix (DataFrame|Series).isin to pass numpy array Mar 15, 2021

itholic mentioned this pull request Mar 15, 2021

Series.isin() throws exception when fed numpy array #2098

Closed

lfdversluis reviewed Mar 15, 2021

View reviewed changes

xinrong-meng reviewed Mar 15, 2021

View reviewed changes

databricks/koalas/base.py Show resolved Hide resolved

xinrong-meng self-requested a review March 16, 2021 21:13

xinrong-meng approved these changes Mar 16, 2021

View reviewed changes

lfdversluis reviewed Mar 17, 2021

View reviewed changes

databricks/koalas/frame.py Show resolved Hide resolved

itholic merged commit 95022f3 into databricks:master Mar 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix (DataFrame|Series).isin to pass numpy array #2103

Fix (DataFrame|Series).isin to pass numpy array #2103

itholic commented Mar 15, 2021

codecov-io commented Mar 15, 2021

lfdversluis Mar 15, 2021

itholic Mar 15, 2021 •

edited

Loading

ueshin Mar 19, 2021

itholic Mar 24, 2021

itholic commented Mar 24, 2021

Fix (DataFrame|Series).isin to pass numpy array #2103

Fix (DataFrame|Series).isin to pass numpy array #2103

Conversation

itholic commented Mar 15, 2021

codecov-io commented Mar 15, 2021

Codecov Report

lfdversluis Mar 15, 2021

Choose a reason for hiding this comment

itholic Mar 15, 2021 • edited Loading

Choose a reason for hiding this comment

ueshin Mar 19, 2021

Choose a reason for hiding this comment

itholic Mar 24, 2021

Choose a reason for hiding this comment

itholic commented Mar 24, 2021

itholic Mar 15, 2021 •

edited

Loading