Clean pandas usage in frame.agg #821

charlesdong1991 · 2019-09-23T20:32:03Z

I came across the frame.agg when i read through code base, and based on the inline comment, I think the manipulation using pandas could be simplified a bit IIUC.

codecov-io · 2019-09-23T20:57:15Z

Codecov Report

Merging #821 into master will decrease coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master     #821      +/-   ##
==========================================
- Coverage   94.36%   94.36%   -0.01%     
==========================================
  Files          32       32              
  Lines        5854     5853       -1     
==========================================
- Hits         5524     5523       -1     
  Misses        330      330

Impacted Files	Coverage Δ
databricks/koalas/frame.py	`96.89% <100%> (-0.01%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 3e263df...a23fc27. Read the comment docs.

ueshin

This is great. The code is really simpler.

ueshin · 2019-09-23T20:55:49Z

databricks/koalas/frame.py

-            lambda gpdf: gpdf.drop('level_1', 1).set_index('level_0').transpose()
-        ).reset_index(level=1)
-        pdf = pdf.drop(columns='level_1')
+        pdf = kdf.to_pandas().stack(level=1)


We can just use .stack() here? Then I guess we can reuse when supporting multi-index columns.

yeah, .stack() should also work indeed. i was trying to be more explicit. Use .stack() to facilitate supporting multiindex, will change!

thanks! @ueshin

softagram-bot · 2019-09-23T21:10:21Z

Softagram Impact Report for pull/821 (head commit: `a23fc27`)

⭐ Change Overview

(Open in Softagram Desktop for full details)

📄 Full report

Permalink: Full report for pull/821

Impact Report explained. Give feedback on this report to [email protected]

ueshin

LGTM, pending tests.

ueshin · 2019-09-23T21:56:27Z

Thanks! merging.

HyukjinKwon · 2019-09-23T22:37:00Z

Nice! Thanks!

* upstream/master: Updated the koalas logo in readme.md Adding koalas-logo without label Adding Koalas logo to readme Adding koalas logo Clean pandas usage in frame.agg (databricks#821) Implement Series.aggregate and agg (databricks#816) Raise a more helpful error for duplicated columns in Join (databricks#820)

Clean pandas usage

03be943

ueshin reviewed Sep 23, 2019

View reviewed changes

remove level

a23fc27

ueshin approved these changes Sep 23, 2019

View reviewed changes

ueshin merged commit 5e39ad5 into databricks:master Sep 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clean pandas usage in frame.agg #821

Clean pandas usage in frame.agg #821

charlesdong1991 commented Sep 23, 2019 •

edited

Loading

codecov-io commented Sep 23, 2019 •

edited

Loading

ueshin left a comment

ueshin Sep 23, 2019

charlesdong1991 Sep 23, 2019 •

edited

Loading

softagram-bot commented Sep 23, 2019

ueshin left a comment

ueshin commented Sep 23, 2019

HyukjinKwon commented Sep 23, 2019

Clean pandas usage in frame.agg #821

Clean pandas usage in frame.agg #821

Conversation

charlesdong1991 commented Sep 23, 2019 • edited Loading

codecov-io commented Sep 23, 2019 • edited Loading

Codecov Report

ueshin left a comment

Choose a reason for hiding this comment

ueshin Sep 23, 2019

Choose a reason for hiding this comment

charlesdong1991 Sep 23, 2019 • edited Loading

Choose a reason for hiding this comment

softagram-bot commented Sep 23, 2019

Softagram Impact Report for pull/821 (head commit: a23fc27)

⭐ Change Overview

📄 Full report

ueshin left a comment

Choose a reason for hiding this comment

ueshin commented Sep 23, 2019

HyukjinKwon commented Sep 23, 2019

charlesdong1991 commented Sep 23, 2019 •

edited

Loading

codecov-io commented Sep 23, 2019 •

edited

Loading

charlesdong1991 Sep 23, 2019 •

edited

Loading

Softagram Impact Report for pull/821 (head commit: `a23fc27`)