TYP: Add annotation for df.pivot #32197

charlesdong1991 · 2020-02-23T10:31:51Z

xref ENH: Allow multi values for index and columns in df.pivot #30928
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

MarcoGorelli · 2020-02-24T14:10:52Z

pandas/core/reshape/pivot.py

+def pivot(
+    data: "DataFrame",
+    index: Optional[Union[Label, Collection[Label]]] = None,
+    columns: Union[Label, List[Label]] = None,


Should this also be Optional?

yeah, i am not very sure, got some mypy errors below to complain that columns is not Optional, probably because we will raise error if None.

WillAyd · 2020-02-24T20:10:23Z

pandas/core/reshape/pivot.py

-def pivot(data: "DataFrame", index=None, columns=None, values=None) -> "DataFrame":
+def pivot(
+    data: "DataFrame",
+    index: Optional[Union[Label, List[Optional[Label]]]] = None,


Suggested change

index: Optional[Union[Label, List[Optional[Label]]]] = None,

index: Optional[Union[Label, Sequence[Optional[Label]]]] = None,

For API items we want to be as generic as possible, so Sequence instead of List and Mapping instead of Dict (unless the interface really isn't generic)

Same comment on next two lines

For API items we want to be as generic as possible, so Sequence instead of List and Mapping instead of Dict (unless the interface really isn't generic)

thanks for the very nice tip!!! 👍

pep8speaks · 2020-02-24T21:12:00Z

Hello @charlesdong1991! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-05-18 19:53:56 UTC

jreback

not a fan of all of the type ignores

cc @simonjayhawkins @WillAyd

jreback · 2020-04-10T17:13:05Z

pandas/core/reshape/pivot.py

-def pivot(data: "DataFrame", index=None, columns=None, values=None) -> "DataFrame":
+def pivot(
+    data: "DataFrame",
+    index: Optional[Union[Label, Sequence[Optional[Label]]]] = None,


i t hink would be ok to add these to _typing, maybe Labels? so this becomes Optional[Labels]

jreback · 2020-04-10T17:13:36Z

pandas/core/reshape/pivot.py

        if index is None:
            pass
        elif is_list_like(index):
-            cols = list(index)
+            # Remove type ignore once mypy-5206 is implemented, same for below


can you add an assert here to make mypy happy?

simonjayhawkins · 2020-04-17T13:34:02Z

not a fan of all of the type ignores

we have the choice of ignores, ignores with error codes, casts and asserts.

my preference here would be casts (only after is_list_like calls) xref #32785

diff --git a/pandas/core/reshape/pivot.py b/pandas/core/reshape/pivot.py
index e49dd4c9b..855189c43 100644
--- a/pandas/core/reshape/pivot.py
+++ b/pandas/core/reshape/pivot.py
@@ -1,4 +1,14 @@
-from typing import TYPE_CHECKING, Callable, Dict, List, Optional, Sequence, Tuple, Union
+from typing import (
+    TYPE_CHECKING,
+    Callable,
+    Dict,
+    List,
+    Optional,
+    Sequence,
+    Tuple,
+    Union,
+    cast,
+)
 
 import numpy as np
 
@@ -427,24 +437,28 @@ def _convert_by(by):
 @Appender(_shared_docs["pivot"], indents=1)
 def pivot(
     data: "DataFrame",
-    index: Optional[Union[Label, Sequence[Optional[Label]]]] = None,
-    columns: Optional[Union[Label, Sequence[Optional[Label]]]] = None,
-    values: Optional[Union[Label, Sequence[Optional[Label]]]] = None,
+    index: Optional[Union[Label, Sequence[Label]]] = None,
+    columns: Optional[Union[Label, Sequence[Label]]] = None,
+    values: Optional[Union[Label, Sequence[Label]]] = None,
 ) -> "DataFrame":
     if columns is None:
         raise TypeError("pivot() missing 1 required argument: 'columns'")
-    columns = columns if is_list_like(columns) else [columns]
+    if is_list_like(columns):
+        columns = cast(Sequence[Label], columns)
+        columns = columns
+    else:
+        columns = [columns]
 
     if values is None:
-        cols: List[Optional[Sequence]] = []
+        cols: List[Label] = []
         if index is None:
             pass
         elif is_list_like(index):
-            # Remove type ignore once mypy-5206 is implemented, same for below
-            cols = list(index)  # type: ignore
+            index = cast(Sequence[Label], index)
+            cols = list(index)
         else:
-            cols = [index]  # type: ignore
-        cols.extend(columns)  # type: ignore
+            cols = [index]
+        cols.extend(columns)
 
         append = index is None
         indexed = data.set_index(cols, append=append)
@@ -452,18 +466,20 @@ def pivot(
         if index is None:
             idx_list = [Series(data.index, name=data.index.name)]
         elif is_list_like(index):
-            idx_list = [data[idx] for idx in index]  # type: ignore
+            index = cast(Sequence[Label], index)
+            idx_list = [data[idx] for idx in index]
         else:
             idx_list = [data[index]]
 
-        data_columns = [data[col] for col in columns]  # type: ignore
+        data_columns = [data[col] for col in columns]
         idx_list.extend(data_columns)
         mi_index = MultiIndex.from_arrays(idx_list)
 
         if is_list_like(values) and not isinstance(values, tuple):
             # Exclude tuple because it is seen as a single column name
+            values = cast(Sequence[Label], values)
             indexed = data._constructor(
-                data[values]._values, index=mi_index, columns=values  # type: ignore
+                data[values]._values, index=mi_index, columns=values
             )
         else:
             indexed = data._constructor_sliced(data[values]._values, index=mi_index)

it looks like the use of type: ignores here masked an incorrect type definition of cols

each cast used here could instead be replaced with a runtime assert until resolution of #32785

This diff uses 4 casts corresponding to 4 is_list_like calls compared with 6 ignores for which the reasons are not immediately obvious.

simonjayhawkins · 2020-05-17T16:57:42Z

@charlesdong1991 whats the status here?

charlesdong1991 · 2020-05-17T19:06:54Z

I have updated and it seems cast indeed work here.

Thanks very much! @simonjayhawkins

pandas/core/reshape/pivot.py

charlesdong1991 · 2020-05-18T11:16:24Z

@jreback i use convert_to_list_like for the first one, keep your second comment as is, and pls let me know if you think it's okay

pandas/core/reshape/pivot.py

charlesdong1991 · 2020-05-18T16:31:37Z

many thanks for the help on annotations @simonjayhawkins i think it's good to go now!

simonjayhawkins · 2020-05-18T18:37:37Z

Thanks @charlesdong1991 .lgtm. I think if you type the return type of MultiIndex.from_arrays you could revert the variable name changes idx_list and mi_index and reduce the diff.

    def from_arrays(cls, arrays, sortorder=None, names=lib.no_default) -> "MultiIndex":

charlesdong1991 · 2020-05-18T18:59:09Z

ahh, yeah, indeed seems pass mypy locally.

very nice!! @simonjayhawkins thanks!

simonjayhawkins

@charlesdong1991 Thanks. lgtm pending green. @jreback @WillAyd

WillAyd

lgtm @jreback

jreback · 2020-05-18T22:53:39Z

thanks @charlesdong1991

charlesdong1991 added 10 commits December 3, 2018 17:43

remove \n from docstring

7e461a1

fix conflicts

1314059

Merge remote-tracking branch 'upstream/master'

8bcb313

Merge remote-tracking branch 'upstream/master'

24c3ede

fix issue 17038

dea38f2

revert change

cd9e7ac

revert change

e5e912b

Merge remote-tracking branch 'upstream/master' into add_signature_30928

f337468

Add signature for pivot

1cc1225

sorting

046cdc9

charlesdong1991 mentioned this pull request Feb 23, 2020

Feature request Multi Index pivot #32129

Closed

try fix annotation

2807b63

MarcoGorelli reviewed Feb 24, 2020

View reviewed changes

charlesdong1991 added 2 commits February 24, 2020 20:05

fixup

2fdd875

fixup

5453237

WillAyd requested changes Feb 24, 2020

View reviewed changes

WillAyd added the Typing type annotations, mypy/pyright type checking label Feb 24, 2020

charlesdong1991 added 3 commits February 24, 2020 21:27

fixup

18e85bb

fixup

f61dcac

fixup

5e3ac3f

charlesdong1991 added 8 commits February 24, 2020 22:12

fixup

ffda679

fixup

6b81abb

fixup

c01f221

fixup

1552f4b

fixup

29b0605

fixup

4d2d6d3

fixup

f7fb25a

Merge remote-tracking branch 'upstream/master' into add_signature_30928

80e9710

charlesdong1991 requested a review from WillAyd February 25, 2020 20:49

jreback requested changes Apr 10, 2020

View reviewed changes

charlesdong1991 added 2 commits May 17, 2020 20:19

Merge remote-tracking branch 'upstream/master' into add_signature_30928

62edda6

use cast

f063a24

jreback added this to the 1.1 milestone May 17, 2020

jreback requested changes May 17, 2020

View reviewed changes

pandas/core/reshape/pivot.py Outdated Show resolved Hide resolved

pandas/core/reshape/pivot.py Outdated Show resolved Hide resolved

use convert_list_like

1ac9e0f

charlesdong1991 requested a review from jreback May 18, 2020 11:16

commit uncommited change

c17dd80

simonjayhawkins reviewed May 18, 2020

View reviewed changes

pandas/core/reshape/pivot.py Show resolved Hide resolved

simonjayhawkins reviewed May 18, 2020

View reviewed changes

pandas/core/reshape/pivot.py Outdated Show resolved Hide resolved

charlesdong1991 added 2 commits May 18, 2020 17:06

fixup

16798b1

get rid of cast

63643aa

simonjayhawkins reviewed May 18, 2020

View reviewed changes

pandas/core/reshape/pivot.py Show resolved Hide resolved

simonjayhawkins reviewed May 18, 2020

View reviewed changes

pandas/core/reshape/pivot.py Outdated Show resolved Hide resolved

charlesdong1991 added 2 commits May 18, 2020 17:21

simplify

7b10761

fix failed test

9870d0f

less diff

4e05ec0

simonjayhawkins approved these changes May 18, 2020

View reviewed changes

Merge remote-tracking branch 'upstream/master' into add_signature_30928

65b45a0

WillAyd approved these changes May 18, 2020

View reviewed changes

jreback approved these changes May 18, 2020

View reviewed changes

jreback merged commit 1f48d3d into pandas-dev:master May 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TYP: Add annotation for df.pivot #32197

TYP: Add annotation for df.pivot #32197

charlesdong1991 commented Feb 23, 2020

MarcoGorelli Feb 24, 2020

charlesdong1991 Feb 24, 2020

WillAyd Feb 24, 2020

WillAyd Feb 24, 2020

charlesdong1991 Feb 24, 2020

pep8speaks commented Feb 24, 2020 •

edited

Loading

jreback left a comment

jreback Apr 10, 2020

jreback Apr 10, 2020

simonjayhawkins commented Apr 17, 2020

simonjayhawkins commented May 17, 2020

charlesdong1991 commented May 17, 2020

charlesdong1991 commented May 18, 2020

charlesdong1991 commented May 18, 2020

simonjayhawkins commented May 18, 2020

charlesdong1991 commented May 18, 2020

simonjayhawkins left a comment

WillAyd left a comment

jreback commented May 18, 2020

	index: Optional[Union[Label, List[Optional[Label]]]] = None,
	index: Optional[Union[Label, Sequence[Optional[Label]]]] = None,

TYP: Add annotation for df.pivot #32197

TYP: Add annotation for df.pivot #32197

Conversation

charlesdong1991 commented Feb 23, 2020

MarcoGorelli Feb 24, 2020

Choose a reason for hiding this comment

charlesdong1991 Feb 24, 2020

Choose a reason for hiding this comment

WillAyd Feb 24, 2020

Choose a reason for hiding this comment

WillAyd Feb 24, 2020

Choose a reason for hiding this comment

charlesdong1991 Feb 24, 2020

Choose a reason for hiding this comment

pep8speaks commented Feb 24, 2020 • edited Loading

Comment last updated at 2020-05-18 19:53:56 UTC

jreback left a comment

Choose a reason for hiding this comment

jreback Apr 10, 2020

Choose a reason for hiding this comment

jreback Apr 10, 2020

Choose a reason for hiding this comment

simonjayhawkins commented Apr 17, 2020

simonjayhawkins commented May 17, 2020

charlesdong1991 commented May 17, 2020

charlesdong1991 commented May 18, 2020

charlesdong1991 commented May 18, 2020

simonjayhawkins commented May 18, 2020

charlesdong1991 commented May 18, 2020

simonjayhawkins left a comment

Choose a reason for hiding this comment

WillAyd left a comment

Choose a reason for hiding this comment

jreback commented May 18, 2020

pep8speaks commented Feb 24, 2020 •

edited

Loading