Support for overloaded functions in stubgenc generated by pybind11 #5975

wiktorn · 2018-11-29T22:31:30Z

For overloaded methods, pybind11 generates the following docstring:

__init__(*args, **kwargs)
Overloaded function.

1. __init__(self: TestClass, arg0: str) -> None

2. __init__(self: TestClass, arg0: str, arg1: str) -> None

For such docstrings, stubgenc currently produces the following stub:

def __init__(self, *args, **kwargs) -> Any: ...

With this change, stubgenc will generate the following stub:

@overload
def __init__(self, arg0: str) -> None: ...
@overload
def __init__(self, arg0: str, arg1: str) -> None: ...
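A minimal sketch of the transformation described above, assuming the numbered-overload docstring layout shown in this description. The helper name `overload_stubs` and the regular expression are hypothetical and are not the code in this PR; they only illustrate how each numbered line maps to one `@overload` stub.

```python
import re
from typing import List

# Matches numbered overload lines such as
# "1. __init__(self: TestClass, arg0: str) -> None".
# The pattern is an assumption based on the docstring shown above.
_OVERLOAD_LINE = re.compile(r'^\s*\d+\.\s+(\w+)\((.*)\)\s*->\s*(.+?)\s*$')


def overload_stubs(docstring: str) -> List[str]:
    """Return one '@overload' stub per numbered signature in the docstring."""
    stubs = []
    for line in docstring.splitlines():
        match = _OVERLOAD_LINE.match(line)
        if match is None:
            continue  # summary line, blank line, or free-form text
        name, args, ret = match.groups()
        # The docstring annotates self with the class name; the stub does not.
        args = re.sub(r'^self:\s*[\w.]+', 'self', args)
        stubs.append('@overload\ndef {}({}) -> {}: ...'.format(name, args, ret))
    return stubs


if __name__ == '__main__':
    doc = (
        '__init__(*args, **kwargs)\n'
        'Overloaded function.\n'
        '\n'
        '1. __init__(self: TestClass, arg0: str) -> None\n'
        '\n'
        '2. __init__(self: TestClass, arg0: str, arg1: str) -> None\n'
    )
    print('\n'.join(overload_stubs(doc)))
```

The unnumbered first line, `__init__(*args, **kwargs)`, is skipped because it carries no usable type information.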

This pull request introduces:

class FunctionSig, representing the signature of a function/method (function name, arguments, return value)
class ArgSig, representing information about a function argument (argument name, type, default value)

infer_sig_from_docstring now returns a list of FunctionSig objects. The list contains one object when the function is not overloaded and several when it is. Each object represents one signature of the function.

infer_arg_sig_from_docstring is a new function that parses the argument list of a function in a docstring. Because type information may contain commas, the code checks whether each comma is outside brackets. I tried to approach that with a regex and failed; I also tried the ast module, but it's not straightforward to go from the AST form back to source code form. That's why I settled on a naive approach, as this is not a performance-critical path. Tests for edge cases are included in test_infer_sig_from_docstring.
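The "comma outside brackets" check can be sketched as follows. This is an illustration of the naive approach described above, not the code from this PR; the function name `split_args` is hypothetical.

```python
from typing import List


def split_args(arg_list: str) -> List[str]:
    """Split an argument list on commas that are not nested inside brackets."""
    depth = 0      # current bracket nesting level
    current = []   # characters of the argument being collected
    parts = []
    for ch in arg_list:
        if ch in '([{':
            depth += 1
        elif ch in ')]}':
            depth -= 1
        if ch == ',' and depth == 0:
            # A top-level comma ends the current argument.
            parts.append(''.join(current).strip())
            current = []
        else:
            current.append(ch)
    tail = ''.join(current).strip()
    if tail:
        parts.append(tail)
    return parts


if __name__ == '__main__':
    print(split_args('self, arg0: Dict[str, int], arg1: Tuple[int, List[str]] = None'))
```

A comma inside a string default (for example `sep: str = ','`) would still be split by this sketch, which is one reason a real tokenizer is more robust.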

gvanrossum · 2018-12-01T00:49:12Z

For such docstrings stubgenc will produce following stub:

I presume you mean that stubgenc should produce such a stub, and this PR implements that behavior? (We prefer our commit messages to describe the change rather than the new state of affairs.)

I will try to review soon, but I think this has missed the train for the upcoming 0.650 release (see #5960).

wiktorn · 2018-12-01T12:50:17Z

Ok, updated description.

gvanrossum · 2018-12-04T02:31:12Z

Hm... Having looked at this a bit more, it really feels like string partitioning is a pretty poor way of parsing something of this complexity. I realize that all of stubgen looks like a hack, but since you are doing your best to make it better, perhaps you can improve the approach to parsing a bit more? Maybe write a tiny recursive-descent parser, using the tokenize module for tokenization?
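One way the tokenize suggestion could look is sketched below. It is not a full recursive-descent parser and not the code that was eventually merged; the function name `split_args_tokenize` is hypothetical. It only shows how `tokenize.generate_tokens` can replace character-level string partitioning when splitting a single-line argument list.

```python
import io
import tokenize
from typing import List


def split_args_tokenize(arg_list: str) -> List[str]:
    """Split a single-line argument list on top-level commas using tokenize."""
    depth = 0    # bracket nesting level
    start = 0    # column where the current argument begins
    parts = []
    for tok in tokenize.generate_tokens(io.StringIO(arg_list).readline):
        if tok.type != tokenize.OP:
            continue  # names, strings, numbers, NEWLINE, ENDMARKER
        if tok.string in ('(', '[', '{'):
            depth += 1
        elif tok.string in (')', ']', '}'):
            depth -= 1
        elif tok.string == ',' and depth == 0:
            # tok.start and tok.end are (row, column) pairs.
            parts.append(arg_list[start:tok.start[1]].strip())
            start = tok.end[1]
    tail = arg_list[start:].strip()
    if tail:
        parts.append(tail)
    return parts


if __name__ == '__main__':
    print(split_args_tokenize("a: Dict[str, int], sep: str = ','"))
```

Because string literals arrive as single STRING tokens, a comma inside a default such as `','` is not mistaken for a separator. Unbalanced brackets make tokenize raise `tokenize.TokenError`, which a caller would need to handle.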

wiktorn · 2018-12-04T20:31:49Z

Thank you for pointing me to the tokenize module. It looks like this is what I was looking for, and like it is time for me to refresh my knowledge and re-read the library reference :-)

I'll start by replacing the parsing of function declarations in docstrings. A quick scan of the code suggests, though, that some regex usage will remain in stubgenc/stubutil.

gvanrossum · 2018-12-04T20:33:27Z

Sounds like a plan!

gvanrossum · 2018-12-14T18:04:25Z

Hey @wiktorn, just to let you know, we're planning fairly extensive refactorings of part of stubgen; however, we don't expect the docstring parsing to be affected, so don't worry!

wiktorn · 2018-12-15T08:13:16Z

Thank you for the heads up @gvanrossum . I'll finally have more time next week to polish this PR.

wiktorn · 2019-01-12T22:39:10Z

@gvanrossum Can you trigger another AppVeyor build? My guess is that this failure was totally unrelated to my changes (some exceptions in the IPC and ast3 modules, which these changes do not touch).

emmatyping · 2019-01-12T23:03:34Z

@wiktorn if you rebase on master those errors should go away.

gvanrossum · 2019-01-25T20:27:49Z

Sorry I haven't got to review this yet! I missed that you had updated it because you used git push -f. In the future please leave the old revisions alone and just push new changes, it's easier to review.

Note that when #6256 lands this will become one big merge conflict, but according to Ivan it's easy to resolve -- he just moved all the docstring parsing logic to a new file, stubdoc.py.

gvanrossum

Here are a few review comments anyway. Sorry again!

mypy/stubgenc.py

mypy/stubutil.py

wiktorn · 2019-01-26T11:42:27Z

I hope it's OK now. Just let me know how you want to move forward. Shall I wait until #6256 lands in master and then merge master back into this branch?

Sorry for the force push. That's how I understood @ethanhs, and since there were no comments on the code yet, I decided to rebase instead of merging master.

gvanrossum

LG! This looks ready to merge.

I'm leaving it up to @ilevkivskyi which one to merge first: this one (then he has to merge your code back into #6256) or the latter (leaving the merge up to you).

ilevkivskyi

I would be glad to merge this before my PR, but I have a bunch of additional comments.

mypy/stubutil.py

mypy/test/teststubgen.py

ilevkivskyi

It looks like there are a bunch of comments from my previous review that are not implemented. I tried to explain some of them further, in case you didn't understand what to do. Are you going to implement them?

If not, I would probably merge this anyway and fix them myself, but it would be better if you could do this.

ilevkivskyi · 2019-01-29T17:00:58Z

mypy/stubgenc.py

        if name == 'getitem':
-            return '(index)'
+            return [TypedArgSig(name='index', type=None, default=False)]


If you used normal classes instead of named tuples, you could define __init__ as:

def __init__(self, name: str, type: Optional[str] = None, default: bool = False) -> None: ...

and then this and a dozen others would be just TypedArgSig('index').

Also, with normal classes you can define __repr__() to simplify test cases significantly. You can just check that str(<generated signature>) matches an expected string.

ilevkivskyi · 2019-01-29T17:02:39Z

mypy/stubgenc.py

+            return [
+                TypedArgSig(name='name', type=None, default=False),
+                TypedArgSig(name='value', type=None, default=False)
+            ]


For multiline we use this style:

return [TypedArgSig(name='name', type=None, default=False),
        TypedArgSig(name='value', type=None, default=False)]

(also many of these will not need to be multiline if you implement the suggestion above).

mypy/test/teststubgen.py

TypedArgSig -> ArgSig TypedFunctionSig -> FunctionSig

wiktorn · 2019-01-29T17:31:00Z

I missed your comments on the conversation tab on GitHub. I found them while reviewing the changes and have started to address them.

Shorten multiline lists.

ilevkivskyi · 2019-01-29T18:27:08Z

@wiktorn Currently there is one more big change that I proposed: switch from named tuples to normal classes, see #5975 (comment) and #5975 (comment)

After that you could also remove redundant type=None etc. For the tests just use:

class ArgSig:
    ...
    def __repr__(self) -> str:
        r = self.name
        if self.type:
            r += ': ' + self.type
        if self.default:
            r += ' = ...' if self.type else '=...'
        return r

class FunctionSig:
    ...
    def __repr__(self) -> str:
        args = ','.join([str(arg) for arg in self.args])
        # Attribute names name and ret_type are assumed here.
        return '{}({}) -> {}: ...'.format(self.name, args, self.ret_type)

Then many tests can be simplified to just check the string representation of the result. You can still define __eq__() if you want to use direct equality checks in some tests.
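A runnable sketch of the suggestion above, combining the proposed `__init__` defaults, `__repr__` and `__eq__`. The attribute name `ret_type` and the `', '` separator are assumptions made for illustration; this is not necessarily the form that was merged.

```python
from typing import List, Optional


class ArgSig:
    """Information about one function argument."""

    def __init__(self, name: str, type: Optional[str] = None,
                 default: bool = False) -> None:
        self.name = name
        self.type = type
        self.default = default

    def __repr__(self) -> str:
        r = self.name
        if self.type:
            r += ': ' + self.type
        if self.default:
            r += ' = ...' if self.type else '=...'
        return r

    def __eq__(self, other: object) -> bool:
        if not isinstance(other, ArgSig):
            return NotImplemented
        return ((self.name, self.type, self.default)
                == (other.name, other.type, other.default))


class FunctionSig:
    """Signature of a function or method (ret_type is an assumed name)."""

    def __init__(self, name: str, args: List[ArgSig], ret_type: str) -> None:
        self.name = name
        self.args = args
        self.ret_type = ret_type

    def __repr__(self) -> str:
        args = ', '.join(repr(arg) for arg in self.args)
        return '{}({}) -> {}: ...'.format(self.name, args, self.ret_type)


if __name__ == '__main__':
    sig = FunctionSig('method', [ArgSig('self'), ArgSig('arg', 'int')], 'Any')
    print(repr(sig))
```

With this in place a test can compare `repr(sig)` against an expected string, or compare `ArgSig` objects directly through `__eq__`.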

ilevkivskyi · 2019-01-29T18:27:28Z

(also there is a lint failure now)

wiktorn · 2019-01-29T18:31:08Z

And what do you think about this approach:

ArgSig = NamedTuple('ArgSig', [
    ('name', str),
    ('type', Optional[str]),
    ('default', bool)
])

ArgSig.__new__.__defaults__ = (None, False)

As far as I can see, it works in Python 3.4, and once the oldest supported version is 3.7, we can move this declaration into the NamedTuple call itself.

I'm reluctant to use repr strings in tests though.

ilevkivskyi · 2019-01-29T18:37:33Z

ArgSig.__new__.__defaults__ = (None, False)

With this, call sites will not type-check under mypy.

I'm reluctant to use repr strings in tests though.

Why? If you just want to be sure it is clear from the string form that it is a custom class, use something like "FunctionSig('method(self, arg: int) -> Any')". It is still quicker to grasp than the current form. However, I don't have a strong opinion about this, unlike about the default arguments.

wiktorn · 2019-01-29T19:17:22Z

ArgSig.__new__.__defaults__ = (None, False)

With this, call sites will not type-check under mypy.

Testing it right now, it is rather the self-check that fails:
error: "Callable[[Type[NT], str, Optional[str], bool], NT]" has no attribute "__defaults__"

Though it properly detects problems in call sites

error: Argument "type" to "ArgSig" has incompatible type "int"; expected "Optional[str]"
error: Argument "default" to "ArgSig" has incompatible type "str"; expected "bool"

So I'll convert them to normal classes.

I'm reluctant to use repr strings in tests though.

Why? If you just want to be sure it is clear from the string form that it is a custom class, use something like "FunctionSig('method(self, arg: int) -> Any')". It is still quicker to grasp than the current form. However, I don't have a strong opinion about this, unlike about the default arguments.

We could actually have a __str__ method that provides the stub signature. The only problem I see is that, to cover the whole API contract, we would still need to check that the types in the ArgSig objects are properly set, as they are used to add the necessary imports.

ilevkivskyi · 2019-01-29T19:42:44Z

Though it properly detects problems in call sites

I meant that mypy will flag as errors the calls with fewer arguments, like ArgSig('index'), if you use the __defaults__ pattern.
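For comparison, here is a hedged sketch of the two named-tuple variants under discussion. The names `ArgSigTuple` and `ArgSigClass` are hypothetical. The first variant works at runtime, but, as noted above, mypy rejects the shortened calls; the second uses the class syntax of `typing.NamedTuple`, which requires Python 3.6.1 or newer and declares the defaults in a form type checkers can see.

```python
from typing import NamedTuple, Optional

# Variant 1: functional syntax plus runtime-only defaults.
ArgSigTuple = NamedTuple('ArgSigTuple', [
    ('name', str),
    ('type', Optional[str]),
    ('default', bool),
])
# Defaults apply to the last two fields; a static checker does not see them.
ArgSigTuple.__new__.__defaults__ = (None, False)


# Variant 2: class syntax with declared defaults (Python 3.6.1+).
class ArgSigClass(NamedTuple):
    name: str
    type: Optional[str] = None
    default: bool = False


if __name__ == '__main__':
    # Both calls succeed at runtime.
    print(ArgSigTuple('index'))
    print(ArgSigClass('index'))
```

Since the project still supported older Python versions at the time, according to the thread, the class-syntax variant was not an option, which is part of why plain classes were chosen.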

So I'll convert them to normal classes.

OK

The only problem I see is that, to cover the whole API contract, we would still need to check that the types in the ArgSig objects are properly set, as they are used to add the necessary imports.

Yes, this could be useful only for testing. Anyway, I don't insist on this. You can just define __eq__() instead and keep your current tests.

Create ArgList class instead of NamedTuple and provide defaults for type and default

wiktorn · 2019-01-29T19:55:09Z

I did not add defaults to FunctionSig, as that wouldn't shorten the code much, and requiring all arguments forces the caller to think about whether they are set properly.

mypy/stubgenc.py

ilevkivskyi

OK, I am going to merge this now. There are some minor formatting changes, but I will take care of those.

ilevkivskyi · 2019-01-29T21:46:57Z

@wiktorn Thanks for contributing! I have successfully merged my PR on top of yours.

wiktorn · 2019-01-29T22:24:34Z

Thank you for your patience @ilevkivskyi and help with getting this merged.

wiktorn added 6 commits January 13, 2019 00:18

Parse docstrings to custom objects.

0834954

Support for overloaded function generated by pybind11

7035a97

Fix mypy self-check

cf4de08

Use tokenize to parse function declarations in docstr

38872dc

Use ENDMARKER for python 3.4

860de83

Always check for state

ae08bd3

wiktorn force-pushed the stubgenc_pybind_arg_type_object branch from 1530871 to ae08bd3 Compare January 12, 2019 23:19

gvanrossum reviewed Jan 25, 2019

wiktorn added 2 commits January 26, 2019 12:19

Review fixes

d1aee10

Use infer_sig_from_docstring in infer_arg_sig_from_docstring, add tests

13048b5

Fix PEP8

769577a

gvanrossum approved these changes Jan 26, 2019

ilevkivskyi reviewed Jan 27, 2019

This was referenced Jan 28, 2019

Improve CLI, refactor and document stubgen #6256

Merged

Expose some mypy.stubdoc functions in user-facing documentation #6261

Closed

wiktorn added 2 commits January 28, 2019 23:03

Review fixes

c77fe63

Fixed mypy self-check

824c65a

ilevkivskyi reviewed Jan 29, 2019

Review fixes

382946b

TypedArgSig -> ArgSig TypedFunctionSig -> FunctionSig

Fix docstring style

fb7ad2d

wiktorn added 3 commits January 29, 2019 18:38

Review fixes

eebb0e1

Shorten multiline lists.

Shorten lists

cd02f06

fix PEP8

e0ace0f

Review fixes.

d4be948

Create ArgList class instead of NamedTuple and provide defaults for type and default

Fix self-check

e42097b

ilevkivskyi reviewed Jan 29, 2019

Remove default values from ArgSig

8082b30

ilevkivskyi approved these changes Jan 29, 2019

ilevkivskyi merged commit d6aef70 into python:master Jan 29, 2019

wiktorn mentioned this pull request Feb 3, 2019

Typing information for pyosmium osmcode/pyosmium#59

Closed
