Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Override deduced Base class when defining Derived methods #855

Merged
merged 1 commit into from
Jul 3, 2017

Conversation

jagerman
Copy link
Member

@jagerman jagerman commented May 16, 2017

When defining method from a member function pointer (e.g. .def("f", &Derived::f)) we run into a problem if &Derived::f is actually implemented in some base class Base when Base isn't pybind-registered.

This happens because the class type is deduced, which then becomes a lambda with first argument this deduced type. For a base class implementation, the deduced type is Base, not Derived, and so we generate and registered an overload which takes a Base * as first argument. Trying to call this fails if Base isn't registered (e.g. because it's an implementation detail class that isn't intended to be exposed to Python) because the type caster for an unregistered type always fails.

This commit extends the pybind11::is_method annotation into a templated annotation containing the class being registered, which we can then extract to override the first argument to the derived type when attempting to register a base class method for a derived class.

Fixes #854 , #910

@bmerry
Copy link
Contributor

bmerry commented May 17, 2017

This doesn't appear to work for def_property, I'm guessing because the is_method attribute is only applied later by detail::process_attributes rather than at the time the cpp_function is constructed.

@jagerman
Copy link
Member Author

Should work for def_property now.

return def_property(name,
cpp_function(fget, is_method<type>(*this)), cpp_function(fset, is_method<type>(*this)),
return_value_policy::reference_internal, extra...);
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You've ended up with 4 versions of def_property, for the four combination of {cpp_function, generic} for fget and for fset. Possibly the code would be simpler given a helper template function that takes a getter/setter and a (list of) default call policy, and wraps it into a cpp_function with is_method if it isn't already a cpp_function, or returns the provided cpp_function as-is if it is. I think that would allow for only a single version of def_property, and probably also simplifies def_property_readonly.

By the way, is there any particular reason that def and def_static take their function via perfect forwarding while def_property takes it by const reference?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I know; I don't like it that much. The alternative is adding a move constructor that takes and ignores extra arguments (i.e. template <typename... Extra> cpp_function(cpp_function &&f, const Extra &...)), but I like that approach even less (mainly because it's too late to apply any of the Extra arguments).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't see the extra arguments as that problematic, but I concede that it's a matter of taste.

However, I think you might be able to delete the cpp_function &fget, const Setter &fset overload that you added (L1098) if you change the fully generic overload above to only wrap the setter and forward the getter as-is to be wrapped (if necessary) at the next layer.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not seeing how: if either of the cpp_function, Setter or Getter, cpp_function versions aren't there, then a call with one cpp_function and one lambda is going to end up at the fully-generic Getter, Setter template, which doesn't handle cpp_function arguments.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's why you also need to change the fully-generic template to only wrap the setter. Let's say that c, d are instances of cpp_function, C=cpp_function, G=Getter (template parameter), S=setter (template parameter) and p, q are non-cpp_function getter/setter. Then the chain of calls for each of the cases is

  1. p, q: <G, S> -> <G, C> -> <C, C>
  2. c, q: <G, S> -> <C, C>
  3. p, c: <G, C> -> <C, C>
  4. c, c: <C, C>
    I don't know if that notation makes sense. If I've left you just as unclear, let me know and I can send you a PR against your branch.

.def("increase_value", &RegisteredDerived::increase_value)
.def_readwrite("rw_value", &RegisteredDerived::rw_value)
.def_readonly("ro_value", &RegisteredDerived::ro_value)
.def_property("rw_value_prop", &RegisteredDerived::get_int, &RegisteredDerived::set_int)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing a test for def_property_readonly (I don't think the other tests provide full coverage: def_readonly calls def_property_readonly, but passes it a cpp_function).

@wjakob
Copy link
Member

wjakob commented May 24, 2017

I'm on the fence here. I think it would be reasonable to expect the user to create a (tiny) binding for the base class rather than making the pybind11 implementation more complex.

@bmerry
Copy link
Contributor

bmerry commented May 24, 2017

My worry is that which class in a hierarchy defines a member function is not normally considered part of an API. The author of the C++ class might push some members into a base class (which may be an implementation detail class) without believing that it will break API compatibility, but it will break the pybind11 bindings (and in a way that is only apparent at runtime with a cryptic error message).

It's not entirely related to this PR, but as a workaround for this issue I've written a macro that takes the pain out of wrapping the PTMF: see here. If something like that was added to pybind11 it would at least be less onerous to wrap up every PTMF for robustness.

@jagerman
Copy link
Member Author

A simpler approach would be to fix the error message with an error about Base being an unregistered class. That, however, would require letting the argument loader communicate failures back to the dispatcher, as I roughly outlined in #864 (comment)

Right now the biggest issue in call failures is that we have no way to communicate why a call failed. Sometimes you can look at the "invoked with" and see that it doesn't match, but having each overload communicate a failure reason would be pretty nice.

@bmerry
Copy link
Contributor

bmerry commented May 24, 2017

As an example, consider that GCC derives most of its STL classes from base classes with names like _Vector_base. From a quick look I haven't found an example where the public member functions are actually defined in the base class, but that doesn't mean that some other compiler won't do so - and if it did then code in stl_bind.h that binds pointers to members would break.

More helpful error messages would definitely be better than not having them, but it doesn't address the fragility/portability concerns.

@YannickJadoul
Copy link
Collaborator

What would actually be the use case of nót having the class itself as first argument of a method? Is there a situation where that's not a mistake?
I seem to remember that Boost.Python enforces this by default when registering a method to a class, not just when it's an unregistered base, even. (Just wondering if that might approach might simplify the proposed solution to a point where the complexity is acceptable.)

@jagerman
Copy link
Member Author

I've simplified this implementation now by adding a cpp_function::method<D> that works just like the constructors except in the special case where it's called with a Base member function pointer: in that case, it generates a lambda taking the derived type as self.

@jagerman jagerman force-pushed the iss854 branch 3 times, most recently from 7363268 to 73cddbc Compare June 24, 2017 06:21
Copy link
Contributor

@bmerry bmerry left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like a nice simplification.

/// Like is_base_of, but requires a strict base (i.e. `is_strict_base_of<T, T>::value == false`,
/// unlike `std::is_base_of`)
template <typename Base, typename Derived> using is_strict_base_of = bool_constant<
std::is_base_of<Base, Derived>::value && !std::is_base_of<Derived, Base>::value>;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be clearer/more self-documenting to say ... && !std::is_same<Derived, Base>::value.

template <typename Derived, typename Return, typename Class, typename... Arg>
struct method_adaptor_impl<Derived, Return (Class::*)(Arg...), enable_if_t<is_strict_base_of<Class, Derived>::value>> {
template <typename... Extra> static cpp_function adapt(Return (Class::*f)(Arg...), const Extra&... extra) {
return {[f](Derived *c, Arg... args) -> Return { return (c->*f)(args...); }, extra...};
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Any particular reason that the specialisations use const & for the extras while the generic implementation uses perfect forwarding?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's no advantage to forwarding extras given the way they are used. The generic one uses perfect forwarding because there could be an potential advantage in forwarding a stateful lambda (the first argument), and it's simpler to just forward everything.

@dean0x7d
Copy link
Member

Hope you don't mind me pushing directly here, but your simplification gave me an idea to take it further still. I wasn't sure about it, had to write it out completely and test it. Looks like it works.

Now, method_adaptor just does a simple cast from Return (Source::*)(Args...) to Return (Target::*)(Args...). Because it's an implicit cast, the compiler will only allow compatible Source -> Target member function conversions, like Base -> Derived, but Class -> UnrelatedClass will result in a compile-time error.

@jagerman
Copy link
Member Author

Very nice. 👍

@jagerman jagerman force-pushed the iss854 branch 2 times, most recently from 218bcb3 to 2f30767 Compare June 28, 2017 14:46
@jagerman
Copy link
Member Author

Rebased and squashed. Any other comments before committing?


template <typename Derived, typename Return, typename Class, typename... Args,
typename Adapted = Return (Derived::*)(Args...)>
Adapted method_adaptor(Return (Class::*pmf)(Args...)) { return pmf; }
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not clear why Adapted is a template parameter. It can't be explicitly specified (because Args will swallow all trailing arguments) nor inferred (because it is only used in the return type), so it seems to exist solely to assign a name to the return type. While the C++03 syntax for a function returning a function pointer is horrific, would

template <typename Derived, typename Return, typename Class, typename... Args>
auto method_adaptor(Return (Class::*pmf)(Args...)) -> Return (Derived::*)(Args...) { return pmf; }

be clearer? Seems to pass on my system (GCC 5.4), haven't tried running it through Travis and Appveyor or applying it to the const version.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(LGTM apart from this one comment)

Copy link
Member

@dean0x7d dean0x7d Jun 28, 2017

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Adapted is only there to avoid the horror of the regular function pointer return type syntax. But your trailing return type solution looks much nicer!

When defining method from a member function pointer (e.g. `.def("f",
&Derived::f)`) we run into a problem if `&Derived::f` is actually
implemented in some base class `Base` when `Base` isn't
pybind-registered.

This happens because the class type is deduced from the member function
pointer, which then becomes a lambda with first argument this deduced
type.  For a base class implementation, the deduced type is `Base`, not
`Derived`, and so we generate and registered an overload which takes a
`Base *` as first argument.  Trying to call this fails if `Base` isn't
registered (e.g.  because it's an implementation detail class that isn't
intended to be exposed to Python) because the type caster for an
unregistered type always fails.

This commit adds a `method_adaptor` function that rebinds a member
function to a derived type member function and otherwise (i.e. regular
functions/lambda) leaves the argument as-is.  This is now used for class
definitions so that they are bound with type being registered rather
than a potential base type.

A closely related fix in this commit is to similarly update the lambdas
used for `def_readwrite` (and related) to bind to the class type being
registered rather than the deduced type so that registering a property
that resolves to a base class member similarly generates a usable
function.

Fixes pybind#854, pybind#910.

Co-Authored-By: Dean Moldovan <[email protected]>
@jagerman jagerman merged commit 23bf894 into pybind:master Jul 3, 2017
@dean0x7d dean0x7d modified the milestone: v2.2 Aug 13, 2017
jagerman added a commit to jagerman/pybind11 that referenced this pull request Oct 5, 2017
When using `method_adaptor` (usually implicitly via a `cl.def("f",
&D::f)`) a compilation failure results if `f` is actually a method of
an inaccessible base class made public via `using`, such as:

    class B { public: void f() {} };
    class D : private B { public: using B::f; };

pybind deduces `&D::f` as a `B` member function pointer.  Since the base
class is inaccessible, the cast in `method_adaptor` from a base class
member function pointer to derived class member function pointer isn't
valid, and a cast failure results.

This was sort of a regression in 2.2, which introduced `method_adaptor`
to do the expected thing when the base class *is* accessible.  It wasn't
actually something that *worked* in 2.1, though: you wouldn't get a
compile-time failure, but the method was not callable (because the `D *`
couldn't be cast to a `B *` because of the access restriction).  As a
result, you'd simply get a run-time failure if you ever tried to call
the function (this is what pybind#855 fixed).

Thus the change in 2.2 essentially promoted a run-time failure to a
compile-time failure, so isn't really a regression.

This commit simply adds a `static_assert` with an accessible-base-class
check so that, rather than just a cryptic cast failure, you get
something more informative (along with a suggestion for a workaround).

The workaround is to use a lambda, e.g.:

    class Derived : private Base {
    public:
        using Base::f;
    };

    // In binding code:
    //cl.def("f", &Derived::f); // fails: &Derived::f is actually a base
                                // class member function pointer
    cl.def("f", [](Derived &self) { return self.f(); });

This is a bit of a nuissance (especially if there are a bunch of
arguments to forward), but I don't really see another solution.

Fixes pybind#1124
jagerman added a commit that referenced this pull request Oct 12, 2017
When using `method_adaptor` (usually implicitly via a `cl.def("f",
&D::f)`) a compilation failure results if `f` is actually a method of
an inaccessible base class made public via `using`, such as:

    class B { public: void f() {} };
    class D : private B { public: using B::f; };

pybind deduces `&D::f` as a `B` member function pointer.  Since the base
class is inaccessible, the cast in `method_adaptor` from a base class
member function pointer to derived class member function pointer isn't
valid, and a cast failure results.

This was sort of a regression in 2.2, which introduced `method_adaptor`
to do the expected thing when the base class *is* accessible.  It wasn't
actually something that *worked* in 2.1, though: you wouldn't get a
compile-time failure, but the method was not callable (because the `D *`
couldn't be cast to a `B *` because of the access restriction).  As a
result, you'd simply get a run-time failure if you ever tried to call
the function (this is what #855 fixed).

Thus the change in 2.2 essentially promoted a run-time failure to a
compile-time failure, so isn't really a regression.

This commit simply adds a `static_assert` with an accessible-base-class
check so that, rather than just a cryptic cast failure, you get
something more informative (along with a suggestion for a workaround).

The workaround is to use a lambda, e.g.:

    class Derived : private Base {
    public:
        using Base::f;
    };

    // In binding code:
    //cl.def("f", &Derived::f); // fails: &Derived::f is actually a base
                                // class member function pointer
    cl.def("f", [](Derived &self) { return self.f(); });

This is a bit of a nuissance (especially if there are a bunch of
arguments to forward), but I don't really see another solution.

Fixes #1124
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants