Implicit conversions to bool + np.bool_ conversion #925

aldanor · 2017-06-26T23:04:39Z

This fixes #922 but is quite hacky / not overly efficient -- mainly because we can't use proper numpy API in cast.h and because bool type caster is a specialisation...

(If an added if clause is a concern, I guess it could be moved over to numpy.h and made opt-in (e.g. add an optional function pointer in type_caster<bool>), but then the user would have to enable it manually in their code. On the bright side, it would be more efficient, without string comparisons etc.)

wjakob · 2017-06-27T00:19:11Z

A more general solution would be to invoke a Python API function that will call the instance's __bool__ (3.x) or __nonzero__ (2.x) function. For some reason PyObject_Bool does not exist, so we may have to implement that ourselves.

wjakob · 2017-06-27T00:19:33Z

(the implicit conversion should only take place when convert == true)

aldanor · 2017-06-27T00:31:09Z

A more general solution would be to invoke a Python API function that will call the instance's bool (3.x) or nonzero (2.x) function. For some reason PyObject_Bool does not exist, so we may have to implement that ourselves.

Indeed; I forgot about py2/3 differences. Another option is to just do a bool(obj), I guess?

(the implicit conversion should only take place when convert == true)

Should this really be counted as an implicit conversion?

wjakob · 2017-06-27T00:42:12Z

Indeed; I forgot about py2/3 differences. Another option is to just do a bool(obj), I guess?

Right -- the question is: is there a C API binding to do exactly this.

(the implicit conversion should only take place when convert == true)

Should this really be counted as an implicit conversion?

Yes. For instance, bool(123) is perfectly valid, and we would not want this conversion to be preferred to that of another overload that accepts an integer. With the new two-phase overload traversal, the bool cast would only be considered in the second round.

dean0x7d · 2017-06-27T08:43:01Z

PyObject_IsTrue() should do the trick.

wjakob · 2017-06-27T08:55:42Z

Excellent -- that's the one.

aldanor · 2017-06-27T12:02:08Z

Ok, a few changes:

Use PyObject_IsTrue() as suggested by @dean0x7d
Check for convert arg
Check for .dtype.kind which can be cast to char, one allocation less

I was wondering whether it's worth to cache PyObjectType(src.ptr()) somewhere in a static var after the first np.bool_ is passed in, so that in subsequent calls it's just an isinstance<> check, as opposed to getattr/string checks?

wjakob · 2017-06-27T12:08:03Z

I was thinking more along the following lines:

In common.h, add:

// In Python 3.x block, Line 138+
#define PYBIND11_NONZERO "__bool__"

// In Python 2.x block: Line 158+
#define PYBIND11_NONZERO "__nonzero__"

Then, in the caster, use

....
else if (convert && hasattr(src, PYBIND11_NONZERO)) {
    value = (bool) PyObject_IsTrue(src.ptr());
    return true;
}
....

dean0x7d · 2017-06-27T12:15:19Z

Note that PyObject_IsTrue can fail, e.g. bool(np.array([...])) raises an exception.

if (convert) {
    auto result = PyObject_IsTrue(src.ptr());
    if (result == -1)  // this *should* also cover missing `__bool__`/`__nonzero__`, 
        return false;  // but adding a proper test to make sure would be good
    value = result == 1;
    return true;
}

aldanor · 2017-06-27T12:18:48Z

Oh - I've misread your previous message then. I was only thinking about numpy.bool_.

Do we want to have global implicit bool conversions? (which looks a bit scary to me as most everything can be converted to a bool)

If we do, then __bool__ attr check is not enough; for instance:

>>> hasattr([], '__bool__')
False

(source for PyObject_IsTrue())

Also, for numpy.bool_, if we make it a special case, I'd rather not check for convert = true, as it is a bool (C bool).

wjakob · 2017-06-27T12:19:00Z

this should also cover missing __bool__/__nonzero__,

I am not sure that is the case:

 % python
Python 3.5.2 |Anaconda 4.2.0 (x86_64)| (default, Jul  2 2016, 17:52:12)
[GCC 4.2.1 Compatible Apple LLVM 4.2 (clang-425.0.28)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> class A:
...     pass
>>> A()
<__main__.A object at 0x1017b2a90>
>>> A().__bool__()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: 'A' object has no attribute '__bool__'
>>> bool(A())
True

aldanor · 2017-06-27T12:20:16Z

Comments crossed with @dean0x7d :) Yea, if we want a generic implicit bool conversion, I'd do it like in #925 (comment)

aldanor · 2017-06-27T12:22:39Z

>>> bool(A())
True

Also

>>> bool(object())
True

🐼

dean0x7d · 2017-06-27T12:42:29Z

@wjakob
I am not sure that is the case:

Yeah, I overlooked that objects can sometimes be converted to bool even without the magic method.

@aldanor
Do we want to have global implicit bool conversions? (which looks a bit scary to me as most everything can be converted to a bool)

I think that going completely broad should be fine (i.e. anything that isn't an error for PyObject_IsTrue):

Overload resolution will not be affected thanks to the new two-phase system.
Python happily coerces most things to bool:

def foo(arg):
    if arg:
        print("yes")

>>> foo(object())
yes

I think it's fine for pybind11 to have the same behavior, regardless of type or magic methods. Besides, it's simpler not to maintain a special pybind11-specific whitelist of what can be converted to bool when Python already has one in PyObject_IsTrue.

aldanor · 2017-06-27T13:06:20Z

Btw, looking at PyObject_IsTrue() source again (here's a blog post on it), we can
do this: basically, replace "return 1" fallback with "return -1". This way, converting A() in @wjakob's example would fail; converting object() would also fail. It would only work if either __bool__ / __nonzero__ are defined, or if it's a sequence / mapping, in which case it checks for length, or if it's a None.

(... or we can just use PyObject_IsTrue() as is.)

aldanor · 2017-06-27T16:32:19Z

My suggestion is in 5a9b757, and an example of how it works in 1a820a4.

This will handle __nonzero / __bool__, sequence/mapping types, None (all of the above under implicit conversion); it will also handle numpy.bool_ with noconvert; everything else would be rejected (e.g. object()).

Does this look reasonable?

dean0x7d · 2017-06-28T09:20:01Z

Given the separate paths needed for Python 2/3 and it looks like a workaround for PyPy, I feel like the simple PyObject_IsTrue() would be nicer, unless there is a good reason not to.

Tests: the ones that aren't related to numpy should be moved into test_builtin_casters.

aldanor · 2017-06-28T11:33:38Z

Heh, PyPy spoilt my plans a bit, yea it looks like it needs a workaround...

I personally feel like from the user's standpoint using PyObject_IsTrue() here actually doesn't follow the path of least surprise (did you know bool(object()) is True? I didn't, and had to look at CPython code to confirm...). Plus, given that implicit conversions are enabled by default, a bool argument will happily accept any and every Python object.

That being said, from the dev standpoint, using PyObject_IsTrue() is indeed the most logical and least messy, and would work the same way on py2/py3/pypy. If we want to do it this way -- I'll update the code + tests. Anyone to confirm?

// Re: tests: ditto, I'll move them before squashing

jagerman · 2017-06-28T12:48:01Z

I don't like the use of PyObject_IsTrue for this. I wouldn't expect a bool argument to accept "anything that can be used in an if statement", but rather to accept anything that looks like a boolean value (or emulates it with __bool__). PyObject_IsTrue here is a bit like forcecast: it feels like too much for default behaviour.

aldanor · 2017-06-28T14:57:28Z

Does anyone have an idea on how to fix the pypy issue? :)

jagerman · 2017-06-28T15:25:18Z

PyPy's implementation of PyObject_IsTrue is:

if a bool type, return it
otherwise call this: https://bitbucket.org/pypy/pypy/src/29f3c769e610d7416b7c9022fd814c4efd3b05c9/pypy/objspace/descroperation.py#descroperation.py-230

jagerman · 2017-06-28T15:36:43Z

include/pybind11/cast.h

+                value = res != 0;
+                return true;
+            }
+        } else if (hasattr(src, "dtype")) {


I think this needs to go before the else if (convert), otherwise it's not going to run if the arguments get into the second (convert-allowed) pass. That second pass can be triggered by some other argument that needs conversion, so anything that runs with convert = false needs to also work with convert = true (even if, as in this case, conversion isn't needed for this argument).

dean0x7d · 2017-06-29T12:15:10Z

OK, I have no objections if PyObject_IsTrue is deemed too broad. If the convertible types are being restricted, then I'd vote for @wjakob's solution from #925 (comment): it's a simplification and also more strict (no sequence or mapping types allowed). As a bonus, it doesn't require any workarounds for PyPy.

aldanor · 2017-06-29T13:12:53Z

OK, I have no objections if PyObject_IsTrue is deemed too broad. If the convertible types are being restricted, then I'd vote for @wjakob's solution from #925 (comment): it's a simplification and also more strict (no sequence or mapping types allowed). As a bonus, it doesn't require any workarounds for PyPy.

Re: hasattr version -- maybe it could be done that way on PyPy for the lack of better options-- in CPython it would be much, much slower than checking for tp_as_number->nb_bool. I'll try to play around with it today to see what works.

wjakob · 2017-07-10T09:00:08Z

This still seems really complicated to me. Can’t we strip out the NumPy-specific bits (they should be handled by the default case that calls PyObject_IsTrue). The code below AFAIK should not have the "True for arbitrary objects" issue because it only runs when PYBIND11_NONZERO is an attribute.

if (!src) return false;
else if (src.ptr() == Py_True) { value = true; return true; }
else if (src.ptr() == Py_False) { value = false; return true; }
else if (convert && hasattr(src, PYBIND11_NONZERO)) {
    int res = PyObject_IsTrue(src.ptr());
    if (res == 0 || res == 1)
        return (bool) res;
}
return false;

jagerman · 2017-07-10T15:39:44Z

include/pybind11/common.h

@@ -153,8 +153,10 @@
 #define PYBIND11_SLICE_OBJECT PyObject
 #define PYBIND11_FROM_STRING PyUnicode_FromString
 #define PYBIND11_STR_TYPE ::pybind11::str
+#define PYBIND11_NONZERO "__bool__"


Could we rename this to BOOL_ATTR or something along those lines to stick to the python-3-derived name here rather than python-2-derived, like we do with the above (e.g. "BYTES").

wjakob · 2017-07-10T16:10:54Z

Calling any kind of pybind11-bound function with scalar data (e.g. booleans instead of arrays of booleans) that perhaps even require implicit conversions is not going to be that fast, so I don't know useful performance tuning is at that level.

aldanor · 2017-07-10T16:41:51Z

@jagerman I've integrated your comments, but that's probably as simple as it's going to get... CPython optimization part is now just extra 4 lines of code which isn't too bad -- the rest has to be there anyway, including the none check at the start.

jagerman · 2017-07-10T16:48:36Z

include/pybind11/cast.h

+            }
+            return false;
+        }
+        if (hasattr(src, "dtype")) {


Does the above (i.e. the else if (convert) { ... } block) catch a numpy bool under convert == true? If so, this could be an else if to save the attribute lookup during a convert load when we already know it failed.

Yes, indeed. Fixed.

By the way, I think an even faster way would be to strcmp the type name (which should cost nothing to extract) to 'bool_' and only then check the dtype, etc. But not sure it's worth it here.

wjakob · 2017-07-10T22:13:21Z

include/pybind11/cast.h

+            }
+            return false;
+        }
+        else if (hasattr(src, "dtype")) {


What happens when this branch is completely removed? Can't we have the second (converting) pass handle NumPy booleans (which is already optimized to avoid hasattr)?

I think the intention is to get this into the no-convert pass, so that it can win the overload resolution if the function is overloaded with some other argument type (e.g. int).

Yep, correct.

// as I've noted above, I think the second hasattr() can actually be avoided in np.bool_ cases via something cheap like strcmp(Py_TYPE(src.ptr())->tp_name, "bool_") prior to the dtype hasattr check (and maybe the following dtype.kind check can then be thrown out, because what else can it be but the np.bool_...)

strcmp(tp_name, "bool_") also looks like a pretty nice simplification. In that case the separate np.bool_ logic could be removed completely and just leave:

else if (convert || strcmp(tp_name, "bool_") == 0) { ... }

aldanor · 2017-07-23T11:31:44Z

Ok, I've reduced the numpy.bool_ check to just a strcmp() on the type name, that seems to work.

jagerman · 2017-07-23T15:03:03Z

That simplification is nice. Merged!

aldanor force-pushed the feature/numpy-bool branch 2 times, most recently from b29be01 to 63e57e8 Compare June 27, 2017 11:45

aldanor changed the title ~~Support np.bool_ in type_caster<bool>~~ Implicit conversions to bool + np.bool_ conversion Jun 27, 2017

aldanor force-pushed the feature/numpy-bool branch 2 times, most recently from 93963e6 to 047a11b Compare June 28, 2017 08:54

jagerman reviewed Jun 28, 2017

View reviewed changes

jagerman reviewed Jul 10, 2017

View reviewed changes

wjakob reviewed Jul 10, 2017

View reviewed changes

jagerman mentioned this pull request Jul 21, 2017

Make a 2.2 release #953

Closed

6 tasks

jagerman added this to the v2.2 milestone Jul 21, 2017

aldanor added 17 commits July 23, 2017 12:17

Add support for np.bool_ in type_caster<bool>

9395c2d

Add a test for np.bool_

4c4edce

Support generic implicit conversion to bool

36b4b68

Add tests for generic bool conversions

017e2f3

(Make flake8 happy)

a5cec63

Add a few more bool conversion tests

839cc2d

Add PYBIND11_NONZERO constant (__bool__ magic)

5de232f

Bool conversion: run np.bool_ in the second pass

718641d

Bool conversion: ignore sequences and mappings

8ca10f3

Bool conversion: handle PyPy

2f811f1

(Update the tests)

a5ddfbb

(Update more tests)

2d4182c

Move bool caster tests to where they belong

09d99db

(Fix a compilation warning on PyPy)

15c3000

Bool conversion: simplifications and comments

9b4f569

Bool conversion: only special-case np.bool_ in 1st pass

31401f8

Simplify numpy.bool_ detection (use type name only)

025ba8a

aldanor force-pushed the feature/numpy-bool branch from a919783 to 025ba8a Compare July 23, 2017 11:17

jagerman merged commit e07f758 into pybind:master Jul 23, 2017

rwgk mentioned this pull request Feb 9, 2023

FWD pybind11 google/pybind11clif#925

Closed

adamreichold mentioned this pull request Dec 10, 2023

Try harder by looking for a __bool__ magic method when extracing bool values from Python objects. PyO3/pyo3#3638

Merged

Implicit conversions to bool + np.bool_ conversion #925

Implicit conversions to bool + np.bool_ conversion #925

Conversation

aldanor commented Jun 26, 2017 • edited Loading

wjakob commented Jun 27, 2017

wjakob commented Jun 27, 2017

aldanor commented Jun 27, 2017

wjakob commented Jun 27, 2017 • edited Loading

dean0x7d commented Jun 27, 2017

wjakob commented Jun 27, 2017

aldanor commented Jun 27, 2017 • edited Loading

wjakob commented Jun 27, 2017 • edited Loading

dean0x7d commented Jun 27, 2017

aldanor commented Jun 27, 2017

wjakob commented Jun 27, 2017

aldanor commented Jun 27, 2017

aldanor commented Jun 27, 2017

dean0x7d commented Jun 27, 2017 • edited Loading

aldanor commented Jun 27, 2017 • edited Loading

aldanor commented Jun 27, 2017 • edited Loading

dean0x7d commented Jun 28, 2017

aldanor commented Jun 28, 2017 • edited Loading

jagerman commented Jun 28, 2017

aldanor commented Jun 28, 2017

jagerman commented Jun 28, 2017 • edited Loading

jagerman Jun 28, 2017

Choose a reason for hiding this comment

dean0x7d commented Jun 29, 2017

aldanor commented Jun 29, 2017

wjakob commented Jul 10, 2017 • edited Loading

jagerman Jul 10, 2017

Choose a reason for hiding this comment

wjakob commented Jul 10, 2017

aldanor commented Jul 10, 2017

jagerman Jul 10, 2017 • edited Loading

Choose a reason for hiding this comment

aldanor Jul 10, 2017 • edited Loading

Choose a reason for hiding this comment

wjakob Jul 10, 2017

Choose a reason for hiding this comment

jagerman Jul 10, 2017

Choose a reason for hiding this comment

aldanor Jul 10, 2017 • edited Loading

Choose a reason for hiding this comment

dean0x7d Jul 23, 2017

Choose a reason for hiding this comment

aldanor commented Jul 23, 2017

jagerman commented Jul 23, 2017

aldanor commented Jun 26, 2017 •

edited

Loading

wjakob commented Jun 27, 2017 •

edited

Loading

aldanor commented Jun 27, 2017 •

edited

Loading

wjakob commented Jun 27, 2017 •

edited

Loading

dean0x7d commented Jun 27, 2017 •

edited

Loading

aldanor commented Jun 27, 2017 •

edited

Loading

aldanor commented Jun 27, 2017 •

edited

Loading

aldanor commented Jun 28, 2017 •

edited

Loading

jagerman commented Jun 28, 2017 •

edited

Loading

wjakob commented Jul 10, 2017 •

edited

Loading

jagerman Jul 10, 2017 •

edited

Loading

aldanor Jul 10, 2017 •

edited

Loading

aldanor Jul 10, 2017 •

edited

Loading