-
-
Notifications
You must be signed in to change notification settings - Fork 18.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Equality comparison raises exception #7830
Comments
most ops with a rhs of a non-Index/ndarray don't make any sense. e.g. what should
even against a list/tuple/ndarray what are they actually checking? that ALL the values match, that some match, that they are the same object? I don't think these are actually used anywhere what are you doing ? |
I agree that these comparison don't make sense, but in that case, the standard behavior would be to return e.g., look at what numpy does for your example:
I agree there is ambiguity over whether such methods should return a single value or an array. |
I had a generic method which could take a scalar argument or list of arguments and I wanted to test for the case where no arguments were passed. The type of the arguments could be anything and the method broke when a I know under normal circumstances array equality checks the equality of elements, but it seems that in this case numpy does simply return a False In [17]: randn(10) == ()
Out[17]: False It turns out it was a badly designed method and so I've improved it and no longer have the problem however perhaps it would be appropriate to assume that any equality comparison which raises an error should return False - i.e. def __eq__(self):
try:
...
except Exception:
return False |
@dhirschfeld care to submit a PR for this? |
Can do, but it will have to wait a couple of weeks as I'll be on holiday 😄 |
@dhirschfeld not 100% sure going the 'catch all exceptions' route is the best bet. If we return a naked bool from any equality method, it leads to a different class of hidden errors (where the expectation is that all pandas ops return a PandasObject that you can keep working with) and pushes into all the ambiguities with the truthiness of an array. E.g., here's some edge-case numpy behavior: In [3]: arr1 = np.array([1, 2, 3, 4, 5, 6])
In [10]: arr2 = np.array([1, 2])
In [11]: arr1 == arr1
Out[11]: array([ True, True, True, True, True, True], dtype=bool)
In [12]: arr1 == arr2
Out[12]: False which leads down the road of: In [13]: bool(arr1 == arr1)
---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
<ipython-input-13-dcde701319c0> in <module>()
----> 1 bool(arr1 == arr1)
ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()
In [14]: bool(arr1 == arr2)
Out[14]: False so maybe we should shoot for "Let's give a better exception" vs. falling back on to False and swallowing a different class of errors. In [18]: class MyObject(object):
....: def __eq__(self, *args, **kwargs):
....: return NotImplemented
....:
In [19]: o1 = MyObject()
In [20]: o2 = MyObject()
In [21]: o1 == o2
Out[21]: False
In [22]: o1 == 1
Out[22]: False |
xref to some discussion on the numpy approach to this issue |
as of:
so this still needs some work |
Test case:
The text was updated successfully, but these errors were encountered: