-
Notifications
You must be signed in to change notification settings - Fork 608
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: Inconsistent behaviour for aggregate expressions after joins #7288
Comments
Thanks @binste! Yeah, this is currently a rough edge. We are pretty deep into exploring ideas to resolve this "once and for all", hopefully in a way that preserves existing column deduplication behavior and generates the right code whether or not deferred is used. |
This will be addressed by #7580. It'll be a few releases before it lands, but it's well under way and things are looking good! |
Very exciting, thanks for working on it and for the update! |
Fixed in |
What happened?
It was mentioned before in a few issues such as #5537 that there are some rough edges around joins. One that tripped me up just now happens in aggregated expressions after joins. Taking the example from the join + aggregation section in the docs:
gives me the following as expected
However, when I slightly change this example by renaming
t2.key3
tot2.key1
:and do exactly the same but now join on
key
:I get a cross-join:
I can fix it by using the deferred object in the aggregation expression:
Which removes the cross-join:
As mentioned in #5537, maybe it's worth to disable this behaviour completely?
If you would not want to do this, I'd recommend to no longer advertise it in the documentation where it currently states:
What version of ibis are you using?
7.0.0
What backend(s) are you using, if any?
No response
Relevant log output
No response
Code of Conduct
The text was updated successfully, but these errors were encountered: