Dot-broadcasting for short-circuiting ops .&& and .|| #39594

mbauman · 2021-02-10T04:35:22Z

I have long wanted a proper fix for issue #5187. It was the very first Julia issue I filed.
This is a shot at such a fix. This PR:

Enables parsing for .&& and .||. They are parsed into syntactic Expr(:.&&, ...) expressions at the same precedence as their respective && and ||:
```
julia> Meta.show_sexpr(:(a .&& b))
(:.&&, :a, :b)
```
Unlike all other dotted operators .op (like .+), the op-alone part (var"&&") is not an exported name from Base. As such, this effectively lowers to broadcasted((x,y)->x && y, ...), but instead of using an anonymous function I've named it Base.andand and Base.oror:
```
julia> Meta.@lower a .&& b
:($(Expr(:thunk, CodeInfo(
    @ none within `top-level scope'
1 ─ %1 = Base.broadcasted(Base.andand, a, b)
│   %2 = Base.materialize(%1)
└──      return %2
))))
```
I've used a named function to enable short-circuiting behavior within the broadcast kernel itself. In the case that the second argument is a part of the same fused broadcast kernel, it will only evaluate if required:
```
julia> mutable struct F5187; x; end

julia> (f::F5187)(x) = (f.x += x)

julia> (iseven.(1:4) .|| (F5187(0)).(ones(4)))
4-element Vector{Real}:
    1.0
 true
    2.0
 true
```
~~This also enables support for standalone .&& and .|| as BroadcastFunctions, but of course they are not able to short-circuit when used as functions themselves.~~

Request for feedback

A good bikeshed could be had over the names themselves; the implemented andand and oror are cutesy placeholders. We could actually use var"&&" and var"||" for these names — and it'd almost simplify the implementation, but in order to do so we'd have to actually export them from Base, too. It seems like it might just be confusing.
The names are fine
Is this the implementation we want? This uses Broadcast.flatten to create the lazy function needed for short-circuiting the second argument. This could alternatively be done directly within the parser — perhaps by resurrecting the old 0.5 broadcast parsing behavior. Someone else would have to do that work if they wanted it.
Yes
Do we want to support the stand-alone .&& and .|| BroadcastFunctions if they cannot possibly short circuit?
Maybe, but deferred
Should dotted chained comparisons like 1 .< A .< 2 produce (1 .< A) .&& (A .< 2)? It currently uses .&. I'm leaning towards keeping the status quo, even though this would represent a departure from the scalar lowering. I'd bet that changing this would be breaking for folks that are effectively working around Lower chained inequalities in an extendable way #39104 (knowingly or not).
Maybe, but breaking and deferred

(See also #26792 which made space for this feature.)

@lower

I have long wanted a proper fix for issue #5187. It was the very first Julia issue I filed. This is a shot at such a fix. This PR: * Enables parsing for `.&&` and `.||`. They are parsed into `Expr(:call, :.&&, ...)` expressions at the same precedence as their respective `&&` and `||`: ```julia-repl julia> Meta.show_sexpr(:(a .&& b)) (:call, :.&&, :a, :b) ``` * Unlike all other dotted operators `.op` (like `.+`), the `op`-alone part (`var"&&"`) is not an exported name from Base. As such, this effectively lowers to `broadcasted((x,y)->x && y, ...)`, but instead of using an anonymous function I've named it `Base.andand` and `Base.oror`: ```julia-repl julia> Meta.@lower a .&& b :($(Expr(:thunk, CodeInfo( @ none within `top-level scope' 1 ─ %1 = Base.broadcasted(Base.andand, a, b) │ %2 = Base.materialize(%1) └── return %2 )))) ``` * I've used a named function to enable short-circuiting behavior _within the broadcast kernel itself_. In the case that the second argument is a part of the same fused broadcast kernel, it will only evaluate if required: ```julia-repl julia> mutable struct F5187; x; end julia> (f::F5187)(x) = (f.x += x) julia> (iseven.(1:4) .|| (F5187(0)).(ones(4))) 4-element Vector{Real}: 1.0 true 2.0 true ``` * This also enables support for standalone `.&&` and `.||` as `BroadcastFunction`s, but of course they are not able to short-circuit when used as functions themselves. Request for feedback -------------------- * [ ] A good bikeshed could be had over the names themselves. We could actually use `var"&&"` and `var"||"` for these names — and it'd _almost_ simplify the implementation, but in order to do so we'd have to actually _export_ them from Base, too. It seems like it might just be confusing. * [ ] Is this the implementation we want? This uses `Broadcast.flatten` to create the lazy function needed for short-circuiting the second argument. This could alternatively be done directly within the parser — perhaps by resurrecting the old 0.5 broadcast parsing behavior. Someone else would have to do that work if they wanted it. * [ ] Do we want to support the stand-alone `.&&` and `.||` `BroadcastFunction`s if they cannot possibly short circuit?

NEWS.md

MasonProtter · 2021-02-10T05:05:11Z

Forgive me if I'm misunderstanding, but I think this doesn't correctly preserve the semantics of && / ||. It seems to basically just be & and | instead.

For instance:

julia> @eval Base.Broadcast begin

       andand(a, b) = a && b
       function broadcasted(::typeof(andand), a, bc::Broadcasted)
           bcf = flatten(bc)
           broadcasted((a, args...) -> a && bcf.f(args...), a, bcf.args...)
       end
       oror(a, b) = a || b
       function broadcasted(::typeof(oror), a, bc::Broadcasted)
           bcf = flatten(bc)
           broadcasted((a, args...) -> a || bcf.f(args...), a, bcf.args...)
       end

       end
broadcasted (generic function with 56 methods)

julia> using Base.Broadcast: oror

julia> oror.(iseven.(1:4), (F5187(0)).(ones(4)))
4-element Vector{Real}:
    1.0
 true
    2.0
 true

julia> oror.(isfinite.(1:4), throw("nonfinite number!"))
ERROR: "nonfinite number!"
Stacktrace:
 [1] top-level scope
   @ REPL[8]:1

That is, this appears to not really short circuit since the both the function arguments get evaluated.

Edit:

Okay, so it will work if I also broadcast the throw:

julia> oror.(isfinite.(1:4), throw.("nonfinite number!"))
4-element BitVector:
 1
 1
 1
 1

I'm not sure how to feel about this. Is there a way to make the first one work without throw.(...)? I can kinda see the logic of demanding the . here as it basically says "fuse the call to throw with the loop".

So it seems I was just misunderstanding and this was intended design all along.

simeonschaub

Overall, this makes sense to me, but I am a bit worried about how this will play together with some custom broadcast implementations. Not really sure how to test that though.

src/julia-parser.scm

base/broadcast.jl

simeonschaub · 2021-02-10T09:45:37Z

@MasonProtter Non-broadcasted calls are always evaluated before broadcasted ones and I don't really see any way around this. To me at least, this behavior for throw makes sense.

simeonschaub · 2021-02-10T15:07:14Z

What do you think about using callable objects AndAnd() and OrOr() instead of regular functions, since we might not want e.g. andand isa Function?

mbauman · 2021-02-10T15:29:48Z

That's also what I did at first... but for their BroadcastFunction forms to work they need to implement (a,b) -> a && b... and if they implement that, well, aren't they functions? That's really the crux of the question — do we want to them to be callable at all? If they're callable, I think they might as well just be functions.

simeonschaub · 2021-02-10T15:34:50Z

Yeah, perhaps we don't even want to make them callable. It seems to me like anywhere those are called directly is probably likely to be a bug.

MasonProtter · 2021-02-10T17:40:19Z

Non-broadcasted calls are always evaluated before broadcasted ones and I don't really see any way around this. To me at least, this behavior for throw makes sense.

@simeonschaub Yeah, I've warmed considerably to this, but I will note that the the standard behaviour of executing calls is built around assuming that the objects being broadcasted are functions, and && / || are not functions and have different semantics. Namely, that the thing to the right of them only gets evaluated if the thing to the left is true (false).

I still can't help but feel this is semantically a bit different.

mbauman · 2021-02-10T17:49:37Z

standard behaviour of executing calls is built around assuming that the objects being broadcasted are functions, and && / || are not functions and have different semantics.

I see this a little differently: I see a fused broadcast as being a simple syntax for a complicated for loop. This is most clearly demonstrated with the @. macro, but the mental model should be that anything that's not dotted is hoisted out of the loop. It doesn't really matter what is inside the for loop. I'd want this @. A < 1 || A > 2 to behave the same as A .< 1 .|| A .>2 to behave (roughly) the same as [A[i] < 1 || A[i] > 2 for i in eachindex(A)]. The dots just put it (whatever it is) inside the loop.

mbauman · 2021-02-10T18:13:31Z

On the topic of how broadcast implementors would see this: I intentionally intercept this very early on in the call chain. There are two forms of Broadcasteds this uses:

Broadcasted(andand, (x, y)) — this is where it makes sense for andand to be a Function. There's no evaluation to be done in the second argument and thus no short circuiting — and so this really wouldn't be any different than any other Julia function.
Broadcasted((x, yargs...)->x && f(yargs...), (x, yargs...)) — this is the case where there's some function evaluation happening in the second argument. As such, we build a flattened function f to represent the second argument and lift out all the arguments it uses, and then we use an anonymous function to put it together. All the custom implementations see is an anonymous function with a bunch of arguments — something they're already handling regularly. An advantage of doing this in Julia is that in building the flattened f we go through all the normal machinations of broadcasting. Interestingly, this means that, e.g., the O(1) specializations (like immediate computation of ranges) will still occur even if the LHS is all false.

This really isn't any different from what they deal with regularly.

JeffBezanson · 2021-02-11T20:47:11Z

Do we want to support the stand-alone .&& and .|| BroadcastFunctions if they cannot possibly short circuit?

I would say probably yes just for consistency.

Should dotted chained comparisons like 1 .< A .< 2 produce (1 .< A) .&& (A .< 2)? It currently uses .&. I'm leaning towards keeping the status quo

From triage: changing it would be the right thing but breaking, so keep it as-is for now.

test/broadcast.jl

mbauman · 2021-02-23T13:14:56Z

Thanks Simeon! I've updated the first post to reflect the current status quo.

This is ready as far as I'm concerned.

simeonschaub · 2021-02-23T15:12:21Z

I think I would still lean towards oror and andand not being callable, as to not add any confusion with regular functions. I am open to being convinced otherwise though.

mbauman · 2021-02-23T15:50:04Z

If they're not callable, we still need to implement this method. Here's a potential diff (of course, typeof(andand) could be simplified to AndAnd, but limiting diff size here):

diff --git a/base/broadcast.jl b/base/broadcast.jl
index 60ea22805f..cbd7cc9c86 100644
--- a/base/broadcast.jl
+++ b/base/broadcast.jl
@@ -179,12 +179,16 @@ function Broadcasted{Style}(f::F, args::Args, axes=nothing) where {Style, F, Arg
     Broadcasted{Style, typeof(axes), Core.Typeof(f), Args}(f, args, axes)
 end

-andand(a, b) = a && b
+struct AndAnd end
+const andand = AndAnd()
+broadcasted(::typeof(andand), a, b) = broadcasted((a, b) -> a && b, a, b)
 function broadcasted(::typeof(andand), a, bc::Broadcasted)
     bcf = flatten(bc)
     broadcasted((a, args...) -> a && bcf.f(args...), a, bcf.args...)
 end
-oror(a, b) = a || b
+struct OrOr end
+const oror = OrOr()
+broadcasted(::typeof(oror), a, b) = broadcasted((a, b) -> a || b, a, b)
 function broadcasted(::typeof(oror), a, bc::Broadcasted)
     bcf = flatten(bc)
     broadcasted((a, args...) -> a || bcf.f(args...), a, bcf.args...)

I actually started out this direction, but I changed to this because it's simpler (4 fewer LOC, and it lets Julia handle the singleton for us) and we are effectively defining a function method anyway. And it enabled the BroadcastFunction (which we've now dropped), but it also felt less user-hostile in case it leaked out.

simeonschaub · 2021-02-23T17:19:22Z

but it also felt less user-hostile in case it leaked out.

My worry is that these leaking out is likely to indicate a bug, since treating them just like regular functions would ignore their short-circuiting behavior and could lead to subtle mistakes.

Co-authored-by: Matt Bauman <[email protected]>

simeonschaub · 2021-04-03T12:59:29Z

I went ahead with this change for now, but feel completely free to revert if you'd prefer the previous approach.

simeonschaub · 2021-04-03T13:01:28Z

@mbauman Apparently, I can't request approval from you for your own PR, but please let me know what you think.

mbauman · 2021-04-09T00:21:21Z

Yes, I'm ok with this — not my first choice, but also not worth holding up the feature for. :)

Apparently I can't make a green check on my own PR, either:

Here you go:

@lower

I have long wanted a proper fix for issue JuliaLang#5187. It was the very first Julia issue I filed. This is a shot at such a fix. This PR: * Enables parsing for `.&&` and `.||`. They are parsed into `Expr(:call, :.&&, ...)` expressions at the same precedence as their respective `&&` and `||`: ```julia-repl julia> Meta.show_sexpr(:(a .&& b)) (:call, :.&&, :a, :b) ``` * Unlike all other dotted operators `.op` (like `.+`), the `op`-alone part (`var"&&"`) is not an exported name from Base. As such, this effectively lowers to `broadcasted((x,y)->x && y, ...)`, but instead of using an anonymous function I've named it `Base.andand` and `Base.oror`: ```julia-repl julia> Meta.@lower a .&& b :($(Expr(:thunk, CodeInfo( @ none within `top-level scope' 1 ─ %1 = Base.broadcasted(Base.andand, a, b) │ %2 = Base.materialize(%1) └── return %2 )))) ``` * I've used a named function to enable short-circuiting behavior _within the broadcast kernel itself_. In the case that the second argument is a part of the same fused broadcast kernel, it will only evaluate if required: ```julia-repl julia> mutable struct F5187; x; end julia> (f::F5187)(x) = (f.x += x) julia> (iseven.(1:4) .|| (F5187(0)).(ones(4))) 4-element Vector{Real}: 1.0 true 2.0 true ``` Co-authored-by: Simeon Schaub <[email protected]>

@lower

I have long wanted a proper fix for issue JuliaLang#5187. It was the very first Julia issue I filed. This is a shot at such a fix. This PR: * Enables parsing for `.&&` and `.||`. They are parsed into `Expr(:call, :.&&, ...)` expressions at the same precedence as their respective `&&` and `||`: ```julia-repl julia> Meta.show_sexpr(:(a .&& b)) (:call, :.&&, :a, :b) ``` * Unlike all other dotted operators `.op` (like `.+`), the `op`-alone part (`var"&&"`) is not an exported name from Base. As such, this effectively lowers to `broadcasted((x,y)->x && y, ...)`, but instead of using an anonymous function I've named it `Base.andand` and `Base.oror`: ```julia-repl julia> Meta.@lower a .&& b :($(Expr(:thunk, CodeInfo( @ none within `top-level scope' 1 ─ %1 = Base.broadcasted(Base.andand, a, b) │ %2 = Base.materialize(%1) └── return %2 )))) ``` * I've used a named function to enable short-circuiting behavior _within the broadcast kernel itself_. In the case that the second argument is a part of the same fused broadcast kernel, it will only evaluate if required: ```julia-repl julia> mutable struct F5187; x; end julia> (f::F5187)(x) = (f.x += x) julia> (iseven.(1:4) .|| (F5187(0)).(ones(4))) 4-element Vector{Real}: 1.0 true 2.0 true ``` Co-authored-by: Simeon Schaub <[email protected]>

@lower

I have long wanted a proper fix for issue JuliaLang#5187. It was the very first Julia issue I filed. This is a shot at such a fix. This PR: * Enables parsing for `.&&` and `.||`. They are parsed into `Expr(:call, :.&&, ...)` expressions at the same precedence as their respective `&&` and `||`: ```julia-repl julia> Meta.show_sexpr(:(a .&& b)) (:call, :.&&, :a, :b) ``` * Unlike all other dotted operators `.op` (like `.+`), the `op`-alone part (`var"&&"`) is not an exported name from Base. As such, this effectively lowers to `broadcasted((x,y)->x && y, ...)`, but instead of using an anonymous function I've named it `Base.andand` and `Base.oror`: ```julia-repl julia> Meta.@lower a .&& b :($(Expr(:thunk, CodeInfo( @ none within `top-level scope' 1 ─ %1 = Base.broadcasted(Base.andand, a, b) │ %2 = Base.materialize(%1) └── return %2 )))) ``` * I've used a named function to enable short-circuiting behavior _within the broadcast kernel itself_. In the case that the second argument is a part of the same fused broadcast kernel, it will only evaluate if required: ```julia-repl julia> mutable struct F5187; x; end julia> (f::F5187)(x) = (f.x += x) julia> (iseven.(1:4) .|| (F5187(0)).(ones(4))) 4-element Vector{Real}: 1.0 true 2.0 true ``` Co-authored-by: Simeon Schaub <[email protected]>

In JuliaLang/julia#39594 it was considered non-breaking enough to change the tokenization of ".&&" from `.&`,`&` to `.&&`. For JuliaSyntax it seems simplest to just use the newer tokenization for this and detect the use of the syntax in the parser, emitting an error on older versions. This leads to the most neat predictable parser errors.

…. Replace with regular vectorised and. Also, because it's over bitvectors, it's likely that it's not any faster: https://discourse.julialang.org/t/vectorized-short-circuit-operator/80336 See vectorised and pull request for 1.7: JuliaLang/julia#39594

mbauman added triage This should be discussed on a triage call broadcast Applying a function over a collection labels Feb 10, 2021

mbauman changed the title ~~RFC: Dot-broadcasting for short-circuiting ops .&& .||~~ RFC: Dot-broadcasting for short-circuiting ops .&& and .|| Feb 10, 2021

mbauman commented Feb 10, 2021

View reviewed changes

NEWS.md Outdated Show resolved Hide resolved

simeonschaub reviewed Feb 10, 2021

View reviewed changes

src/julia-parser.scm Outdated Show resolved Hide resolved

base/broadcast.jl Show resolved Hide resolved

mbauman and others added 2 commits February 10, 2021 09:58

update tests for call expr

8e4f11b

fix issue number in NEWS.md

f502b14

mbauman added the compiler:lowering Syntax lowering (compiler front end, 2nd stage) label Feb 10, 2021

mbauman removed the triage This should be discussed on a triage call label Feb 12, 2021

JeffBezanson changed the title ~~RFC: Dot-broadcasting for short-circuiting ops .&& and .||~~ Dot-broadcasting for short-circuiting ops .&& and .|| Feb 12, 2021

simeonschaub reviewed Feb 15, 2021

View reviewed changes

test/broadcast.jl Outdated Show resolved Hide resolved

simeonschaub added 2 commits February 18, 2021 09:45

Merge remote-tracking branch 'origin/master' into mb/revenge-of-5187

ccab6f5

make .|| and .&& special forms

8fe0eaf

simeonschaub and others added 2 commits April 3, 2021 14:52

Merge remote-tracking branch 'origin/master' into mb/revenge-of-5187

d25317f

make andand and oror not callable

2b58e24

Co-authored-by: Matt Bauman <[email protected]>

simeonschaub approved these changes Apr 3, 2021

View reviewed changes

simeonschaub requested a review from JeffBezanson April 3, 2021 13:00

vtjnash merged commit 40e57aa into master Apr 9, 2021

vtjnash deleted the mb/revenge-of-5187 branch April 9, 2021 03:48

cstjean mentioned this pull request Aug 4, 2022

Broadcasting && with missings #46252

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dot-broadcasting for short-circuiting ops .&& and .|| #39594

Dot-broadcasting for short-circuiting ops .&& and .|| #39594

mbauman commented Feb 10, 2021 •

edited

Loading

MasonProtter commented Feb 10, 2021 •

edited

Loading

simeonschaub left a comment

simeonschaub commented Feb 10, 2021

simeonschaub commented Feb 10, 2021

mbauman commented Feb 10, 2021

simeonschaub commented Feb 10, 2021

MasonProtter commented Feb 10, 2021 •

edited

Loading

mbauman commented Feb 10, 2021

mbauman commented Feb 10, 2021 •

edited

Loading

JeffBezanson commented Feb 11, 2021 •

edited

Loading

mbauman commented Feb 23, 2021

simeonschaub commented Feb 23, 2021

mbauman commented Feb 23, 2021

simeonschaub commented Feb 23, 2021

simeonschaub commented Apr 3, 2021

simeonschaub commented Apr 3, 2021

mbauman commented Apr 9, 2021 •

edited

Loading

Dot-broadcasting for short-circuiting ops .&& and .|| #39594

Dot-broadcasting for short-circuiting ops .&& and .|| #39594

Conversation

mbauman commented Feb 10, 2021 • edited Loading

Request for feedback

MasonProtter commented Feb 10, 2021 • edited Loading

simeonschaub left a comment

Choose a reason for hiding this comment

simeonschaub commented Feb 10, 2021

simeonschaub commented Feb 10, 2021

mbauman commented Feb 10, 2021

simeonschaub commented Feb 10, 2021

MasonProtter commented Feb 10, 2021 • edited Loading

mbauman commented Feb 10, 2021

mbauman commented Feb 10, 2021 • edited Loading

JeffBezanson commented Feb 11, 2021 • edited Loading

mbauman commented Feb 23, 2021

simeonschaub commented Feb 23, 2021

mbauman commented Feb 23, 2021

simeonschaub commented Feb 23, 2021

simeonschaub commented Apr 3, 2021

simeonschaub commented Apr 3, 2021

mbauman commented Apr 9, 2021 • edited Loading

mbauman commented Feb 10, 2021 •

edited

Loading

MasonProtter commented Feb 10, 2021 •

edited

Loading

MasonProtter commented Feb 10, 2021 •

edited

Loading

mbauman commented Feb 10, 2021 •

edited

Loading

JeffBezanson commented Feb 11, 2021 •

edited

Loading

mbauman commented Apr 9, 2021 •

edited

Loading