Boolean operators don't have specialized versions for sparse matrices #13024

sbromberger · 2015-09-09T00:38:12Z

Ref https://groups.google.com/d/msg/julia-users/bo6YMXzWPdA/jKEWz_rfAQAJ

The following code runs in ~15 seconds:

_column(a::AbstractSparseArray, i::Integer) = sub(a.rowval, a.colptr[i]:a.colptr[i+1]-1)

# material nonimplication
⊅(p::Bool, q::Bool) = p & !q 

function ⊅(a::SparseMatrixCSC, b::SparseMatrixCSC) 
    (m,n) = size(a) 
    resultmx = spzeros(Bool,m,n) 
    for c = 1:n 
        for r in _column(a,n) 
            # info("row $r, col $c") 
            resultmx[r,c] = ⊅(a[r,c], b[r,c]) 
        end 
    end 
    return resultmx 
end

a = sprandbool(1000000,1000000,0.0001);
b = sprandbool(1000000,1000000,0.0001);

julia> @time z = a ⊅ b;
 15.272250 seconds (1.10 M allocations: 73.870 MB)

Replacing @time z = a ⊅ b with @time z = a & b results in a > 10 minute execution (I stopped it after 10 minutes or so).

Interestingly enough, substituting the ⊅-equivalent resultmx[r,c] = !b[r,c] within the function itself also causes poor performance relative to the above code.

The text was updated successfully, but these errors were encountered:

IainNZ · 2015-09-09T01:00:00Z

EDIT: disregard, issue was updated

I tried

function other(a::SparseMatrixCSC, b::SparseMatrixCSC) 
    (m,n) = size(a) 
    resultmx = spzeros(Bool,m,n) 
    for c = 1:n 
        for r in _column(a,n) 
            # info("row $r, col $c") 
            resultmx[r,c] = a[r,c] & !b[r,c]
        end 
    end 
    return resultmx 
end

and got

julia> @time z = a ⊅ b;
 12.544197 seconds (1.12 M allocations: 74.449 MB)

julia> @time z = other(a,b);
 12.134954 seconds (1.01 M allocations: 69.430 MB, 0.27% gc time)

sbromberger · 2015-09-09T01:00:03Z

This is on

Version 0.4.0-pre+7107 (2015-08-31 16:51 UTC)
Commit 4e44a1c (8 days old master)

sbromberger · 2015-09-09T01:01:50Z

@IainNZ ah - that seems to imply that the problem with & is when it's used with sparse matrices, not with bools.

IainNZ · 2015-09-09T01:05:37Z

julia> @which (&)(sprand(5,5,0.1), sprand(5,5,0.1))
&{S,T}(A::AbstractArray{S,N}, B::AbstractArray{T,N}) at arraymath.jl:96

sbromberger · 2015-09-09T01:10:35Z

so it looks like there's no method on & optimized for sparse matrices, and so it's iterating over all r,c pairs. That's... silly. Should I make a PR?

tkelman · 2015-09-09T01:24:23Z

Same as in #12118, can probably just add & and | to this loop

julia/base/sparse/sparsematrix.jl

Line 887 in bffe239

for op in (+, -, min, max)

, add tests and done. Since & is zero-preserving (result is zero if either input is zero) slightly fancier things could be done for it, but we don't have distinct scalar & vs broadcasting .& like we do for most other operators so probably not worth dealing with that right now.

edit: actually I think eltype_plus should be changed to something different for max, min, & and |

jiahao · 2015-09-09T01:24:27Z

so it looks like there's no method on & optimized for sparse matrices

Yes, this is true for much of the sparse matrix functionality, simply because no one has had the time or use case to warrant implementing them. PR welcomed although it probably will go into 0.5.

It might be as simple as extending this loop to include &, |, $, etc.

tkelman · 2015-09-09T01:31:03Z

There's actually a bug there,

julia> A = sprandbool(5,5,0.3)
5x5 sparse matrix with 7 Bool entries:
        [1, 1]  =  true
        [2, 1]  =  true
        [2, 2]  =  true
        [5, 2]  =  true
        [1, 4]  =  true
        [1, 5]  =  true
        [2, 5]  =  true

julia> B = sprandbool(5,5,0.3)
5x5 sparse matrix with 5 Bool entries:
        [3, 1]  =  true
        [1, 3]  =  true
        [2, 3]  =  true
        [5, 4]  =  true
        [5, 5]  =  true

julia> max(A, B)
5x5 sparse matrix with 12 Int64 entries:
        [1, 1]  =  1
        [2, 1]  =  1
        [3, 1]  =  1
        [2, 2]  =  1
        [5, 2]  =  1
        [1, 3]  =  1
        [2, 3]  =  1
        [1, 4]  =  1
        [5, 4]  =  1
        [1, 5]  =  1
        [2, 5]  =  1
        [5, 5]  =  1

May as well fix both issues now. If anyone were relying on the present behavior they probably would have complained and asked to change it already.

sbromberger · 2015-09-09T02:28:52Z

@tkelman - Sorry - it's late. Why is that a bug? (is it because it's being converted to Int, when max(x::Bool, y::Bool) returns a boolean?)

simonster · 2015-09-09T02:30:42Z

I think because max(true, true) is true, not 1.

sbromberger · 2015-09-09T14:38:37Z

I opened up #13029 to track the performance of ! on an element separately, since I don't think it's related.

ViralBShah · 2015-09-09T14:43:30Z

My preference is to have these things go to 0.5. However, it is not a problem to get them into 0.4.x if someone really needs it (perhaps @sbromberger does), since this will not break the 0.4 API.

sbromberger · 2015-09-09T14:45:11Z

I'm happy to work on a PR. Obviously it won't go into -release. I've been looking at the underlying code and it will take time for me to understand it fully. :)

Edit to add: I do really need this functionality, actually. It puzzles me that nobody's brought this issue up before: are sparse matrices not widely used?

ViralBShah · 2015-09-09T14:53:34Z

Sparse matrices are increasingly getting widely used. :-)

IainNZ · 2015-09-09T15:23:33Z

Sparse matrices of Float64s with Int64 indices get used extensively, but anything else, not as often. Hence #12984 and this

tkelman · 2015-09-09T15:27:49Z

I'll open a PR that fixes this today. Won't be hard, and no reason to wait on fixing this since the current behavior is slow and incorrect.

also fix eltype promotion in sparse max and min

sbromberger · 2015-09-09T23:08:57Z

Thanks. Glad to see it uncovered a few things despite the bugs in my code that prompted the issue.

Fix #13024, sparse & and |

sbromberger · 2015-09-10T19:03:20Z

Thank you (again) very much - this results in orders of magnitude more efficient random graph generation.

also fix eltype promotion in sparse max and min Also add $ because Jiahao asked nicely (cherry picked from commit cdb6496) ref #13024

sbromberger changed the title ~~possible performance issues with sparse matrices and booleans~~ possible performance issues with sparse matrices and boolean operators Sep 9, 2015

tkelman added performance Must go faster sparse Sparse arrays labels Sep 9, 2015

IainNZ changed the title ~~possible performance issues with sparse matrices and boolean operators~~ Boolean operators don't have specialized versions for sparse matrices Sep 9, 2015

sbromberger mentioned this issue Sep 9, 2015

Performance of ! is poor relative to more complex functions with CSCSparseMatrices #13029

Closed

tkelman self-assigned this Sep 9, 2015

tkelman added a commit that referenced this issue Sep 9, 2015

Fix #13024, sparse & and |

d4c87a6

also fix eltype promotion in sparse max and min

tkelman closed this as completed in cdb6496 Sep 10, 2015

jakebolewski added a commit that referenced this issue Sep 10, 2015

Merge pull request #13036 from JuliaLang/tk/fix13024

35758e1

Fix #13024, sparse & and |

tkelman added a commit that referenced this issue Sep 11, 2015

Fix #13024, sparse & and |

f35a6ff

also fix eltype promotion in sparse max and min Also add $ because Jiahao asked nicely (cherry picked from commit cdb6496) ref #13024

tkelman mentioned this issue Oct 11, 2016

Sparse test sets #18844

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Boolean operators don't have specialized versions for sparse matrices #13024

Boolean operators don't have specialized versions for sparse matrices #13024

sbromberger commented Sep 9, 2015

IainNZ commented Sep 9, 2015

sbromberger commented Sep 9, 2015

sbromberger commented Sep 9, 2015

IainNZ commented Sep 9, 2015

sbromberger commented Sep 9, 2015

tkelman commented Sep 9, 2015

jiahao commented Sep 9, 2015

tkelman commented Sep 9, 2015

sbromberger commented Sep 9, 2015

simonster commented Sep 9, 2015

sbromberger commented Sep 9, 2015

ViralBShah commented Sep 9, 2015

sbromberger commented Sep 9, 2015

ViralBShah commented Sep 9, 2015

IainNZ commented Sep 9, 2015

tkelman commented Sep 9, 2015

sbromberger commented Sep 9, 2015

sbromberger commented Sep 10, 2015

Boolean operators don't have specialized versions for sparse matrices #13024

Boolean operators don't have specialized versions for sparse matrices #13024

Comments

sbromberger commented Sep 9, 2015

IainNZ commented Sep 9, 2015

sbromberger commented Sep 9, 2015

sbromberger commented Sep 9, 2015

IainNZ commented Sep 9, 2015

sbromberger commented Sep 9, 2015

tkelman commented Sep 9, 2015

jiahao commented Sep 9, 2015

tkelman commented Sep 9, 2015

sbromberger commented Sep 9, 2015

simonster commented Sep 9, 2015

sbromberger commented Sep 9, 2015

ViralBShah commented Sep 9, 2015

sbromberger commented Sep 9, 2015

ViralBShah commented Sep 9, 2015

IainNZ commented Sep 9, 2015

tkelman commented Sep 9, 2015

sbromberger commented Sep 9, 2015

sbromberger commented Sep 10, 2015