add mapaccumulate method #21152

CarloLucibello · 2017-03-24T08:53:00Z

since reduce -> mapreduce and foldl -> mapfoldl, I think that also accumulate deserves an extension which can accept a function argument. See also #21150 for a related issue (more specific)

The text was updated successfully, but these errors were encountered:

bramtayl · 2017-03-24T14:08:24Z

Why not just deprecate mapf functions for f function methods for generators? Speaking of generators, I think the Base.Generator constructor should be exported.

bramtayl · 2017-03-24T14:29:51Z

In fact, that's something I'd be interested in making a PR for if anyone is interested.

CarloLucibello · 2017-03-24T17:24:05Z

could the fact that

julia> eltype(i for i in 1:10)
Any

have some impact on performances?
BTW should an issue be filed regarding this behaviour of even the simplest generators? Isn't it a potential cause of type instabilitites?

ararslan · 2017-03-24T18:37:39Z

I am very, very adamantly against deprecating map and the passing of functions as arguments. This construct is absolutely central to functional programming and is something that Julia does exceptionally well. It's one of the things I find most appealing about the language. I tend to think of Generators as a "use this when expressing something would otherwise be difficult." The fact that generators exist permits people to use that Pythonic style if they want. But we should absolutely not enforce that as the way to map functions.

CarloLucibello · 2017-03-24T18:43:47Z

@ararslan what is your thought on mapreduce, mapfoldl, mapaccumulate...? Could they be sostituted by reduce(f(v) for v in v) and so on?

bramtayl · 2017-03-24T18:45:36Z

I'm not suggesting getting rid of function passing. In fact I like that syntax. Instead I'm suggesting syntax like reduce(Generator(x -> x + 1, [1, 2]), +)

ararslan · 2017-03-24T18:48:12Z

what is your thought on mapreduce, mapfoldl, mapaccumulate...? Could they be sostituted by reduce(f(v) for v in v) and so on?

We can allow them but we should absolutely not deprecate anything in favor of them.

I'm suggesting syntax like reduce(Generator(x -> x + 1, [1, 2]), +)

I'd be fine with allowing that (but with the argument order reversed; + comes first in reduce) but I still think it should not replace anything.

bramtayl · 2017-03-24T18:52:23Z

Julia's long standing policy, from what I understand, is to avoid multi-concept functions because they are incompletely factored. In fact, Julia is so adamant about it as to nix using _ in function names. Well, anyway, this is a clear case, it seems to me, of a set of easily "factorable" functions.

martinholters · 2017-03-24T19:00:43Z

As long as creating Generators allocates, that causes overhead which function-to-be-mapped-as-argument versions don't have.

bramtayl · 2017-03-24T19:02:41Z

I actually don't quite understand this. Is it possible to use things like generators and avoid allocation?

ararslan · 2017-03-24T19:03:40Z

Functions that accomplish multiple tasks at once are often good for efficiency. Plus it can't really be a long-standing policy to avoid them when, e.g. mapreduce has been around since at least 0.1.

TotalVerb · 2017-03-25T08:17:55Z

There's no reason why generators would cause type instability here, as we don't need to know the eltype to implement reduce, etc. Generators are lazy so there is no big efficiency difference either. We should benchmark the various approaches before making assumptions about performance.

TotalVerb · 2017-03-25T08:21:08Z

@martinholters I think this doesn't really matter. The extra cost of a heap allocation is negligible unless the arrays being worked with are nearly empty and the operation is nearly free, which seems like a niche use case.

CarloLucibello · 2017-03-25T08:34:19Z

a small experiment shows that the implementation for generators (or generic iterators?) of reduce needs some love. On master:

julia> r=rand(10000);

julia> @benchmark mapreduce(x->x^2,+,r)
BenchmarkTools.Trial: 
  memory estimate:  16 bytes
  allocs estimate:  1
  --------------
  minimum time:     4.349 μs (0.00% GC)
  median time:      4.907 μs (0.00% GC)
  mean time:        5.596 μs (0.00% GC)
  maximum time:     1.861 ms (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     9

julia> @benchmark reduce(+,(r^2 for r in r))
BenchmarkTools.Trial: 
  memory estimate:  48 bytes
  allocs estimate:  3
  --------------
  minimum time:     30.702 μs (0.00% GC)
  median time:      42.943 μs (0.00% GC)
  mean time:        44.460 μs (0.00% GC)
  maximum time:     1.170 ms (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1

#######
julia> @benchmark mapreduce(x->x^2,+,$r)
BenchmarkTools.Trial: 
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     2.296 μs (0.00% GC)
  median time:      3.752 μs (0.00% GC)
  mean time:        4.254 μs (0.00% GC)
  maximum time:     583.003 μs (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     9

julia> g = (r^2  for r in r)
Base.Generator{Array{Float64,1},##47#48}(#47, [0.932495, 0.325073, 0.240037, 0.576765, 0.173194, 0.820185, 0.854522, 0.481677, 0.567246, 0.100028  …  0.862458, 0.13161, 0.220919, 0.524313, 0.904255, 0.339595, 0.55281, 0.942101, 0.0240301, 0.892888])

julia> @benchmark reduce(+,$g)
BenchmarkTools.Trial: 
  memory estimate:  32 bytes
  allocs estimate:  2
  --------------
  minimum time:     13.163 μs (0.00% GC)
  median time:      14.126 μs (0.00% GC)
  mean time:        15.798 μs (0.00% GC)
  maximum time:     3.626 ms (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1

In this case the generators' construction time has a huge impact.
I'm not sure why those $ in the last two benchmarks matter (tried multiple times with or without)

simonbyrne mentioned this issue Jan 30, 2018

create Accumulate iterator #25766

Open

2 tasks

abhinav3398 mentioned this issue Oct 14, 2020

cumsum should accept functions and generic iterables #21150

Open

brenhinkeller added feature Indicates new feature / enhancement requests fold sum, maximum, reduce, foldl, etc. labels Nov 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add mapaccumulate method #21152

add mapaccumulate method #21152

CarloLucibello commented Mar 24, 2017

bramtayl commented Mar 24, 2017 •

edited

Loading

bramtayl commented Mar 24, 2017

CarloLucibello commented Mar 24, 2017 •

edited

Loading

ararslan commented Mar 24, 2017

CarloLucibello commented Mar 24, 2017

bramtayl commented Mar 24, 2017

ararslan commented Mar 24, 2017

bramtayl commented Mar 24, 2017

martinholters commented Mar 24, 2017

bramtayl commented Mar 24, 2017

ararslan commented Mar 24, 2017

TotalVerb commented Mar 25, 2017

TotalVerb commented Mar 25, 2017

CarloLucibello commented Mar 25, 2017 •

edited

Loading

add mapaccumulate method #21152

add mapaccumulate method #21152

Comments

CarloLucibello commented Mar 24, 2017

bramtayl commented Mar 24, 2017 • edited Loading

bramtayl commented Mar 24, 2017

CarloLucibello commented Mar 24, 2017 • edited Loading

ararslan commented Mar 24, 2017

CarloLucibello commented Mar 24, 2017

bramtayl commented Mar 24, 2017

ararslan commented Mar 24, 2017

bramtayl commented Mar 24, 2017

martinholters commented Mar 24, 2017

bramtayl commented Mar 24, 2017

ararslan commented Mar 24, 2017

TotalVerb commented Mar 25, 2017

TotalVerb commented Mar 25, 2017

CarloLucibello commented Mar 25, 2017 • edited Loading

bramtayl commented Mar 24, 2017 •

edited

Loading

CarloLucibello commented Mar 24, 2017 •

edited

Loading

CarloLucibello commented Mar 25, 2017 •

edited

Loading