
close #8269: grow arrays incrementally by factors of 1.5 rather than 2 #16305

Closed
wants to merge 1 commit

Conversation

@stevengj (Member) commented May 10, 2016

As discussed in #8269, many authors seem to consider this to be a good idea. The patch seems trivial.

I tried a quick benchmark; ~~the performance difference was barely measurable (if anything, a 1% slowdown)~~ (whoops, screwed up the patch) it was _3–15% faster_ after this patch.

# Benchmark: push n elements one at a time onto an empty Int array,
# forcing the storage to be reallocated repeatedly as it grows.
function grow(n)
    a = Int[]
    for i = 1:n
        push!(a, i)
    end
    return a
end
@time grow(10^8);

It's apparently faster in both theory and practice, and wastes less memory. Seems worthwhile.

@JaredCrean2 (Contributor)

Should there be some docs about this? I don't see anything in the current docs about the efficiency of functions like push!

@stevengj (Member Author)

@JaredCrean2, good point; I added some docs to push!.

@@ -1406,6 +1406,10 @@ julia> push!([1, 2, 3], 4, 5, 6)
6
```

Internally, the storage for the array is increased exponentially (by 50%)
as needed, so that calling `push!` ``n`` times on an empty array
involves only ``O(\log n)`` allocations and ``O(n)`` time.
Contributor

that should be \\log n

Member Author

Thanks, fixed.

@dpsanders (Contributor)

Is there a sensible way to try factors for the growth other than 2 and 1.5?

@stevengj (Member Author)

@TotalVerb, probably most C compilers do this, but C programmers traditionally read >> 1 as / 2 anyway. 😉

@stevengj (Member Author) commented May 11, 2016

@dpsanders, you can try anything you want. Honestly, I doubt the cost of the arithmetic is significant here, so you could just try (int) (size * 1.xxx) (though you have to be careful about minimum-size thresholds to make sure that the size strictly increases when it is small). But since a lot of other authors seem to have settled on 1.5 as a good heuristic, and it indeed seems to be better in practice than 2, I'm not sure I'm enthusiastic enough to do that experiment myself.
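
For illustration, a minimal sketch of such a rule (hypothetical names; the real logic lives in the C runtime, not in Julia code):

# Hypothetical sketch of a capacity-growth rule with a configurable factor.
# The max() guard is the minimum-size threshold mentioned above: without it,
# floor(cap * 1.5) == cap for cap <= 1, and the array would never grow.
function grow_capacity(cap::Integer, factor::Real)
    newcap = floor(Int, cap * factor)
    return max(newcap, cap + 1)   # ensure the capacity strictly increases
end

With that guard, grow_capacity(0, 1.5) == 1, grow_capacity(1, 1.5) == 2, and grow_capacity(100, 1.5) == 150.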

@stevengj (Member Author) commented May 11, 2016

Okay, I ran a benchmark of grow(10^8) (using the Benchmarks.jl package with a benchmark time of 30s) for size = 1 + (int) (size * growth) as a function of the growth factor. The results are shown below.

There is a clear penalty for making the growth factor too small, and if anything the optimum growth factor below 2 is around 1.7. However, I would take small differences in timing here with a large grain of salt, because it depends on things like our garbage-collection algorithm (not to mention on the hardware, in this case my 2014 2.15GHz i7 laptop).

1.5 still seems like a reasonable compromise, minimizing over-allocation without sacrificing much if any performance.

[plot: benchmark time for grow(10^8) as a function of the growth factor]
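
The space side of the tradeoff can also be modeled without touching the runtime. A rough sketch (it uses the same 1 + floor(size * growth) rule as above, with an assumed starting capacity of 4, and ignores the allocator and gc entirely, which is exactly the caveat above):

# Toy model of repeated push!: count reallocations and the final
# over-allocation for a given growth factor.  Only the space side;
# it knows nothing about the allocator or the gc.
function growth_model(n, factor; initial = 4)   # `initial` is an assumed starting capacity
    cap = initial
    reallocs = 0
    while cap < n
        cap = 1 + floor(Int, cap * factor)
        reallocs += 1
    end
    return reallocs, cap / n   # (number of copies, final over-allocation ratio)
end

For n = 10^8 this gives on the order of 40 reallocations at a factor of 1.5 versus about 25 at 2.0, with a correspondingly smaller final overshoot at 1.5.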

@vchuravy (Member)

I always found that https://github.com/facebook/folly/blob/master/folly/docs/FBVector.md gives a nice rationale for choosing 1.5

@stevengj (Member Author) commented May 11, 2016

@vchuravy, that argument, which eventually leads to the golden ratio (as you cited on #8269), is indeed very cute. (But you never want to trust simple performance models too much without checking them on actual hardware, which is almost always vastly more complicated than the theory.)
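
For what it's worth, the reuse argument is easy to play with numerically. A toy model (my own construction, not taken from the FBVector notes; it assumes the freed blocks are contiguous and that nothing else reuses them in the meantime):

# After how many reallocations (if ever) does the space freed by all the
# *previous* blocks become large enough to hold the next one?
function first_reusable_step(r; maxsteps = 50)
    sizes = [1.0]                      # block sizes allocated so far
    for k in 1:maxsteps
        next = r * sizes[end]
        sum(sizes[1:end-1]) >= next && return k
        push!(sizes, next)
    end
    return nothing                     # never within maxsteps (e.g. r = 2)
end

Under this (debatable) model, first_reusable_step(1.5) returns 5 while first_reusable_step(2.0) returns nothing; the answer changes if the current block is also counted as reusable space.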

@mschauer (Contributor) commented May 11, 2016

Interesting. The statement from https://blog.mozilla.org/nnethercote/2014/11/04/please-grow-your-buffers-exponentially/ cited

"Now, you wouldn’t want to use exactly this value [the golden ratio] to do your memory reallocation, because you’d end up waiting forever to be able to reuse your old memory "

is misleading: "forever" here is only three allocations, since 1 + ϕ^1 = ϕ^2. The next interesting number is χ = 1.8392867552, the "golden number corresponding to the tribonacci sequence", satisfying 1 + χ + χ^2 = χ^3, which allows reuse after four steps. Choosing 1.75 (a bit smaller than 1.83, to allow for bookkeeping) also makes sense.

Edit: Whether 1 + χ + χ^2 = χ^3 or 1 + χ + χ^2 = χ^4 is relevant depends on the implementation of realloc.
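
(Both identities are easy to check numerically, e.g.:)

julia> ϕ = (1 + sqrt(5)) / 2; 1 + ϕ ≈ ϕ^2
true

julia> χ = 1.8392867552; 1 + χ + χ^2 ≈ χ^3
true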

@jrevels (Member) commented May 11, 2016

Specifically, it'll be good to see how the benchmarks in the "growth" group fare. Something within 3%-15% might be too small a deviation with our current noise tolerance, though.

@nanosoldier runbenchmarks("array", vs = ":master")

@jrevels (Member) commented May 11, 2016

Reading the most recent status made me realize that ":master" was comparing against stevengj/julia:master, not JuliaLang/julia:master, as it should have been. I just fixed it; this should restart the build here:

@nanosoldier runbenchmarks("array", vs = ":master")

@stevengj (Member Author) commented May 11, 2016

@jrevels, that is cool stuff. (@nanosoldier please get me lunch.)

@stevengj (Member Author)

Technically, shouldn't it compare to the master branch at the commit where this PR was forked? Otherwise you could get spurious performance differences due to changes merged after this PR. (Or are you comparing to this PR rebased/merged onto the current master?)

@jrevels (Member) commented May 11, 2016

Or are you comparing to this PR rebased/merged onto the current master?

If the merge can be performed automatically (without conflicts), then it will use the merge commit of the PR and compare it against whatever commit you specified with `vs`. If the merge can't be performed automatically, it will simply use the head commit of the PR instead. There's more documentation on @nanosoldier's behavior here if you're interested.

@stevengj (Member Author) commented May 11, 2016

@jrevels, any way to see the full error log on nanosoldier? Hopefully the failure is not the fault of this PR since Travis and Appveyor passed?

@jrevels (Member) commented May 11, 2016

any way to see the full error log on nanosoldier?

Not at this time, unfortunately. It seems Nanosoldier just had trouble posting the final "job completed" comment, it wasn't the fault of this PR. Weird, since some other jobs just finished and the comments were correctly posted there...seems I still have some kinks to work out since the most recent refactor.

Besides that last comment error, the entire job completed and the report was uploaded here. You can play with these same benchmarks locally using BaseBenchmarks.jl.

@KristofferC (Member)

It is relevant to note that all the growth benchmarks have a final size of 2^k.

@StefanKarpinski (Member)

@vchuravy, that argument, which eventually leads to the golden ratio (as you cited on #8269), is indeed very cute. (But you never want to trust simple performance models too much without checking them on actual hardware, which is almost always vastly more complicated than the theory.)

Doesn't the theoretical argument for the golden ratio being optimal, plus your benchmarks showing that the optimal ratio is around 1.6, argue pretty strongly for the golden ratio actually being a good choice?

@mschauer (Contributor) commented May 12, 2016

@StefanKarpinski The conclusion of the theoretical arguments really depends on whether the new resized array should fit into the space occupied by all previous resized arrays excluding or including the space occupied by the current array. If it is excluding, then 1.6... is not optimal; on the contrary, it is the smallest factor for which the sum of all previous sizes, excluding the current one, is never enough. It took me a moment, too. I would go with @stevengj's dictum: "But you never want to trust simple performance models too much without checking them on actual hardware, which is almost always vastly more complicated than the theory."

@jrevels (Member) commented May 12, 2016

It is relevant to note that all the growth benchmarks have a final size of 2^k.

Good point; it would be better to test more varied sizes. In the meantime, it's probably worth running the whole suite here to test all of our application-based benchmarks (the previous run covered only the "array" benchmarks).

@nanosoldier runbenchmarks(ALL, vs = ":master")

@nanosoldier (Collaborator)

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

@StefanKarpinski (Member)

Isn't "checking them on actual hardware" exactly what @stevengj did?

@JeffBezanson (Member)

Looks like the nanosoldier is not happy with this.

@mschauer (Contributor)

Simulations are good; it is the theoretical arguments that do not give clear answers. By the way, @stevengj, I think

function grow(n)
    a = Int[]
    b = Int[]
    for i = 1:n
        push!(a, i)
        push!(b, i)
    end
    return a,b
end

would be a relevant test: it ensures that at least one array is not at the top of the heap, where growing to the right/upwards is easy.

@stevengj (Member Author) commented May 13, 2016

@StefanKarpinski, if you take my benchmark as gospel, then 1.7 is even better. However:

  • That is just a single benchmark (on a single machine, for a single version of Julia's gc). As the nanosoldier pointed out, different benchmarks may give different results. That variance means that we don't want to place too much weight on small differences in performance when deciding on a factor.
  • It's not just a question of maximizing speed: The bigger the growth factor, the more memory will get wasted in general.

@stevengj (Member Author) commented Aug 2, 2016

Do we want to revisit this at some point?

@eschnett (Contributor) commented Aug 2, 2016

Growing an array to a size of 10^8 must be rare. The common case is probably growing thousands of arrays to sizes of 10^3 or 10^5. I'd benchmark these cases as well before making a decision.
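
For example, something along these lines (the sizes and counts are just placeholders):

# Sketch of the many-small-arrays case: grow `narrays` arrays to `len`
# elements each, interleaved so they all compete for heap space.
function grow_many(narrays, len)
    arrays = [Int[] for _ in 1:narrays]
    for i in 1:len, a in arrays
        push!(a, i)
    end
    return arrays
end
@time grow_many(10^3, 10^5);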

@stevengj (Member Author) commented Aug 2, 2016

Yes, but I should emphasize that it's not completely a question of performance (and there is bound to be some variability between different benchmarks); there's also a space/time tradeoff here.

@eschnett (Contributor) commented Aug 2, 2016

If the array is large, and on a 64-bit system, the additional space should only be "address space", not "allocated memory". With paging, the system will also be able to reuse previously allocated memory since fragmentation is (in principle) limited to the OS page size, and (in practice) to 128 kByte or so, for GNU malloc.

@stevengj (Member Author)

Rebased, re-running benchmarks to see if anything has changed:

@nanosoldier runbenchmarks("array", vs = ":master")

@nanosoldier (Collaborator)

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

@oscardssmith (Member)

Something worth noting is that none of these have a memory difference of more than 10% in phi's favor. To me, this suggests that either our theory is really wrong, or something is messed up with the implementation (or allocated-but-unused memory isn't counted; is that true? If so, it should probably be fixed).

@yuyichao (Contributor)

Note that no memory saving should be expected in the benchmarks that repeatedly grow arrays, since those numbers do not take memory reuse into account.

@musm (Contributor) commented Sep 17, 2020

Bump. It seems the decision was to use 1.5 as a good compromise; any reason this went stale?

@KristofferC (Member)

#16305 (comment)

@musm (Contributor) commented Sep 17, 2020

Probably worth trying again after four years.

@stevengj (Member Author) commented Sep 17, 2020

Rebased.

As @JeffBezanson commented in #32035 (comment), we may want to grow by a larger factor for small-size arrays until the length reaches some threshold.
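
For example, a hybrid rule might look something like this (the threshold and factors are made up, and the real change would be in the C runtime):

# Hypothetical hybrid policy: double while the array is small, then switch
# to 1.5x growth once the capacity passes some threshold.  Illustrative only.
const SMALL_THRESHOLD = 1024   # assumed cutoff, in elements

function hybrid_grow(cap::Int)
    newcap = cap < SMALL_THRESHOLD ? 2 * cap : floor(Int, 1.5 * cap)
    return max(newcap, cap + 1)   # keep tiny sizes strictly increasing
end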

@stevengj (Member Author)

But we could always close this PR in favor of #32035.

@musm (Contributor) commented Dec 15, 2020

closing this in favor of #32035

@musm closed this Dec 15, 2020
Labels: performance (Must go faster)