refactor(stdune): improve path tables #8052

rgrinberg · 2023-06-27T09:12:22Z

Use a more memory efficient path table. Instead of using the variant for
the key, combine 3 tables all for the individual paths.

This makes the empty table a little more bloated (3x bigger), but gives
us a saving of 2 words for every single key we store.

Signed-off-by: Rudi Grinberg [email protected]

snowleopard

Looks good! I am going to benchmark this change internally too.

otherlibs/stdune/src/path.mli

snowleopard · 2023-07-09T19:11:14Z

What about Path.Set and Path.Map? They are probably even bigger offenders (because we have more of them).

It's fine to leave them for another PR, just wondering if you've got plans.

otherlibs/stdune/src/path.ml

rgrinberg · 2023-07-09T20:02:06Z

What about Path.Set and Path.Map? They are probably even bigger offenders (because we have more of them).

I have an upcoming PR indeed.

src/dune_engine/cached_digest.ml

otherlibs/stdune/src/path.mli

snowleopard · 2023-07-10T18:10:18Z

The current version of this PR still shows as a slight regression in my benchmarks.

@rgrinberg What are the results on your end?

rgrinberg · 2023-07-10T18:55:59Z

Runtime performance shows a slight improvement but it's well within the noise range.

The GC stats are quite accurate and they do show a small improvement across the board, but that isn't necessarily better for performance.

otherlibs/stdune/src/path.ml

snowleopard · 2023-07-10T19:12:23Z

Runtime performance shows a slight improvement but it's well within the noise range.

The GC stats are quite accurate and they do show a small improvement across the board, but that isn't necessarily better for performance.

Hmm, I wonder if we're seeing different results because we're using different sets of rules.

I'm also somewhat surprised to see a small regression in my benchmarks. It seems like this PR is mostly changing things for the better, so I'm kind of puzzled.

rgrinberg · 2023-07-11T06:43:38Z

I'm not surprised that this change isn't a performance improvement, but I'm definitely surprised there's any regression. I imagined there would be no difference at all either way. I guess we should remember that OCaml hash tables aren't particularly space efficient anyway and removing a single word isn't going to change all that much.

We should note that our benchmarks are a bit biased against watch mode though. In watch mode, consuming less memory is more important because it makes subsequent operations faster. While in our measurements, we only care about the first completion build and do not care about any subsequent memory usage.

snowleopard · 2023-07-11T13:40:45Z

Just a quick update: I've started some more benchmarks.

snowleopard · 2023-07-11T17:35:45Z

Hah, now I'm seeing pretty consistent small improvements across most benchmarks! Maybe I botched it the first time.

Roughly: -0.1-0.2% in terms of allocations and -0.1-0.5% in terms of time.

snowleopard

Thanks!

snowleopard · 2023-07-11T17:44:23Z

@rgrinberg Btw, don't forget to switch to ~store:(module Path.Table) in Action_builder.contents. I'm making this change internally.

snowleopard · 2023-07-11T20:06:27Z

@rgrinberg Btw, don't forget to switch to ~store:(module Path.Table) in Action_builder.contents. I'm making this change internally.

Just to add: with this change included, benchmarks now 2% speed-up for from-cache and null builds!

Use a more memory efficient path table. Instead of using the variant for the key, combine 3 tables all for the individual paths. This makes the empty table a little more bloated (3x bigger), but gives us a saving of 2 words for every single key we store. Signed-off-by: Rudi Grinberg <[email protected]>

#8052 updated the representation for these, but didn't bump the version. Signed-off-by: Rudi Grinberg <[email protected]>

#8052 updated the representation for these, but didn't bump the version. Signed-off-by: Rudi Grinberg <[email protected]>

Alizter mentioned this pull request Jun 27, 2023

bench: add GC stats to bench #8063

Merged

3 tasks

rgrinberg force-pushed the ps/rr/refactor_stdune___improve_path_tables branch from 3f98ef2 to fce52ee Compare July 8, 2023 13:36

rgrinberg requested a review from snowleopard July 9, 2023 10:57

snowleopard reviewed Jul 9, 2023

View reviewed changes

otherlibs/stdune/src/path.mli Show resolved Hide resolved

snowleopard reviewed Jul 9, 2023

View reviewed changes

otherlibs/stdune/src/path.ml Show resolved Hide resolved

snowleopard reviewed Jul 9, 2023

View reviewed changes

otherlibs/stdune/src/path.ml Outdated Show resolved Hide resolved

rgrinberg force-pushed the ps/rr/refactor_stdune___improve_path_tables branch from fce52ee to 4048e01 Compare July 9, 2023 20:25

snowleopard reviewed Jul 9, 2023

View reviewed changes

src/dune_engine/cached_digest.ml Outdated Show resolved Hide resolved

rgrinberg force-pushed the ps/rr/refactor_stdune___improve_path_tables branch 3 times, most recently from 93a1448 to 73ea380 Compare July 9, 2023 22:16

snowleopard reviewed Jul 10, 2023

View reviewed changes

src/dune_engine/cached_digest.ml Outdated Show resolved Hide resolved

snowleopard reviewed Jul 10, 2023

View reviewed changes

src/dune_engine/cached_digest.ml Outdated Show resolved Hide resolved

snowleopard reviewed Jul 10, 2023

View reviewed changes

otherlibs/stdune/src/path.mli Outdated Show resolved Hide resolved

rgrinberg force-pushed the ps/rr/refactor_stdune___improve_path_tables branch 4 times, most recently from 3a26c8b to 1580f4d Compare July 10, 2023 15:25

rgrinberg mentioned this pull request Jul 10, 2023

refactor: use [Path.Table] for build system tables #8164

Closed

snowleopard reviewed Jul 10, 2023

View reviewed changes

otherlibs/stdune/src/path.mli Show resolved Hide resolved

rgrinberg commented Jul 10, 2023

View reviewed changes

otherlibs/stdune/src/path.ml Show resolved Hide resolved

snowleopard approved these changes Jul 11, 2023

View reviewed changes

rgrinberg force-pushed the ps/rr/refactor_stdune___improve_path_tables branch from 98c16f8 to 76d37ba Compare July 11, 2023 20:46

rgrinberg force-pushed the ps/rr/refactor_stdune___improve_path_tables branch from 76d37ba to 07a6f95 Compare July 11, 2023 20:49

rgrinberg merged commit 9a079d0 into main Jul 11, 2023

rgrinberg deleted the ps/rr/refactor_stdune___improve_path_tables branch July 11, 2023 20:50

rgrinberg mentioned this pull request Jul 13, 2023

fix: update digest and incremental db versions #8198

Merged

rgrinberg added a commit that referenced this pull request Jul 14, 2023

fix: update digest and incremental db versions (#8198)

329ba3d

#8052 updated the representation for these, but didn't bump the version. Signed-off-by: Rudi Grinberg <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(stdune): improve path tables #8052

refactor(stdune): improve path tables #8052

rgrinberg commented Jun 27, 2023

snowleopard left a comment

snowleopard commented Jul 9, 2023 •

edited

Loading

rgrinberg commented Jul 9, 2023

snowleopard commented Jul 10, 2023

rgrinberg commented Jul 10, 2023

snowleopard commented Jul 10, 2023 •

edited

Loading

rgrinberg commented Jul 11, 2023

snowleopard commented Jul 11, 2023

snowleopard commented Jul 11, 2023

snowleopard left a comment

snowleopard commented Jul 11, 2023

snowleopard commented Jul 11, 2023

refactor(stdune): improve path tables #8052

refactor(stdune): improve path tables #8052

Conversation

rgrinberg commented Jun 27, 2023

snowleopard left a comment

Choose a reason for hiding this comment

snowleopard commented Jul 9, 2023 • edited Loading

rgrinberg commented Jul 9, 2023

snowleopard commented Jul 10, 2023

rgrinberg commented Jul 10, 2023

snowleopard commented Jul 10, 2023 • edited Loading

rgrinberg commented Jul 11, 2023

snowleopard commented Jul 11, 2023

snowleopard commented Jul 11, 2023

snowleopard left a comment

Choose a reason for hiding this comment

snowleopard commented Jul 11, 2023

snowleopard commented Jul 11, 2023

snowleopard commented Jul 9, 2023 •

edited

Loading

snowleopard commented Jul 10, 2023 •

edited

Loading