Separate subtree size information from parse nodes. #4174

jonmeow · 2024-07-26T21:35:26Z

Move subtree sizes over to TreeAndSubtrees, using the different structure to represent the additional parse work that occurs, as well as making it clear which functions require the extra information. My intent is to make it hard to use this by accident.

The subtree size is still tracked during Parse::Tree construction. I think a lot of that can be cleaned up, although we use it during placeholder assignment so it may take some work. I wanted to see what people thought about this before taking action on such a change.

I'm using a 1m line source file generated by #4124 for testing. Command is time bazel-bin/toolchain/install/prefix_root/bin/carbon compile --phase=check --dump-mem-usage ~/tmp/data.carbon

At head, what I'm seeing is:

...
parse_tree_.node_impls_:
  used_bytes:      61516116
  reserved_bytes:  61516116
...
Total:
  used_bytes:      447814230
  reserved_bytes:  551663894
...
1.43s user 0.14s system 99% cpu 1.565 total

With Tree::Verify disabled completely, it looks like:

parse_tree_.node_impls_:
  used_bytes:      41010744
  reserved_bytes:  41010744
...
Total:
  used_bytes:      427308858
  reserved_bytes:  531158522
...
1.20s user 0.13s system 99% cpu 1.332 total

Re-enabling just the basic verification (what is now Tree::Verify), I'm seeing maybe 0.05s slower, but that's within noise for my system. I do see variability in my timing results, and overall I think this is a 0.2s +/- 0.1s improvement versus the earlier (always testing Extract code) implementation. That's opt; debug builds will be unaffected, because the same checking occurs as before.

Note, the subtree size is a third of the node representation, which is why I'm showing the decrease in memory usage here.

…4174) Move subtree sizes over to TreeAndSubtrees, using the different structure to represent the additional parse work that occurs, as well as making it clear which functions require the extra information. My intent is to make it hard to use this by accident. The subtree size is still tracked during Parse::Tree construction. I think a lot of that can be cleaned up, although we use it during placeholder assignment so it may take some work. I wanted to see what people thought about this before taking action on such a change. I'm using a 1m line source file generated by carbon-language#4124 for testing. Command is `time bazel-bin/toolchain/install/prefix_root/bin/carbon compile --phase=check --dump-mem-usage ~/tmp/data.carbon` At head, what I'm seeing is: ``` ... parse_tree_.node_impls_: used_bytes: 61516116 reserved_bytes: 61516116 ... Total: used_bytes: 447814230 reserved_bytes: 551663894 ... 1.43s user 0.14s system 99% cpu 1.565 total ``` With `Tree::Verify` disabled completely, it looks like: ``` parse_tree_.node_impls_: used_bytes: 41010744 reserved_bytes: 41010744 ... Total: used_bytes: 427308858 reserved_bytes: 531158522 ... 1.20s user 0.13s system 99% cpu 1.332 total ``` Re-enabling just the basic verification (what is now `Tree::Verify`), I'm seeing maybe 0.05s slower, but that's within noise for my system. I do see variability in my timing results, and overall I think this is a 0.2s +/- 0.1s improvement versus the earlier (always testing `Extract` code) implementation. That's opt; debug builds will be unaffected, because the same checking occurs as before. Note, the subtree size is a third of the node representation, which is why I'm showing the decrease in memory usage here.

github-actions bot added the toolchain label Jul 26, 2024

jonmeow changed the title ~~Separate child count information from parse nodes.~~ Separate subtree size information from parse nodes. Jul 26, 2024

Separate child count information from parse nodes.

d813e9a

jonmeow force-pushed the child-count branch from 4e37290 to 1b0ef9a Compare July 30, 2024 20:17

jonmeow marked this pull request as ready for review July 30, 2024 20:23

github-actions bot requested a review from josh11b July 30, 2024 20:24

jonmeow force-pushed the child-count branch from 1b0ef9a to 7b59c71 Compare July 30, 2024 20:24

jonmeow requested review from zygoloid and chandlerc and removed request for josh11b July 30, 2024 20:24

comment

c53d7fc

jonmeow force-pushed the child-count branch from 7b59c71 to c53d7fc Compare July 31, 2024 18:13

zygoloid approved these changes Jul 31, 2024

View reviewed changes

zygoloid added this pull request to the merge queue Jul 31, 2024

Merged via the queue into carbon-language:trunk with commit f67791c Jul 31, 2024
7 checks passed

jonmeow deleted the child-count branch July 31, 2024 21:22

jonmeow mentioned this pull request Aug 21, 2024

Re-evaluate optional features when optimizing compile time #4231

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate subtree size information from parse nodes. #4174

Separate subtree size information from parse nodes. #4174

jonmeow commented Jul 26, 2024 •

edited

Loading

Separate subtree size information from parse nodes. #4174

Separate subtree size information from parse nodes. #4174

Conversation

jonmeow commented Jul 26, 2024 • edited Loading

jonmeow commented Jul 26, 2024 •

edited

Loading