Optimize indexing comment collection #2547

vinistock · 2024-09-12T18:51:35Z

Motivation

This PR makes the index fetch entry documentation only when requested, rather than eagerly indexing all comments. The idea is that not storing all documentation for all entries eagerly will reduce memory usage and speed up initial indexing.

In benchmarks, the impact was not as high as I had hoped. About 7.5% memory reduction and 5% indexing time reduction.

Implementation

The idea is that we add a flag to ignore comments on the initial indexing. When a file is modified, we turn on the flag so that we can capture the comments on a file currently opened in the UI.

When anything tries to read comments, the index fetches them lazily using Prism.parse_file_comments.

Automated Tests

Existing tests cover it.

lib/ruby_indexer/lib/ruby_indexer/declaration_listener.rb

andyw8

Overall this seems failry unobtrusive for a small but decent reduction in memory use.

st0012

I think it should be considered a breaking change when Entry#comments is changed to return a string instead of an array of strings. Is this change necessary for the performance improvement?

vinistock · 2024-09-13T19:53:32Z

I think it should be considered a breaking change when Entry#comments is changed to return a string instead of an array of strings. Is this change necessary for the performance improvement?

Yes, it reduces the number of objects allocated since we're not longer maintaining and array of multiple strings, but instead a single string.

Conceptually, I agree, it's a breaking change. But do we know if any addon is actually invoking comments directly on entries? The Rails addon doesn't do that.

st0012 · 2024-09-13T20:43:12Z

Conceptually, I agree, it's a breaking change. But do we know if any addon is actually invoking comments directly on entries? The Rails addon doesn't do that.

I don't know any, and I don't plan to block the PR for this. But IMO it's worth adding the breaking change label here just in case.

Do we know how much of the memory reduction is contributed by lazy indexing, and how much is contributed by the type change?
If, for example, the lazy comment indexing just contributed to 1~2% of memory reduction by itself, then I'd prefer not maintaining 2 different comment collection logic.

vinistock · 2024-09-16T14:10:59Z

Do we know how much of the memory reduction is contributed by lazy indexing, and how much is contributed by the type change?
If, for example, the lazy comment indexing just contributed to 1~2% of memory reduction by itself, then I'd prefer not maintaining 2 different comment collection logic.

I just benchmarked this to compare. The lazy logic is responsible for 6.8% out of the 7.5% reduction. The reduction related to turning the comments from an array into strings is the smaller part.

st0012 · 2024-09-16T14:26:40Z

@vinistock Thanks for benchmarking the difference. IMO we can update this PR's title to be more generic, like "Optimize comment indexing", as the comment type change is also not trivial.

vinistock added server This pull request should be included in the server gem's release notes other Changes that aren't bugfixes, enhancements or breaking changes labels Sep 12, 2024

vinistock self-assigned this Sep 12, 2024

andyw8 reviewed Sep 12, 2024

View reviewed changes

lib/ruby_indexer/lib/ruby_indexer/declaration_listener.rb Outdated Show resolved Hide resolved

andyw8 reviewed Sep 12, 2024

View reviewed changes

lib/ruby_indexer/lib/ruby_indexer/declaration_listener.rb Outdated Show resolved Hide resolved

andyw8 approved these changes Sep 12, 2024

View reviewed changes

Lazily fetch entry comments

a0b74ed

vinistock force-pushed the vs-lazily-fetch-comments branch from 273719f to a0b74ed Compare September 13, 2024 18:46

vinistock marked this pull request as ready for review September 13, 2024 18:47

vinistock requested a review from a team as a code owner September 13, 2024 18:47

vinistock requested a review from st0012 September 13, 2024 18:47

st0012 reviewed Sep 13, 2024

View reviewed changes

st0012 approved these changes Sep 16, 2024

View reviewed changes

vinistock changed the title ~~Lazily fetch entry comments~~ Optimize indexing comment collection Sep 16, 2024

vinistock merged commit 9ad1d5d into main Sep 16, 2024
38 checks passed

vinistock deleted the vs-lazily-fetch-comments branch September 16, 2024 15:31

vinistock added breaking-change Non-backward compatible change and removed other Changes that aren't bugfixes, enhancements or breaking changes labels Sep 16, 2024

Earlopain mentioned this pull request Sep 20, 2024

Documentation comments are rendered in h1 #2582

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize indexing comment collection #2547

Optimize indexing comment collection #2547

vinistock commented Sep 12, 2024

andyw8 left a comment

st0012 left a comment

vinistock commented Sep 13, 2024

st0012 commented Sep 13, 2024

vinistock commented Sep 16, 2024

st0012 commented Sep 16, 2024

Optimize indexing comment collection #2547

Optimize indexing comment collection #2547

Conversation

vinistock commented Sep 12, 2024

Motivation

Implementation

Automated Tests

andyw8 left a comment

Choose a reason for hiding this comment

st0012 left a comment

Choose a reason for hiding this comment

vinistock commented Sep 13, 2024

st0012 commented Sep 13, 2024

vinistock commented Sep 16, 2024

st0012 commented Sep 16, 2024