when writing to disk bucket index, tune towards packing tighter #30761

jeffwashington · 2023-03-16T22:30:20Z

Problem

See #30711.
The current implementation of disk buckets (as used by the accounts index on disk) was optimized for use as a hashmap with good speed in all cases.
The validator synchronizes the in-mem hash map with the disk-based one in the background.
Currently, we resize a data bucket when a search that starts at a random offset and examines max_search locations finds no empty spot. max_search defaults to approximately 32.
This max_search limit makes sense for the index buckets, where we have to search exhaustively on read and write to prove something does not exist. For data buckets, we just need to find any vacant bucket to store data. The offset is then stored in the index bucket.

Summary of Changes

Search 10x as many locations before resizing disk buckets. This results in more compact data buckets, improving performance for reads and writes. Insertions or updates that grow or shrink slot lists can be slower, but these happen only in the background.
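The search described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in bucket_map/src/bucket.rs: `DataBucket`, `find_vacant`, `data_bucket_max_search`, and both constants are hypothetical names, and the starting offset is passed in rather than chosen at random so the example stays deterministic. Clamping the probe count to the capacity is also an assumption of this sketch.

```rust
/// Hypothetical stand-in for a data bucket: each slot is occupied or vacant.
struct DataBucket {
    occupied: Vec<bool>,
}

/// Assumed default search limit ("approximately 32" in the description).
const MAX_SEARCH: usize = 32;
/// Multiplier from the summary ("Search 10x").
const DATA_SEARCH_MULTIPLIER: usize = 10;

impl DataBucket {
    fn new(capacity: usize) -> Self {
        Self {
            occupied: vec![false; capacity],
        }
    }

    /// Probe up to `max_search` slots starting at `start`, wrapping around.
    /// Returns the first vacant slot, or `None` to signal that the caller
    /// should resize the bucket.
    fn find_vacant(&self, start: usize, max_search: usize) -> Option<usize> {
        let cap = self.occupied.len();
        if cap == 0 {
            return None;
        }
        // Assumption of this sketch: never probe more slots than exist.
        let probes = max_search.min(cap);
        (0..probes)
            .map(|i| (start + i) % cap)
            .find(|&ix| !self.occupied[ix])
    }
}

/// Widened search limit for data buckets: any vacant slot will do, so a
/// longer search is acceptable if it avoids a resize.
fn data_bucket_max_search(index_max_search: usize) -> usize {
    index_max_search.saturating_mul(DATA_SEARCH_MULTIPLIER)
}
```

In this sketch, a bucket with 1024 slots whose first 100 slots are occupied returns `None` (resize) for `find_vacant(0, MAX_SEARCH)`, while `find_vacant(0, data_bucket_max_search(MAX_SEARCH))` finds slot 100 and no resize is needed.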

Fixes #

codecov · 2023-03-17T00:45:39Z

Codecov Report

Merging #30761 (4d4a009) into master (62fe6ea) will decrease coverage by 0.1%.
The diff coverage is 100.0%.

@@            Coverage Diff            @@
##           master   #30761     +/-   ##
=========================================
- Coverage    81.6%    81.6%   -0.1%     
=========================================
  Files         723      723             
  Lines      201791   201791             
=========================================
- Hits       164863   164804     -59     
- Misses      36928    36987     +59

brooksprumo

Are there already validator runtime metrics/perf results with this change?

bucket_map/src/bucket.rs

jeffwashington · 2023-03-17T19:30:17Z

Are there already validator runtime metrics/perf results with this change?

The light blue line is the validator with this change. It has approximately half the number of open files (bottom graph), and the data files use approximately 560M total bytes with this PR versus 750M on master, about 25% less. This means higher density: less wasted space to store the same data.

brooksprumo

lgtm

jeffwashington requested a review from brooksprumo March 16, 2023 22:30

when writing to disk bucket index, tune towards packing tighter

228b2e1

jeffwashington force-pushed the mm13 branch from eafa1f4 to 228b2e1 Compare March 16, 2023 22:31

brooksprumo reviewed Mar 17, 2023

bucket_map/src/bucket.rs (outdated, resolved)

switch to min

4d4a009

jeffwashington requested a review from brooksprumo March 17, 2023 19:30

brooksprumo approved these changes Mar 17, 2023

jeffwashington merged commit 6dd5a22 into solana-labs:master Mar 17, 2023

behzadnouri pushed a commit to behzadnouri/solana that referenced this pull request Mar 18, 2023

when writing to disk bucket index, tune towards packing tighter (sola…

efc0f4b

…na-labs#30761) * when writing to disk bucket index, tune towards packing tighter * switch to min
