
Trim extra shred bytes in blockstore #16602

Merged

steviez merged 8 commits into solana-labs:master from trim_extra_shred_bytes on Apr 27, 2021

Conversation

@steviez (Contributor) commented Apr 16, 2021

Problem

Data shred bytestreams are currently inserted into the blockstore with their zero-padding intact. Every data shred carries at least some zero-padding because of the "restricted" section at the end of a data shred's payload:
https://github.com/solana-labs/solana/blob/master/ledger/src/shred.rs#L44-L50

Furthermore, data shreds that are not filled to capacity with data carry additional zero-padding to fill out the packet.

The result of these two items is that the blockstore is bloated with extraneous bytes.

Summary of Changes

Trim the extra padding bytes off of each shred on insertion; "re-expand" the bytestream when retrieving it from the blockstore to avoid breaking the erasure coding algorithm (which requires that coding and data shreds be the same length).

Fixes #16236
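
For illustration, here is a minimal sketch of the trim-on-insert / expand-on-fetch idea, assuming a fixed payload size and a data header that records the real data length; the constant and function names below are hypothetical, not the actual code from this PR:

```rust
// Hypothetical names and sizes; a sketch of the idea, not the PR's code.
const SHRED_PAYLOAD_SIZE: usize = 1228; // assumed fixed shred payload length

/// On insert: keep only the meaningful prefix. `data_size` would come from
/// the shred's data header, which records how many payload bytes are real
/// data; everything after that is zero-padding.
fn trim_for_insert(payload: &[u8], data_size: usize) -> &[u8] {
    &payload[..data_size.min(payload.len())]
}

/// On fetch: zero-pad back to the fixed size so the erasure coding
/// algorithm still sees data and coding shreds of equal length.
fn expand_on_fetch(mut stored: Vec<u8>) -> Vec<u8> {
    stored.resize(SHRED_PAYLOAD_SIZE, 0);
    stored
}

fn main() {
    let mut payload = vec![0u8; SHRED_PAYLOAD_SIZE];
    payload[..5].copy_from_slice(b"hello"); // 5 bytes of real data

    let trimmed = trim_for_insert(&payload, 5).to_vec(); // what gets stored
    assert_eq!(trimmed.len(), 5);

    let restored = expand_on_fetch(trimmed); // what erasure coding sees
    assert_eq!(restored, payload);
}
```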

@codecov (bot) commented Apr 16, 2021

Codecov Report

Merging #16602 (40ed492) into master (8f56c11) will decrease coverage by 0.0%.
The diff coverage is 97.1%.

@@            Coverage Diff            @@
##           master   #16602     +/-   ##
=========================================
- Coverage    82.7%    82.7%   -0.1%     
=========================================
  Files         414      414             
  Lines      115686   115702     +16     
=========================================
+ Hits        95760    95765      +5     
- Misses      19926    19937     +11     

@steviez (Contributor, Author) commented Apr 19, 2021

@sakridge - I ended up cherry-picking these commits into my own branch since they were failing CI before; it appears those failures were either unrelated issues that have since been fixed or a non-deterministic failure.

Regardless of RocksDB vs. AccountsDB, I think we still want this change to keep our backend datastore as slim as possible, right?

Assuming so, any thoughts on additional testing that should be done to validate this PR? The .resize() on the vector seems like the only thing that could affect performance.

Edit: I'm pushing in one more commit to add a comment or two so CI will re-run, but everything had passed:
[screenshot: all CI checks passing]
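
As a rough, hypothetical way to sanity-check the cost of that resize in isolation (this micro-benchmark is not from the PR or the thread; the lengths are assumptions):

```rust
use std::time::Instant;

fn main() {
    const TRIMMED_LEN: usize = 1100; // assumed stored (trimmed) payload length
    const FULL_LEN: usize = 1228;    // assumed full shred payload length
    const ITERS: u32 = 1_000_000;

    let mut v = vec![0u8; TRIMMED_LEN];
    v.reserve(FULL_LEN); // ensure capacity so resize never reallocates mid-loop
    let start = Instant::now();
    for _ in 0..ITERS {
        v.resize(FULL_LEN, 0);   // the zero-fill a fetch would perform
        v.truncate(TRIMMED_LEN); // shrink back for the next iteration
    }
    println!("avg resize cost: {:?}", start.elapsed() / ITERS);
}
```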

@sakridge (Member) replied, quoting the comment above:

Yep, we still want it; exactly as you said, it keeps the db as small as possible.

@steviez steviez marked this pull request as ready for review April 21, 2021 14:41
@steviez steviez requested a review from sakridge April 21, 2021 14:41
@steviez (Contributor, Author) commented Apr 21, 2021

Here are the performance run results:
https://solanalabs.slack.com/archives/CP2L2S4KV/p1618955757009100

mean_tps: 21020
max_tps: 50685.5
mean_confirmation_ms: 2244
max_confirmation_ms: 20425
99th_percentile_confirmation_ms: 10463
max_tower_distance: 164
last_tower_distance: 63.5
slots_per_second: 2.041

This does seem to be worse than recent runs on the latest master (which are hitting 55k+ max TPS).

@sakridge (Member) replied, quoting the performance numbers above:

I think it might be within the noise of this measurement. You could compare the specific insert_shred or the fetch_entries part of replay stats. Those correspond to when we store or read shreds from the db.

@sakridge sakridge requested a review from carllin April 21, 2021 17:43
sakridge previously approved these changes Apr 21, 2021

@sakridge (Member) left a comment:


lgtm

@steviez (Contributor, Author) commented Apr 21, 2021

(quoting sakridge's suggestion above to compare the insert_shred and fetch_entries stats)

So I dug a little deeper on this and compared my run against the previous nightly runs. In the next two graphs, the first two humps are the previous two nightly runs; the third hump is my run.

Here is insert:
[graph: shred insert timings across the three runs]
and here is fetch:
[graph: fetch_entries timings across the three runs]

Digging deeper into the Grafana graphs, the data is pretty "jagged", so I think I would agree that any observed difference is within the noise. I'm happy if you are.

@carllin (Contributor) commented Apr 21, 2021

nit: Summary of Changes section seems to have been cut off 😃

@steviez steviez force-pushed the trim_extra_shred_bytes branch from ea3f8dd to 66c9c61 Compare April 22, 2021 16:09
@mergify mergify bot dismissed sakridge’s stale review April 22, 2021 16:09

Pull request has been modified.

@steviez steviez force-pushed the trim_extra_shred_bytes branch 2 times, most recently from e9017be to 28f7ad8 Compare April 23, 2021 17:22
core/src/serve_repair.rs: review comment (outdated, resolved)
@steviez steviez force-pushed the trim_extra_shred_bytes branch 2 times, most recently from 35b4da4 to f25c9f5 Compare April 26, 2021 16:43
@steviez (Contributor, Author) commented Apr 26, 2021

Would it make more sense to fold this into test_should_insert_data_shred? That way we can also add a test case in test_should_insert_data_shred

Yeah, I think you're right. We already have a similar validity check in should_insert_data_shred, and it makes sense to check this as early as possible.
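
As a hypothetical illustration of what such an early size check could look like (the struct and function below are simplified stand-ins, assuming a fixed payload size; the real should_insert_data_shred performs several other validity checks as well):

```rust
const SHRED_PAYLOAD_SIZE: usize = 1228; // assumed fixed shred payload length

// Simplified stand-in for the real data shred type.
struct DataShred {
    payload: Vec<u8>,
}

/// Reject payloads larger than the fixed shred size before any trimming
/// or insertion work happens.
fn payload_size_is_valid(shred: &DataShred) -> bool {
    shred.payload.len() <= SHRED_PAYLOAD_SIZE
}

fn main() {
    let ok = DataShred { payload: vec![0u8; SHRED_PAYLOAD_SIZE] };
    let oversized = DataShred { payload: vec![0u8; SHRED_PAYLOAD_SIZE + 1] };
    assert!(payload_size_is_valid(&ok));
    assert!(!payload_size_is_valid(&oversized));
}
```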

carllin previously approved these changes Apr 27, 2021
@steviez steviez force-pushed the trim_extra_shred_bytes branch from f25c9f5 to 40ed492 Compare April 27, 2021 18:27
@mergify mergify bot dismissed carllin’s stale review April 27, 2021 18:28

Pull request has been modified.

@steviez steviez merged commit bc31378 into solana-labs:master Apr 27, 2021
@steviez steviez deleted the trim_extra_shred_bytes branch April 27, 2021 22:42
steviez pushed a commit to steviez/solana that referenced this pull request May 10, 2021
Strip the zero-padding off of data shreds before insertion into blockstore

Co-authored-by: Stephen Akridge <[email protected]>
Co-authored-by: Nathan Hawkins <[email protected]>
steviez pushed a commit to steviez/solana that referenced this pull request May 10, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
steviez pushed a commit to steviez/solana that referenced this pull request May 10, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
steviez pushed a commit to steviez/solana that referenced this pull request May 10, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
carllin pushed a commit to carllin/solana that referenced this pull request May 11, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
carllin pushed a commit to carllin/solana that referenced this pull request May 11, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
carllin pushed a commit to carllin/solana that referenced this pull request May 11, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
steviez pushed a commit to steviez/solana that referenced this pull request May 14, 2021
This is a partial backport of solana-labs#16602 to allow compatibility with that change.
steviez pushed a commit that referenced this pull request May 14, 2021
* Zero pad data shreds on fetch from blockstore

This is a partial backport of #16602 to allow compatibility with that change.

* Remove size check and resize shreds to consistent length
Successfully merging this pull request may close these issues:

Data shreds are stored with 0 padding in rocksdb (#16236)