core/state/snapshot: replace diffToDisk ideal batch size with 64MB #27977

aaronbuchwald · 2023-08-22T17:59:09Z

This PR updates diffToDisk(...) within snapshot flattening to write the entire diff to disk in a single batch.

Writing the diff to disk across multiple batches leads to the possibility that the snapshot will be corrupted during an ungraceful shutdown and can cause degraded performance during snapshot re-generation on startup.

rjl493456442 · 2023-08-23T02:47:49Z

Unfortunately, this will be problematic. In Ethereum, it's possible some contracts will be deleted, and the associated storage should be deleted from the snapshot. However, these contracts can be unbound and might result in out-of-memory(theoretically possible to build such contract, but have the cost to fulfill the slots though).

Therefore, we split the big write into several small ones.

holiman · 2023-08-23T07:43:26Z

These checks were added because they were needed, see #22582 for more details, and also #22497 (comment) for some history:

The ropsten BitlleGasStation was self-destructed at block 6490899, in this tx .

aaronbuchwald · 2023-08-23T15:34:56Z

Thanks for the context! I was under the mistaken impression that this change was made as a performance optimization as opposed to avoid an OOM.

aaronbuchwald · 2023-08-23T15:39:52Z

As an alternative to avoid snapshot corruption. Applying the diff from the same block should be an idempotent operation. Would you be open to a PR that:

write the previous and next block hash when applying a snapshot diffToDisk operation so that there's a marker when in the middle of a flatten operation of exactly which diff is being applied
on restart, if you observe a corrupted snapshot, re-process the same block to re-create the same diff and apply it to complete the incomplete diffToDisk operation

I understand that this may break some abstractions, so curious whether you'd be open to this idea, think it would be too gross to add, or even if snapshot corruption is not much of a concern in your view.

karalabe · 2023-08-23T15:49:55Z

Can't we have a middle ground? The IdealBatchSize is something like 100KB. That will trigger on a lot of contracts. But if we cap it at some significantly higher number (something reasonable in comparison with the max allowed deletion), then it should be fine almost always, except if you nuke your node exactly when it's deleting something larger?

karalabe · 2023-08-23T15:51:23Z

I guess there's no max allowed deletion in the snapshots. Still, could we special case this and cap it at a significantly higher number instead of 100KB? Say 64MB-256MB?

holiman · 2023-08-23T16:04:33Z

Say 64MB-256MB?

Should be fine; IMO rather 64 than 256

karalabe · 2023-08-24T09:03:20Z

core/state/snapshot/snapshot.go

-			// Ensure we don't delete too much data blindly (contract can be
-			// huge). It's ok to flush, the root will go missing in case of a
-			// crash and we'll detect and regenerate the snapshot.
-			if batch.ValueSize() > ethdb.IdealBatchSize {


Can we keep the code for now and replace ethdb.IdealBatchSize with 64 * 1024 * 1024 in both invocations? When removing SELFDESTRUCT lands in Cancun, we can nuke it out according to the original PR. In between we'd at least have a more robust code.

Just updated during only those two invocations.

…thereum#27977)

… 64MB (ethereum#27977)" This reverts commit 1f2bb01.

aaronbuchwald requested review from karalabe, holiman and rjl493456442 as code owners August 22, 2023 17:59

holiman closed this Aug 23, 2023

karalabe reopened this Aug 24, 2023

karalabe added this to the 1.13.0 milestone Aug 24, 2023

karalabe reviewed Aug 24, 2023

View reviewed changes

aaronbuchwald force-pushed the prevent-corrupted-snapshot branch from 9b3dd18 to dacb3fa Compare August 24, 2023 16:59

aaronbuchwald changed the title ~~core/state/snapshot: make diffToDisk atomic to prevent snapshot corruption~~ ethdb: update IdealBatchSize to 64MB to reduce likelihood of corrupted snapshot Aug 24, 2023

core/state/snapshot: replace diffToDisk ideal batch size with 64MB

87ffce8

aaronbuchwald force-pushed the prevent-corrupted-snapshot branch from dacb3fa to 87ffce8 Compare August 24, 2023 17:01

aaronbuchwald changed the title ~~ethdb: update IdealBatchSize to 64MB to reduce likelihood of corrupted snapshot~~ core/state/snapshot: replace diffToDisk ideal batch size with 64MB Aug 24, 2023

rjl493456442 approved these changes Aug 25, 2023

View reviewed changes

karalabe merged commit 56d2366 into ethereum:master Aug 25, 2023
1 check passed

devopsbo3 pushed a commit to HorizenOfficial/go-ethereum that referenced this pull request Nov 10, 2023

core/state/snapshot: replace diffToDisk ideal batch size with 64MB (e…

1f2bb01

…thereum#27977)

devopsbo3 added a commit to HorizenOfficial/go-ethereum that referenced this pull request Nov 10, 2023

Revert "core/state/snapshot: replace diffToDisk ideal batch size with…

92fd5bc

… 64MB (ethereum#27977)" This reverts commit 1f2bb01.

devopsbo3 added a commit to HorizenOfficial/go-ethereum that referenced this pull request Nov 10, 2023

Revert "core/state/snapshot: replace diffToDisk ideal batch size with…

19b32ad

… 64MB (ethereum#27977)" This reverts commit 1f2bb01.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core/state/snapshot: replace diffToDisk ideal batch size with 64MB #27977

core/state/snapshot: replace diffToDisk ideal batch size with 64MB #27977

aaronbuchwald commented Aug 22, 2023

rjl493456442 commented Aug 23, 2023

holiman commented Aug 23, 2023

aaronbuchwald commented Aug 23, 2023

aaronbuchwald commented Aug 23, 2023

karalabe commented Aug 23, 2023

karalabe commented Aug 23, 2023

holiman commented Aug 23, 2023

karalabe Aug 24, 2023

aaronbuchwald Aug 24, 2023

core/state/snapshot: replace diffToDisk ideal batch size with 64MB #27977

core/state/snapshot: replace diffToDisk ideal batch size with 64MB #27977

Conversation

aaronbuchwald commented Aug 22, 2023

rjl493456442 commented Aug 23, 2023

holiman commented Aug 23, 2023

aaronbuchwald commented Aug 23, 2023

aaronbuchwald commented Aug 23, 2023

karalabe commented Aug 23, 2023

karalabe commented Aug 23, 2023

holiman commented Aug 23, 2023

karalabe Aug 24, 2023

Choose a reason for hiding this comment

aaronbuchwald Aug 24, 2023

Choose a reason for hiding this comment