
feat: add hashcache to pbss difflayer #29991

Open · will-2012 wants to merge 3 commits into master

Conversation

@will-2012 commented Jun 13, 2024

Description

In the native transfer test scenario, we found that the bottleneck of pbss trie node reading is the recursive query through up to 128 difflayers. This PR mainly optimizes pbss trie node read latency.

Rationale

In the difflayer, a new cache map keyed by node hash is added. The Node interface checks this cache first: on a hit the node is returned directly; on a miss the disklayer is queried. Compared with recursively querying up to 128 difflayers, the cache lookup is O(1), so most reads take the fastpath.
The cache is populated when a difflayer is added to the layer tree and released when the difflayer is merged into the disklayer.
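The scheme above can be sketched roughly as follows. `Hash` and `RefTrieNode` here are simplified stand-ins for go-ethereum's `common.Hash` and this PR's refcounted entry; the refcounting and locking details are illustrative assumptions, not the PR's exact code:

```go
package main

import (
	"fmt"
	"sync"
)

// Hash is a stand-in for common.Hash.
type Hash [32]byte

// RefTrieNode is a refcounted cache entry: the same node may be
// referenced by several difflayers, so it is only evicted once every
// referencing layer has been merged into the disklayer.
type RefTrieNode struct {
	refs uint32 // number of difflayers referencing this node
	blob []byte // encoded trie node
}

// HashNodeCache sketches the hash-keyed fastpath: a trie node's hash
// never changes, so a hit can be returned directly instead of walking
// the (up to 128) difflayers recursively.
type HashNodeCache struct {
	mu    sync.RWMutex
	cache map[Hash]*RefTrieNode
}

func NewHashNodeCache() *HashNodeCache {
	return &HashNodeCache{cache: make(map[Hash]*RefTrieNode)}
}

// Add is called when a difflayer is pushed onto the layer tree.
func (c *HashNodeCache) Add(hash Hash, blob []byte) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if n, ok := c.cache[hash]; ok {
		n.refs++
		return
	}
	c.cache[hash] = &RefTrieNode{refs: 1, blob: blob}
}

// Release is called when a difflayer is merged into the disklayer.
func (c *HashNodeCache) Release(hash Hash) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if n, ok := c.cache[hash]; ok {
		if n.refs--; n.refs == 0 {
			delete(c.cache, hash)
		}
	}
}

// Get is the O(1) fastpath; on a miss the caller falls back to the disklayer.
func (c *HashNodeCache) Get(hash Hash) ([]byte, bool) {
	c.mu.RLock()
	defer c.mu.RUnlock()
	if n, ok := c.cache[hash]; ok {
		return n.blob, true
	}
	return nil, false
}

func main() {
	c := NewHashNodeCache()
	var h Hash
	h[0] = 0xab
	c.Add(h, []byte("node-rlp"))
	c.Add(h, []byte("node-rlp")) // a second difflayer references the same node
	c.Release(h)                 // first layer merged: entry must survive
	if blob, ok := c.Get(h); ok {
		fmt.Printf("hit: %s\n", blob)
	}
	c.Release(h) // last reference gone: entry evicted
	_, ok := c.Get(h)
	fmt.Println("present after full release:", ok)
}
```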

Performance


Test scenario: 1 million accounts randomly selected from 25 million accounts, with native transfer transactions executed at 600 QPS. The difflayer cache optimization is significant: the validation phase dropped from 450ms to 100ms.

Changes

Notable changes:

  • Pbss difflayer: add hash cache.

@karalabe (Member) commented Jun 13, 2024

Hmmm, interesting idea with using the hash as the cache key since it can't change, so you don't need to track liveness across layers. Out of curiosity, wouldn't using an existing cache type work instead of defining your own? We use (I think) victoriametrics.

Also, do you have any benchmarks or numbers that you could point us to? Edit: was added later.

@will-2012 (Author)

> Hmmm, interesting idea with using the hash as the cache key since it can't change, so you don't need to track liveness across layers. Out of curiosity, wouldn't using an existing cache type work instead of defining your own? We use (I think) victoriametrics.
>
> Also, do you have any benchmarks or numbers that you could point us to? Edit: was added later.

Using fastcache performs basically the same, but it takes up extra memory~

@karalabe (Member) commented Jun 13, 2024

Seems your PR has a consensus fault in it, PTAL: https://ci.appveyor.com/project/ethereum/go-ethereum/builds/50010174

@rjl493456442 (Member)

@karalabe this pull request is not compatible with verkle. I guess it might be the reason for the failing test.

@will-2012 (Author)

> Seems your PR has a consensus fault in it, PTAL: https://ci.appveyor.com/project/ethereum/go-ethereum/builds/50010174

Fixed~

@rjl493456442 (Member)

Dumping some preliminary performance data after benchmarking this branch against master for a few days (~85 hours):

  • This branch is 7 hours ahead of master, roughly an 8% performance speedup.
  • The speedup mostly comes from accountUpdate, which involves trie node retrieval.
  • EVM execution is slightly slower (5ms per block on average), which might be related to some weird side effects.
  • Triedb commit is slightly slower (2ms per block) due to the overhead of cache maintenance.

I think the idea is valuable and I am very curious how much performance gain we can have by applying it to state snapshot.

@will-2012 (Author)

> how much performance gain we can have by applying it to state snapshot.

This is a very good idea and may produce good results; it is worth trying. I will try it in the near future and update the results here if there are any.

```go
	cache: make(map[common.Hash]*RefTrieNode),
}
case *diffLayer:
	dl.origin = l.originDiskLayer()
```

Here you can get the parent origin directly without the read lock, because it has already been obtained by the time this is called.

@will-2012 (Author)

Semantically, the origin disklayer is mutable; reads and writes of origin may in fact already be serialized, so the lock guard is not strictly necessary, but keeping it may be clearer :)
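The locking trade-off discussed here can be sketched with hypothetical `diffLayer`/`diskLayer` stand-ins (the real go-ethereum types carry far more state; only the origin accessors are shown):

```go
package main

import (
	"fmt"
	"sync"
)

// diskLayer is a stand-in for the bottom-most persisted layer.
type diskLayer struct{ id uint64 }

// diffLayer is a stand-in for an in-memory diff layer whose origin
// pointer can change when layers are flattened.
type diffLayer struct {
	mu     sync.RWMutex
	origin *diskLayer // disk layer this diff ultimately rests on
}

// originDiskLayer reads origin under the read lock. As the review
// notes, a caller that already holds the lock could read the field
// directly, but the guard keeps the access pattern explicit.
func (dl *diffLayer) originDiskLayer() *diskLayer {
	dl.mu.RLock()
	defer dl.mu.RUnlock()
	return dl.origin
}

// setOriginDiskLayer updates origin, e.g. after a flatten.
func (dl *diffLayer) setOriginDiskLayer(origin *diskLayer) {
	dl.mu.Lock()
	defer dl.mu.Unlock()
	dl.origin = origin
}

func main() {
	parent := &diffLayer{origin: &diskLayer{id: 1}}
	// A child layer inherits the parent's origin, mirroring
	// "dl.origin = l.originDiskLayer()" in the reviewed snippet.
	child := &diffLayer{origin: parent.originDiskLayer()}
	fmt.Println("child origin id:", child.originDiskLayer().id)
}
```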

@will-2012 (Author)

> how much performance gain we can have by applying it to state snapshot.

At present, adding a multi-version cache to the snapshot difflayer does not improve performance. The main reasons are as follows:

  1. The snapshot is a flat KV store without the MPT amplification effect; in the test data set, its access volume is about ~20% of the MPT trie's.
  2. The snapshot difflayer has a bloom filter to avoid unnecessary accesses.
  3. The MPT has many internal shared-node accesses, so the probability of hitting a difflayer is high, about ~60%; snapshot access is flat account KV access, so the difflayer hit probability is low, about ~30%.
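Point 2 can be illustrated with a toy bloom filter: a negative check proves the key is absent from that difflayer, so the layer is skipped without any map lookup, which is why an extra cache buys little there. The bit-array size and hash scheme below are illustrative, not the snapshot's actual filter:

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// bloom is a minimal bloom filter: k=3 hash positions over 4096 bits.
type bloom struct {
	bits [1 << 12]bool
}

// positions derives three bit positions for a key by salting FNV-1a.
func (b *bloom) positions(key string) [3]uint32 {
	var pos [3]uint32
	for i := 0; i < 3; i++ {
		h := fnv.New32a()
		h.Write([]byte{byte(i)}) // salt so the three hashes differ
		h.Write([]byte(key))
		pos[i] = h.Sum32() % uint32(len(b.bits))
	}
	return pos
}

// add marks a key as (possibly) present.
func (b *bloom) add(key string) {
	for _, p := range b.positions(key) {
		b.bits[p] = true
	}
}

// maybeContains returns false only when the key is definitely absent;
// true means "possibly present", so the real lookup must still run.
func (b *bloom) maybeContains(key string) bool {
	for _, p := range b.positions(key) {
		if !b.bits[p] {
			return false // definitely absent: skip this difflayer
		}
	}
	return true
}

func main() {
	var b bloom
	b.add("account-0xabc")
	fmt.Println(b.maybeContains("account-0xabc")) // always true once added
	fmt.Println(b.maybeContains("account-0xdef")) // almost certainly false (false positives are possible)
}
```

No false negatives are possible, so a miss safely short-circuits the layer; the cost of an occasional false positive is just one redundant lookup.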
