
Perf: Relax locking contention for cache and cachekv #353

Merged: 5 commits merged into main from yzang/SEI-6061 on Dec 5, 2023

Conversation

@yzang2019 (Collaborator) commented Nov 17, 2023

Describe your changes and provide context

Problem:
Profiling currently shows a lot of lock contention in the cachekv layer. This is because we take a mutex for every key read and write, but cachekv, as a transient cache, doesn't really need such a strict locking mechanism. High lock contention significantly hurts parallel transaction execution performance.

Solution:

  • Replace BoundedCache with sync.Map to get per-key locking. We don't need to bound the cache size for the transient cachekv store, since the cache is destroyed once the block is finalized.
  • Do not read through the cache. Previously, Get also wrote the fetched value back to the cache, which required a lock around the whole read + write-back operation. For a transient cache this read-through behavior doesn't improve the hit rate much, and removing it reduces contention considerably (see the sketch below).
  • Relax and narrow the locking scope for commitkvcache, which will still be used as an inter-block cache.
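
A minimal sketch of the first two points, using illustrative names (parentStore, transientKV) rather than the repository's actual types: dirty writes live in a sync.Map, and a cache miss on Get falls through to the parent without writing the value back, so the read path needs no store-wide mutex.

```go
package main

import (
	"fmt"
	"sync"
)

// parentStore stands in for the underlying committed KV store.
type parentStore struct {
	mtx  sync.RWMutex
	data map[string][]byte
}

func (p *parentStore) Get(key string) []byte {
	p.mtx.RLock()
	defer p.mtx.RUnlock()
	return p.data[key]
}

// transientKV is an illustrative cachekv-style store: writes go into a
// sync.Map (per-key synchronization), and a Get miss falls through to the
// parent WITHOUT populating the cache (no read-through).
type transientKV struct {
	cache  sync.Map // string -> []byte
	parent *parentStore
}

func (s *transientKV) Set(key string, value []byte) {
	s.cache.Store(key, value)
}

func (s *transientKV) Get(key string) []byte {
	if v, ok := s.cache.Load(key); ok {
		return v.([]byte)
	}
	// Miss: read from the parent directly and do not write the value back,
	// so no read+write-back critical section is needed.
	return s.parent.Get(key)
}

func main() {
	parent := &parentStore{data: map[string][]byte{"k1": []byte("committed")}}
	s := &transientKV{parent: parent}
	s.Set("k2", []byte("dirty"))
	fmt.Printf("%s %s\n", s.Get("k1"), s.Get("k2")) // committed dirty
}
```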

Testing performed to validate your change

Fully tested in loadtest env

codecov bot commented Nov 21, 2023

Codecov Report

Merging #353 (fe8744a) into main (5f04ca7) will decrease coverage by 0.04%.
Report is 1 commit behind head on main.
The diff coverage is 91.80%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #353      +/-   ##
==========================================
- Coverage   55.41%   55.38%   -0.04%     
==========================================
  Files         620      620              
  Lines       51694    51605      -89     
==========================================
- Hits        28646    28581      -65     
+ Misses      20964    20941      -23     
+ Partials     2084     2083       -1     
Files Coverage Δ
store/cache/cache.go 78.57% <100.00%> (+0.31%) ⬆️
store/cachekv/store.go 81.57% <100.00%> (+9.28%) ⬆️
x/auth/ante/sigverify.go 63.35% <100.00%> (-0.23%) ⬇️
x/auth/types/params.go 76.04% <ø> (ø)
x/auth/ante/batch_sigverify.go 0.00% <0.00%> (ø)

... and 2 files with indirect coverage changes

@stevenlanders (Contributor) left a comment

This looks good - it might make sense to do a gobench test before/after the change to help double-check the throughput increases (hard to eyeball, this could be much faster, not sure)
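
A hedged sketch of such a benchmark; the store type below is a stand-in, not the actual cachekv API. Running it with `go test -bench` on the commits before and after this change and comparing the outputs with benchstat would quantify the throughput difference.

```go
package cachekv_test

import (
	"strconv"
	"sync"
	"testing"
)

// syncMapStore is a stand-in for the store under test; a real benchmark would
// construct the cachekv store over an in-memory parent instead.
type syncMapStore struct{ m sync.Map }

func (s *syncMapStore) Set(k, v string) { s.m.Store(k, v) }

func (s *syncMapStore) Get(k string) (string, bool) {
	v, ok := s.m.Load(k)
	if !ok {
		return "", false
	}
	return v.(string), true
}

// BenchmarkParallelGet measures contended reads. Run it on each commit with
// `go test -bench=ParallelGet -count=10 > <old|new>.txt` and compare using
// `benchstat old.txt new.txt`.
func BenchmarkParallelGet(b *testing.B) {
	s := &syncMapStore{}
	for i := 0; i < 1024; i++ {
		s.Set("key"+strconv.Itoa(i), "value")
	}
	b.ResetTimer()
	b.RunParallel(func(pb *testing.PB) {
		i := 0
		for pb.Next() {
			s.Get("key" + strconv.Itoa(i%1024))
			i++
		}
	})
}
```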

} else {
value = cacheValue.Value()
return store.parent.Get(key)
Contributor

small nit, we can drop the else block here and just return

Collaborator Author

True, will fix.


// We need a copy of all of the keys.
// Not the best, but probably not a bottleneck depending.
keys := make([]string, 0, store.cache.Len())
keys := []string{}
Contributor

To avoid the allocations, keeping the size at 0 and capacity at store.cache.Len() can actually be better.

Collaborator Author

Yeah, but unfortunately sync.Map doesn't support length, which is why we removed all of the length-based pre-sizing here.
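
For illustration, a small self-contained example of why the pre-sizing was dropped; the cache variable is a stand-in for the store's internal map. sync.Map exposes no Len(), so the only way to collect keys is to walk the map with Range and let the slice grow as needed.

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

func main() {
	var cache sync.Map // stand-in for the store's transient cache
	cache.Store("a", []byte("1"))
	cache.Store("b", []byte("2"))

	// No Len() is available, so the slice cannot be pre-sized with a capacity.
	keys := []string{}
	cache.Range(func(key, _ any) bool {
		keys = append(keys, key.(string))
		return true
	})
	sort.Strings(keys)
	fmt.Println(keys) // [a b]
}
```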

}
}

// Clear the cache using the map clearing idiom
// and not allocating fresh objects.
// Please see https://bencher.orijtech.com/perfclinic/mapclearing/
store.cache.DeleteAll()
store.cache.Range(func(key, value any) bool {
Contributor

side note: I wonder if it would make sense to just set the cache to a new map (requires concurrency protection) and let the garbage collector clean up the old one. This is logically fine.
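
A rough sketch of that alternative, with hypothetical names, assuming the reset happens at a point where a brief mutex hold is acceptable:

```go
package main

import (
	"fmt"
	"sync"
)

// transientCache illustrates the suggestion: instead of ranging over the
// sync.Map and deleting every key, swap in a fresh map under a mutex and let
// the garbage collector reclaim the old one.
type transientCache struct {
	mtx   sync.Mutex
	cache *sync.Map
}

func (c *transientCache) reset() {
	c.mtx.Lock()
	defer c.mtx.Unlock()
	c.cache = &sync.Map{} // old map becomes garbage; no per-key deletes needed
}

func main() {
	c := &transientCache{cache: &sync.Map{}}
	c.cache.Store("k", "v")
	c.reset()
	// (in concurrent use, loading c.cache would also need the mutex or an atomic pointer)
	if _, ok := c.cache.Load("k"); !ok {
		fmt.Println("cache cleared")
	}
}
```

The trade-off is that every reader would then need some synchronization just to load the current map pointer, which may be why the PR keeps the existing range-and-delete logic.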

@yzang2019 (Collaborator, Author) commented Dec 4, 2023

Yeah, this is a good question. I've thought about it, but I think it could be risky, and I'm honestly not sure why we didn't do that initially, so it's better to keep the same logic for now.

@yzang2019 merged commit 628c7e4 into main on Dec 5, 2023
15 checks passed
@yzang2019 deleted the yzang/SEI-6061 branch on December 5, 2023 16:12
yzang2019 added a commit that referenced this pull request Jan 17, 2024
## Describe your changes and provide context
This PR reverts the changes in #353 and #391 until we have OCC fully enabled.

## Testing performed to validate your change
Unit test coverage
codchen pushed a commit that referenced this pull request Feb 6, 2024