Speed up memory db by using fixed-size keys in skiplist search #387

holiman · 2021-12-13T12:47:48Z

This PR is a follow-up to #385, and only the last two commits are "this PR".

The first "this PR" commit contains a small command-line utility that runs a set of Put operations and exports the timings as JSON. The JSON can be turned into graphs with this little tool: https://github.com/holiman/mkchart . Note that the stats package can be dropped; I only included it in this PR so that other people can reproduce my results and try out other combinations.
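For readers who want a feel for what such a timing run looks like, here is a minimal sketch. It is not the utility from this PR: the `putter` interface, the map-backed `mapStore`, and the JSON field names are all hypothetical, and the output is not claimed to match what mkchart expects.

```go
package main

import (
	"crypto/rand"
	"encoding/json"
	"fmt"
	"time"
)

// putter is a hypothetical stand-in for whatever store is being measured.
type putter interface {
	Put(key, value []byte) error
}

// mapStore is a placeholder backend so the sketch runs without dependencies.
type mapStore map[string][]byte

func (m mapStore) Put(key, value []byte) error {
	m[string(key)] = append([]byte(nil), value...)
	return nil
}

// sample records the cumulative elapsed time after Count insertions.
// The field names are made up for this sketch.
type sample struct {
	Count     int   `json:"count"`
	ElapsedNs int64 `json:"elapsed_ns"`
}

// measure inserts n random keys of keySize bytes with valSize-byte values and
// takes one sample every `every` insertions (every must be > 0).
// Key generation is included in the measured time, which a real tool may avoid.
func measure(db putter, n, every, keySize, valSize int) ([]sample, error) {
	var out []sample
	key := make([]byte, keySize)
	val := make([]byte, valSize)
	start := time.Now()
	for i := 1; i <= n; i++ {
		if _, err := rand.Read(key); err != nil {
			return nil, err
		}
		if err := db.Put(key, val); err != nil {
			return nil, err
		}
		if i%every == 0 {
			out = append(out, sample{Count: i, ElapsedNs: time.Since(start).Nanoseconds()})
		}
	}
	return out, nil
}

func main() {
	samples, err := measure(mapStore{}, 100000, 10000, 32, 96)
	if err != nil {
		panic(err)
	}
	out, err := json.Marshal(samples)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```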

The last commit contains the actual change.

Problem

Let's take a look at the graphs for storing 4M key/value pairs. In every example, the key is 32 bytes. The graphs show value sizes of 32, 96 and 256 bytes, respectively.

As can be seen, when the values become larger, the Put operation becomes slower. This is due to the skiplist search: for every Put, we need to find the correct skiplist position. This entails iterating the nodeData structure, but also accessing the kvData structure to do the key comparisons.

With larger kvData, we get less benefit from the various caches, and the memory accesses slow the whole thing down.
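To make the access pattern concrete, here is a heavily simplified sketch: a single-level sorted list rather than a real skiplist, with a node layout invented for illustration (it is not the memdb layout). The point it tries to show is that every comparison during a search dereferences into kvData, where keys sit interleaved with values.

```go
package main

import (
	"bytes"
	"fmt"
)

// Slots of one node inside nodeData (hypothetical layout).
const (
	nKV      = iota // offset of the key/value pair in kvData
	nKey            // key length
	nVal            // value length
	nNext           // index of the next node, 0 means end of list
	nodeSize        // number of slots per node
)

type list struct {
	kvData   []byte // keys and values, tightly packed
	nodeData []int  // node metadata; node 0 is the head sentinel
}

func newList() *list { return &list{nodeData: make([]int, nodeSize)} }

// add appends a pair at the tail; the caller must add keys in ascending order.
func (l *list) add(key, value []byte) {
	node := len(l.nodeData)
	l.nodeData = append(l.nodeData, len(l.kvData), len(key), len(value), 0)
	l.kvData = append(l.kvData, key...)
	l.kvData = append(l.kvData, value...)
	tail := 0
	for l.nodeData[tail+nNext] != 0 {
		tail = l.nodeData[tail+nNext]
	}
	l.nodeData[tail+nNext] = node
}

// findGE returns the first node whose key is >= key, plus the number of times
// the search had to read a key out of kvData.
func (l *list) findGE(key []byte) (node, kvReads int) {
	for node = l.nodeData[nNext]; node != 0; node = l.nodeData[node+nNext] {
		o := l.nodeData[node+nKV]
		kvReads++ // larger values push keys further apart in kvData
		if bytes.Compare(l.kvData[o:o+l.nodeData[node+nKey]], key) >= 0 {
			return node, kvReads
		}
	}
	return 0, kvReads
}

func main() {
	l := newList()
	l.add([]byte("a"), make([]byte, 256))
	l.add([]byte("b"), make([]byte, 256))
	l.add([]byte("c"), make([]byte, 256))
	node, reads := l.findGE([]byte("b"))
	fmt.Println("node:", node, "kvData reads:", reads)
}
```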

Solution

This PR frees up one field in the skiplist node by packing the height element into the keySize. The uint64 which previously held the height now holds 8 bytes of the key.
During the search, instead of loading the full key, we can do a quickCmp using the data we've already loaded.

(Note: this trick can only be performed if the memdb is configured to use the DefaultComparer; otherwise it falls back to using the configured comparer.)
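A minimal sketch of the idea described above, under stated assumptions: the bit split in `packMeta` (height in the low 8 bits) and all function names except `quickCmp` are invented here, and the actual code in the PR may be organised differently. The prefix comparison only decides the order when the two prefixes differ; a tie falls back to a full key comparison.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

const prefixLen = 8 // bytes of the key cached in the node (64-bit case)

// packMeta stores the height in the low 8 bits and the key size above it.
// This particular split is an assumption made for illustration.
func packMeta(keySize, height int) uint64 {
	return (uint64(keySize) << 8) | (uint64(height) & 0xff)
}

func unpackMeta(m uint64) (keySize, height int) {
	return int(m >> 8), int(m & 0xff)
}

// keyPrefix returns the first 8 bytes of key as a big-endian integer,
// right-padded with zero bytes if the key is shorter.
func keyPrefix(key []byte) uint64 {
	var buf [prefixLen]byte
	copy(buf[:], key)
	return binary.BigEndian.Uint64(buf[:])
}

// quickCmp compares two cached prefixes. A result of 0 is inconclusive.
func quickCmp(nodePrefix, searchPrefix uint64) int {
	switch {
	case nodePrefix < searchPrefix:
		return -1
	case nodePrefix > searchPrefix:
		return 1
	}
	return 0
}

// compare uses the cached prefix first and loads the full key only on a tie.
// loadKey stands in for the (expensive) read from kvData.
func compare(nodePrefix uint64, loadKey func() []byte, search []byte) int {
	if c := quickCmp(nodePrefix, keyPrefix(search)); c != 0 {
		return c
	}
	return bytes.Compare(loadKey(), search)
}

func main() {
	keySize, height := unpackMeta(packMeta(32, 12))
	fmt.Println(keySize, height)

	nodeKey := []byte("aaaaaaaa-1")
	loads := 0
	load := func() []byte { loads++; return nodeKey }
	fmt.Println(compare(keyPrefix(nodeKey), load, []byte("bbbbbbbb-1")), "loads:", loads)
	fmt.Println(compare(keyPrefix(nodeKey), load, []byte("aaaaaaaa-2")), "loads:", loads)
}
```

In the second call the prefixes tie, so the full key is loaded; in the first call the cached prefix alone settles the comparison.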

With this feature enabled, here are the new charts:

Impact

The total runtimes for the three examples are as follows:

| Key/value combo | Master | PR |
|---|---|---|
| `32:32` | `17.93s` | `10.85s` |
| `32:96` | `18.17s` | `11.83s` |
| `32:256` | `24.58s` | `13.30s` |
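For a quick sense of scale, the runtimes above can be turned into relative numbers. The small sketch below just does that arithmetic; it works out to roughly 35% to 46% less time, with the largest gain for the largest values.

```go
package main

import "fmt"

// run holds the total runtime in seconds for one key/value combo.
type run struct {
	combo      string
	master, pr float64
}

// reduction returns how much less time the PR takes, in percent of master.
func reduction(master, pr float64) float64 { return (master - pr) / master * 100 }

// speedup returns master time divided by PR time.
func speedup(master, pr float64) float64 { return master / pr }

func main() {
	runs := []run{
		{"32:32", 17.93, 10.85},
		{"32:96", 18.17, 11.83},
		{"32:256", 24.58, 13.30},
	}
	for _, r := range runs {
		fmt.Printf("%s: %.1f%% less time, %.2fx speedup\n",
			r.combo, reduction(r.master, r.pr), speedup(r.master, r.pr))
	}
}
```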

Note for 32-bit platforms

The PR as written assumes a 64-bit platform, but can easily be extended to support 32-bit. In that case, we could use 4 bytes instead of 8 for the quickCmp. If we assume that 1 byte of a key is an application-specific prefix, that leaves 3 bytes of entropy, so even that should be 'ok' for up to 16M items, which is quite a lot for a memory db.

I have experimented with a more niche variant, where the node is based on uint32, and 6 bytes are used. That can be achieved if the keyLength is limited to 4096, the height uses only one nibble, and, of course, that the kvOffset is limited to 32 bits (which is currently already the case for 32-bit platforms). However, this PR does not make any niche-specific changes to go-leveldb which would not fit the general case.
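The arithmetic behind the 6 bytes can be sketched as follows: 12 bits of key length plus 4 bits of height fill half of one uint32, which leaves 2 bytes there and 4 more in the uint32 that used to hold the height. The exact bit positions and the names `pack6`/`unpack6` are assumptions for illustration, not taken from the experiment mentioned above.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
)

const (
	keyLenBits = 12 // key length must stay below 4096
	heightBits = 4  // height fits in one nibble
	prefixLen  = 6  // key bytes cached across the two words
)

// pack6 packs key length, height and the first 6 key bytes into two uint32
// words. Layout of w0 (assumed): [12 bits keyLen][4 bits height][key[0]][key[1]].
// w1 holds key[2..5] in big-endian order.
func pack6(key []byte, height int) (w0, w1 uint32, err error) {
	if len(key) >= 1<<keyLenBits || height < 0 || height >= 1<<heightBits {
		return 0, 0, errors.New("key length or height out of range")
	}
	var p [prefixLen]byte
	copy(p[:], key) // shorter keys are right-padded with zeros
	w0 = uint32(len(key))<<20 | uint32(height)<<16 | uint32(p[0])<<8 | uint32(p[1])
	w1 = binary.BigEndian.Uint32(p[2:])
	return w0, w1, nil
}

// unpack6 reverses pack6 and returns the 6-byte prefix as an integer.
func unpack6(w0, w1 uint32) (keyLen, height int, prefix uint64) {
	keyLen = int(w0 >> 20)
	height = int((w0 >> 16) & 0xf)
	prefix = uint64(w0&0xffff)<<32 | uint64(w1)
	return keyLen, height, prefix
}

func main() {
	w0, w1, err := pack6([]byte("abcdefgh"), 12)
	if err != nil {
		panic(err)
	}
	keyLen, height, prefix := unpack6(w0, w1)
	fmt.Printf("keyLen=%d height=%d prefix=%#x\n", keyLen, height, prefix)
}
```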

Todos not fixed yet:

Fix this for 32-bit platforms
Use the same quickCmp for findLT, and possibly Seek and fill.

Note about benchmarks

A final note: the existing benchmarks do not quite demonstrate this issue, nor do they show any significant speed improvement in this PR (at least not the Put tests). The reason for that is twofold:

The existing benchmarks use a 4-byte key, meaning that each lookup requires some alloc for padding the key.
The existing benchmarks use a nil value, meaning that they do not suffer (much) from reading data from the kvData section, which is then essentially just a tightly packed space of keys.

For posterity though:

name         old time/op  new time/op  delta
Put-6        1.05µs ± 8%  1.03µs ±31%     ~     (p=0.548 n=5+5)
PutRandom-6  2.08µs ± 8%  1.77µs ±27%     ~     (p=0.095 n=5+5)
Get-6        1.08µs ±10%  0.93µs ± 6%  -14.35%  (p=0.008 n=5+5)
GetRandom-6  2.63µs ± 5%  2.03µs ± 7%  -22.71%  (p=0.008 n=5+5)

rjl493456442 · 2022-01-05T03:30:56Z

After reading the commit dae8fc2, I guess the main improvement comes from skipping most of the key loading and byte comparison (by using integer comparison)?

holiman · 2022-01-05T09:14:41Z

> skipping most of the key loading and byte comparison (by using integer comparison)

Primarily avoiding loading, by minimizing the memory surface area that we have to jump around in during the search.

rjl493456442 · 2022-01-20T04:11:16Z

leveldb/memdb/memdb.go

@@ -469,11 +478,31 @@ func (p *DB) Reset() {
 func New(cmp comparer.BasicComparer, capacity int) *DB {
 	p := &DB{
 		cmp:       cmp,
+		quickCmp:  (cmp == comparer.DefaultComparer),


This is wrong. The cmp actually used in leveldb is iComparer, which carries the notion of sequence and key type.
This means quickCmp is disabled by default.
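A small sketch of why the equality check fails once the comparer is wrapped. The types here are reduced, hypothetical stand-ins and not the library's own; in particular `wrappedComparer` only delegates, whereas a real internal-key comparer would also take sequence and key type into account.

```go
package main

import (
	"bytes"
	"fmt"
)

// Comparer is a reduced, hypothetical stand-in for the comparer interface.
type Comparer interface {
	Compare(a, b []byte) int
}

type bytesComparer struct{}

func (bytesComparer) Compare(a, b []byte) int { return bytes.Compare(a, b) }

// DefaultComparer plays the role of the package-level default comparer.
var DefaultComparer Comparer = bytesComparer{}

// wrappedComparer imitates a comparer that wraps the user-supplied one.
type wrappedComparer struct {
	user Comparer
}

func (w *wrappedComparer) Compare(a, b []byte) int { return w.user.Compare(a, b) }

// quickCmpEnabled mirrors the check in the diff: an interface equality test.
func quickCmpEnabled(cmp Comparer) bool {
	return cmp == DefaultComparer
}

func main() {
	fmt.Println(quickCmpEnabled(DefaultComparer))                   // direct use
	fmt.Println(quickCmpEnabled(&wrappedComparer{DefaultComparer})) // wrapped use
}
```

Interface equality compares dynamic type and value, so the wrapper never equals the default even though it delegates to it.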

rjl493456442 · 2022-01-20T05:44:39Z

leveldb/memdb/node.go

+func (n node) quickCmp(key []byte) int {
+	var other nodeInt
+	if is64Bit {
+		other = nodeInt(binary.BigEndian.Uint64(key))


Using Uint64 here is not suitable: the integer conversion can break the real ordering.

Or we can change the comparison flag to uint. It doesn't make sense for it to contain a negative value.
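The ordering concern can be checked in isolation. In the sketch below (helper names are made up, and whether nodeInt is signed is not visible in the snippet above), a big-endian prefix compared as an unsigned integer agrees with the byte-wise order, while the same bits compared as a signed integer flip the order as soon as the first byte is 0x80 or higher.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

// prefix reads up to 8 bytes of key as a big-endian uint64, zero-padded.
func prefix(key []byte) uint64 {
	var b [8]byte
	copy(b[:], key)
	return binary.BigEndian.Uint64(b[:])
}

// cmpBytes is the reference byte-wise ordering.
func cmpBytes(a, b []byte) int { return bytes.Compare(a, b) }

// cmpUnsigned compares the prefixes as unsigned integers.
func cmpUnsigned(a, b []byte) int {
	x, y := prefix(a), prefix(b)
	switch {
	case x < y:
		return -1
	case x > y:
		return 1
	}
	return 0
}

// cmpSigned compares the same bits reinterpreted as signed integers.
func cmpSigned(a, b []byte) int {
	x, y := int64(prefix(a)), int64(prefix(b))
	switch {
	case x < y:
		return -1
	case x > y:
		return 1
	}
	return 0
}

func main() {
	a := []byte{0x7f, 1, 2, 3, 4, 5, 6, 7}
	b := []byte{0x80, 1, 2, 3, 4, 5, 6, 7}
	fmt.Println("bytes:   ", cmpBytes(a, b))
	fmt.Println("unsigned:", cmpUnsigned(a, b))
	fmt.Println("signed:  ", cmpSigned(a, b))
}
```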

rjl493456442 · 2022-01-20T06:09:30Z

leveldb/memdb/node.go

+	}
+	// pad
+	k := make([]byte, fixedKeyLength)
+	copy(k, key)


Never mind, right-padding is correct here.
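A short sketch of the padding behaviour, assuming `fixedKeyLength` is 8 and using made-up helper names `quick` and `full`. Right-padding with zero bytes keeps the prefix order consistent with the byte-wise order whenever the padded prefixes differ. It can, however, make two distinct short keys look equal, so a tie still needs the full comparison.

```go
package main

import (
	"bytes"
	"fmt"
)

const fixedKeyLength = 8 // assumed value for the 64-bit case

// padKey right-pads (or truncates) key to fixedKeyLength bytes.
func padKey(key []byte) []byte {
	k := make([]byte, fixedKeyLength)
	copy(k, key)
	return k
}

// quick compares only the padded prefixes; 0 means inconclusive.
func quick(a, b []byte) int { return bytes.Compare(padKey(a), padKey(b)) }

// full is the reference comparison on the complete keys.
func full(a, b []byte) int { return bytes.Compare(a, b) }

func main() {
	// Differing prefixes: the quick result matches the full result.
	fmt.Println(quick([]byte("ab"), []byte("b")), full([]byte("ab"), []byte("b")))

	// "a" and "a\x00" pad to the same 8 bytes, yet "a" sorts first.
	fmt.Println(quick([]byte("a"), []byte("a\x00")), full([]byte("a"), []byte("a\x00")))
}
```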

holiman added 5 commits December 11, 2021 18:41

memdb: use object-oriented accessors for skiplist nodes

54c7f78

memdb: define backing-type

667fbf8

memdb: make tests use accessors

081ba6c

memdb: remove unused constants

de5eb3f

memdb: minor refactor

afed0b2

holiman changed the title ~~Use fixed-size keys in skiplist search~~ Speed up memory db by using fixed-size keys in skiplist search Dec 13, 2021

holiman added 2 commits December 13, 2021 14:06

stats: add separate benchmark test

825876f

memdb: add fixed-key comparison via nodeData

dae8fc2

holiman force-pushed the prekey branch from fbc862f to dae8fc2 Compare December 13, 2021 13:06

memdb: handle 32-bit platforms

80f4416

rjl493456442 reviewed Jan 20, 2022
