
sql: perform point key-value reads where possible #46758

Closed
nvanbenschoten opened this issue Mar 30, 2020 · 9 comments · Fixed by #61583
Labels
A-sql-execution Relating to SQL execution. C-performance Perf of queries or internals. Solution not expected to change functional behavior. O-sre For issues SRE opened or otherwise cares about tracking.

Comments

@nvanbenschoten
Member

The KV API exposes both a ranged ScanRequest read operation and a point GetRequest read operation. Currently, SQL's row.Fetcher only ever uses ScanRequests, even when looking up a discrete set of keys. We should explore updating it to use GetRequests when possible.

For a multitude of reasons, we expect point KV operations to perform marginally better than ranged KV operations, even in cases where the SQL layer knows that the ranged operation operates over a fixed number of keys. Here's a non-exhaustive list of reasons why:

  • MVCCGet, unlike MVCCScan, is able to use a prefix iterator, which allows it to make use of RocksDB/Pebble bloom filters. There may also be other wins at the storage layer, like avoiding prefetching
  • various data structures have optimizations and fast-paths for single-key operations (e.g. timestamp cache, latch manager, refresh span tracking)
  • various code paths are simpler in the point operation case
  • a point operation implies only needing a single key allocation instead of a start and end key allocation
  • a point operation implies only needing to return up to a single result, which means that we can avoid some indirection in various places

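As a rough illustration of what "use GetRequests when possible" could look like on the fetcher side, here is a minimal sketch. The `KV` interface and helpers are hypothetical stand-ins, not CockroachDB's actual client API; the only real idea carried over is that a span whose end key is the start key plus a trailing 0x00 byte covers exactly one key and can be served by a point read.

```go
package kvfetch

import "bytes"

// KV is a hypothetical stand-in for the KV client; the real API (kv.Txn and
// roachpb requests) differs in detail. It only exists to make the dispatch
// idea concrete.
type KV interface {
	Get(key []byte) ([]byte, error)           // point read, i.e. a GetRequest
	Scan(start, end []byte) ([][]byte, error) // ranged read, i.e. a ScanRequest
}

// fetchSpan issues a point read when the span is known to cover exactly one
// key (its end key is the start key followed by a 0x00 byte) and falls back
// to a ranged read otherwise.
func fetchSpan(db KV, start, end []byte) ([][]byte, error) {
	pointEnd := append(append([]byte(nil), start...), 0x00)
	if bytes.Equal(end, pointEnd) {
		val, err := db.Get(start)
		if err != nil || val == nil {
			return nil, err
		}
		return [][]byte{val}, nil
	}
	return db.Scan(start, end)
}
```

In the real fetcher, the equivalent decision would have to happen wherever row.Fetcher turns spans into KV requests.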
We've known that these could add up to a performance win on simple workloads for some time. However, (to my knowledge) we've never actually made any measurements. Inspired by #46752, I hacked up a test using this patch. I then ran kv95 on a single-node cluster with and without the load-gen change that enables the use of GetRequest instead of ScanRequest. This led to the following improvement in throughput:

name  old ops/sec  new ops/sec  delta
kv95   44.9k ± 3%   45.9k ± 2%  +2.16%  (p=0.056 n=6+6)

This seems significant enough to justify further investigation.

I think there's also a similar issue to be filed about DeleteRequest vs DeleteRangeRequest.

cc. @jordanlewis for triage.

@nvanbenschoten nvanbenschoten added C-performance Perf of queries or internals. Solution not expected to change functional behavior. A-sql-execution Relating to SQL execution. labels Mar 30, 2020
@jordanlewis
Member

Thanks for taking a look at this! I did an informal experiment with this a while ago and didn't record my results, but I don't think I saw anything like 2%. We'll put it into the backlog, although there are a few performance improvements (lookup join stuff, removal of extra planning latency) we're planning that will probably take priority.

@nvanbenschoten
Member Author

Thanks Jordan!

@nvanbenschoten
Member Author

I think there's also a similar issue to be filed about DeleteRequest vs DeleteRangeRequest.

I finally got around to writing this up in #53939 after seeing it come up again in TPC-C.

@nvanbenschoten
Member Author

In his work with the separated lock table, @sumeerbhola is introducing another case where point reads will be optimized over ranged reads. With his changes, point reads will be able to use bloom filters when searching the persistent portion of the lock table. This will not be possible for ranged reads.

@sumeerbhola
Collaborator

(Adding to the earlier list of performance benefits.) A scan over a span that contains at most one key, but doesn't know that in advance, needs to iterate over all of that key's versions (or eventually do a second, expensive seek) looking for the non-existent next key in the span. A Get can do a single seek, read the key, and be done.

Regarding the lock table: if locks are rare and the BatchRequest contains a large number of ScanRequests, it may actually be faster than issuing a similar number of GetRequests, since the first scan will seek past all the spans being requested and the later seeks will be optimized, thanks to the Pebble improvements in cockroachdb/pebble#951 and cockroachdb/pebble#947. Still, we can always change a Get into a Scan inside the storage package (or below) if we find a significant performance benefit there, whereas we can't do the reverse.
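To make the first point concrete, here is a toy step-count model. It is purely illustrative: `entry`, `scanSteps`, and `getSteps` are made up for this sketch and are not CockroachDB's MVCC or storage code.

```go
package mvccmodel

// entry is a toy MVCC record: the slice passed to the functions below is
// assumed sorted by key, with all versions of a key stored consecutively.
type entry struct {
	key     string
	version int
}

// scanSteps models a scan over the span [key, key.Next()): after seeking to
// the first entry for key, the scan has to step over every version of key
// until it sees an entry outside the span and learns that it is done.
func scanSteps(sorted []entry, key string) int {
	steps := 0
	for _, e := range sorted {
		if e.key < key {
			continue // the initial seek skips everything before key
		}
		steps++
		if e.key > key {
			break // the first entry past the span ends the scan
		}
	}
	return steps
}

// getSteps models a point get: a single seek lands on the newest version of
// key, and the read is done.
func getSteps(sorted []entry, key string) int {
	for _, e := range sorted {
		if e.key >= key {
			return 1
		}
	}
	return 0
}
```

With many versions of the single key in the span, scanSteps grows with the version count while getSteps stays at one, which is the asymmetry described above.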

@joshimhoff joshimhoff added the O-sre For issues SRE opened or otherwise cares about tracking. label Feb 16, 2021
@jordanlewis
Member

@nvanbenschoten do you have an intuition about whether it would be better, in the case of a fetch of a row with 2 (or 3 or 4 or n) column families, to issue n GetRequests, one per family, or 1 ScanRequest?

@nvanbenschoten
Member Author

@nvanbenschoten do you have an intuition about whether it would be better, in the case of a fetch of a row with 2 (or 3 or 4 or n) column families, to issue n GetRequests, one per family, or 1 ScanRequest?

This is an interesting question! I don't have much intuition about which would be better or where the crossover point lies. Let's run some experiments and let that guide our decision here.
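For concreteness, the two shapes being compared look roughly like this. The `request` struct, key layout, and helper names are illustrative only; real CockroachDB keys and roachpb requests are encoded quite differently.

```go
package cfsketch

import "fmt"

// request is a simplified stand-in for a KV read request.
type request struct {
	kind   string // "get" or "scan"
	key    string
	endKey string // only set for scans
}

// perFamilyGets issues one point read per column family of the row.
func perFamilyGets(rowPrefix string, numFamilies int) []request {
	reqs := make([]request, 0, numFamilies)
	for famID := 0; famID < numFamilies; famID++ {
		reqs = append(reqs, request{kind: "get", key: fmt.Sprintf("%s/%d", rowPrefix, famID)})
	}
	return reqs
}

// wholeRowScan issues a single ranged read covering every family of the row.
func wholeRowScan(rowPrefix string) []request {
	// "<prefix>/" .. "<prefix>0" brackets every "<prefix>/..." key here,
	// standing in for what would be PrefixEnd() in the real key space.
	return []request{{kind: "scan", key: rowPrefix + "/", endKey: rowPrefix + "0"}}
}
```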

@jordanlewis
Member

While I've got your attention, do you have an idea about the workload that would be most likely to showcase a difference between GetRequest and ScanRequest? I've been playing with KV and have seen nearly no noticeable difference in performance.

@nvanbenschoten
Member Author

You've been playing with an empty database, right? That might be contributing, as a major benefit of point reads is that they can use bloom filters to reduce read amplification once an LSM fills out. This effect will also be more pronounced once the entire data set no longer fits in RAM. I know it's slow, but consider throwing a few iterations of tpccbench/nodes=3/cpu=4 at both variants and letting it run overnight.

@craig craig bot closed this as completed in 2e60515 Apr 14, 2021