
sql: statement timings in transactions are suspicious #40675

Closed
piyush-singh opened this issue Sep 11, 2019 · 15 comments · Fixed by #53846 or #65480
Assignees
Labels
A-monitoring A-sql-executor SQL txn logic A-sql-ui Why is my query slow? C-investigation Further steps needed to qualify. C-label will change. C-performance Perf of queries or internals. Solution not expected to change functional behavior.

Comments

@piyush-singh

piyush-singh commented Sep 11, 2019

Describe the problem

Roko noted that we had a customer with unexpected latencies in their Admin UI. The customer's latencies were different when measured from their client compared to the latencies shown in the UI. Roko and I did a simple experiment to determine whether wrapping statements in txns was impacting timings for the statements. It seems that we are inaccurately reporting latencies in the situation below.

To Reproduce

Setup steps:

roachprod create $CLUSTER -n 3 --geo
roachprod stage $CLUSTER release v19.1.4
roachprod start $CLUSTER
roachprod sql $CLUSTER:1

SQL:

create database test;
use test;
create table b(a int, b int);
show experimental_ranges from table b; -- (confirm I'm connected to the leaseholder as gateway)
begin;
insert into b values (1,1);
select * from b;
commit;

When I check the UI, I see the read took 800+ ms despite being connected to the leaseholder as my gateway node:
[screenshot omitted]

This is unexpected. When executing this via an implicit txn, I see the statement executing much faster:

root@localhost:26257/test> select * from b;
  a | b  
+---+---+
  1 | 1  
(1 row)

Time: 362.221301ms

root@localhost:26257/test> select * from b;
  a | b  
+---+---+
  1 | 1  
(1 row)

Time: 793.702µs

(side note: why is the second run so much faster?)

Subsequent executions yield some strange results in other ways. I see that latencies for the planning phase on the insert are extremely high (this is also true on the read, but closer to 100ms instead of 400+):
[screenshot omitted]

It is unclear why the timings for the reads are so different inside vs. outside of transactions. cc @rkruze and @jordanlewis, as we spoke about this in person.

gz#8246

@jordanlewis
Member

@piyush-singh thanks for writing this up! It's missing one crucial measurement that I was trying to explain in person - you included the timings as reported by the SQL shell (which just measures the time until response was finished from the server) for the implicit txn, but not for the explicit txn. Can you include those timings? In particular, the crucial measurement is whether the select within the transaction returns quickly or slowly, and whether the commit statement returns quickly or slowly.

Thank you!

@piyush-singh
Author

No problem - so when running the explicit transaction, I only get a single timing if I don't have results returned for each statement:

root@localhost:26257/test> begin;
Now adding input for a multi-line SQL transaction client-side (smart_prompt enabled).
Press Enter two times to send the SQL text collected so far to the server, or Ctrl+C to cancel.
You can also use \show to display the statements entered so far.
                        -> insert into b values (1,1);
                        -> select * from b;
                        -> commit;
  a | b  
+---+---+
  1 | 1  
(1 row)

Time: 972.120879ms

If I execute the statements one at a time, I see:

begin;
Now adding input for a multi-line SQL transaction client-side (smart_prompt enabled).
Press Enter two times to send the SQL text collected so far to the server, or Ctrl+C to cancel.
You can also use \show to display the statements entered so far.
                        -> insert into d values (1,1);
                        -> 
INSERT 1

Time: 757.001271ms

root@localhost:26257/test  OPEN> select * from d;
  a | b  
+---+---+
  1 | 1  
(1 row)

Time: 822.218µs

root@localhost:26257/test  OPEN> commit;
COMMIT

Time: 73.486123ms

One thing I wonder about this second set of timings is whether entering these statements slowly by hand affects the timings, compared to executing them rapidly and programmatically (without pauses to type them out in between). Perhaps if replication is able to complete while I'm typing, it might affect the reported latencies?

@RaduBerinde
Member

What is "overhead" in the latency breakdown referring to?

@piyush-singh
Author

Overhead is a catch-all bucket as I understand it. Essentially we subtract the parse, plan, and run latencies from the overall latency and the remainder is just stuck under the overhead label.

@knz
Contributor

knz commented Oct 25, 2019

When executing this via an implicit txn, I see the statement executing much faster:

There is additional complexity in the SQL code with explicit transactions. We'd need some advanced investigation, but I would not be surprised if it was truly slower.
For example, explicit txns do not benefit from the "1PC optimization" available to standalone statements. That alone saves many network round trips in a multi-node cluster.

Overhead is a catch-all bucket as I understand it. Essentially we subtract the parse, plan, and run latencies from the overall latency and the remainder is just stuck under the overhead label.

Correct (I wrote that code). It ensures that the sum of the latencies balances out. It also draws our attention to measurement bugs if the overhead ever gets too large compared to the other phases.

@knz knz added A-monitoring A-sql-executor SQL txn logic C-investigation Further steps needed to qualify. C-label will change. C-performance Perf of queries or internals. Solution not expected to change functional behavior. labels Oct 25, 2019
@nvanbenschoten
Member

@piyush-singh Is it possible that the leaseholder for the table descriptors or namespaces were remote and that the latency was due to lookups in these metadata tables? That would be consistent with your finding that the latency was attributed to "planning". It would also explain why the first access of table b in the transaction was significantly slower than later accesses because these lookups are cached.

@nvanbenschoten
Member

It would also explain this:

(side note: why is the second run so much faster?)

@andy-kimball
Contributor

It's "expected" that the 1st planning time will be very large, because we have to fetch sql schema in order to plan. In fact, even subsequent planning times can be large, because you might hit different gateway nodes, and each node then has to fetch schema.

@solongordon
Contributor

There could be more than one issue at play here but it looks to me like our latency accounting is just plain wrong when multiple statements are submitted in a single request. The execution time of the earlier statements is counted as "Overhead" in the later statements. This affects both implicit and explicit transactions.

SQL:

root@:26257/defaultdb> select pg_sleep(2); select 1;

WebUI:
[screenshots omitted: Execution Stats for both statements, showing the pg_sleep time attributed to the second statement as Overhead]

I'll look into how to fix the accounting for this in conn_executor.

@solongordon solongordon self-assigned this Jun 10, 2020
@awoods187 awoods187 added the A-sql-ui Why is my query slow? label Jun 11, 2020
craig bot pushed a commit that referenced this issue Sep 3, 2020
53800: server: defer gossip start to when absolutely needed r=irfansharif a=irfansharif

This was pulled out of #52526 to keep the diff there focussed on the
introduction of the RPC (and to see if that alone shook out any
failures). That change lets us make this one, where we can defer gossip
start until right before we start up Node.

Release justification: low risk, high benefit change to existing functionality
Release note: None

53846: sql: fix inflated "overhead" in statement timings r=solongordon a=solongordon

Fixes #40675
Fixes #50108

Release justification: Low-risk, high-reward fix to existing
functionality.

Release note (admin ui change): In some cases, the Execution Stats page
would show a confusingly high Overhead latency for a statement. This
could happen due to multiple statements being parsed together or due to
statement execution being retried. To avoid this, we stop considering
the time between when parsing ends and execution begins when determining
service latency.

53906: roachtest: Unskip disk-full roachtest r=tbg a=itsbilal

The disk-full roachtest no longer fails now that Pebble is the
default storage engine. Remove the skipped marker.

Fixes #35328.

Release justification: Roachtest-only change.
Release note: None.

53907: geos: bump to include seg fault fix r=sumeerbhola a=otan

Resolves #53254.

Release justification: bug fix to new functionality

Release note: None

Co-authored-by: irfan sharif <[email protected]>
Co-authored-by: Solon Gordon <[email protected]>
Co-authored-by: Bilal Akhtar <[email protected]>
Co-authored-by: Oliver Tan <[email protected]>
@craig craig bot closed this as completed in 6371d37 Sep 3, 2020
@rafiss
Collaborator

rafiss commented Mar 29, 2021

@solongordon will sync up with the Observability team and move ownership over to them.

@awoods187
Contributor

@jordanlewis can we get this on the next milestone?

@daniel-crlabs
Contributor

Another customer has reached out to us with this same issue:

We seem to be processing the individual [statements] in 3ms. However, when we look at the Admin UI, we see an "overhead" of 2.3s

[screenshot omitted]

Meanwhile, the statement trace clearly indicates the statements take less than 400ms to complete:

[screenshot omitted]

@awoods187
Contributor

awoods187 commented Apr 26, 2021

cc @maryliag @kevin-v-ngo

@maryliag
Contributor

From Solon's last comments, is the solution for this just to stop displaying this "overhead" on the statement details page?
Or a combination with another fix? Daniel's comment indicates there is also an issue with "overall".
