sql: stream planNode.Next() results through pgwire without collecting all intermediate results in RAM #7775
For whoever does this work: I think implementing the streaming part and returning an error when needed is straightforward. I'm mostly worried about client error handling, though, and think that this deserves more research before we can ship a PR. Mostly I want someone to have thought through the following situation and be convinced that we're OK.

Take some app that uses this feature. It runs a SELECT and starts reading through the result rows, producing some side effect for each row returned. Then, after some number of rows, the SELECT is aborted or errors for any reason. Those side effects have already happened. With our current code, the error is detected first, so no side effects are produced. A user now has to figure out what to do. They can re-run the SELECT, but the results may be different, so the earlier side effects may have been erroneous. How can we handle this?

However, thinking this through, it's possible this exact problem already exists. Today, the server buffers all rows in SQL, then sends them up to the pgwire layer, which sends each row off to the client. A connection or other error can also occur there, resulting in exactly the same situation. So maybe we won't be worse off than today, and the above isn't a real problem?
Really I'd argue it's already a (non-)problem, because of transactions. We already report errors before the end of a transaction, so if a transaction gets canceled, whatever side effects occurred on the client side since the beginning of the transaction need to be undone anyway. I'd argue that any well-designed MVC client code will always buffer/process data from the server (and wait for txn completion) before presenting data to the user and/or effecting irreversible side effects.
OK, I have investigated this more. Explaining here as requested by @petermattis. @andreimatei you may want to chime in, since the main focus here ends up revolving around autoretries and the API.

The issue to solve here is not that clients may be confused to receive an error after some data has already been communicated. It is actually valid in pgwire (and other SQL wire protocols) to start returning results and then fail before the statement has completed. The main issue I found is that the API between pgwire and Executor is currently a function that drives the entire SQL transaction state, the mapping between SQL transaction state and KV transaction state, and the retry logic. Changing this to a streaming interface is not simple: it amounts to an API change that breaks the abstraction boundary and causes headaches with retries.

A naive idea, not to be considered seriously but related here for contrast with the other idea below, is to change the Executor API to return a generator of planNodes to pgwire, to be driven via Start/Next etc. by pgwire. However, if/when an error occurs, or something like a transaction boundary appears in the middle of the SQL string, pgwire must inform the Executor accordingly (while the stream of planNodes is in progress). This would amount to a complex bi-directional interaction between pgwire and Executor, i.e. too complex for what we want to maintain.

The more promising direction is to have pgwire pass a closure to Executor, like a "sender function" that the Executor can use to actively send results toward pgwire during ExecStatements. The problem with this is what to do about transaction retries. The main issue is that as soon as some data has flowed over the wire from the executor to the client, a retry becomes impossible. If we naively start streaming data as soon as a statement produces it, no implicit retry is ever possible again for out-of-txn SELECT or INSERT/UPDATE/UPSERT ... RETURNING.

So the better solution, assuming we want to keep autoretries, would be to start accumulating results locally in the server, only stream them back to the client once some local capacity limit has been reached, and limit autoretry to those statements whose results fit entirely in the buffer (and have not yet been communicated to the client).

Overall this looks pretty doable functionality-wise, but there will be a lot of corner cases for testing, namely what happens with this buffering of rows in the presence of client errors and txn errors, or when there is a transaction boundary inside a single SQL string (like …).
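For concreteness, here is a minimal sketch of that buffered "sender function" idea in Go. Everything named here (ResultSender, bufferingSender, the capacity limit) is hypothetical, not CockroachDB's actual API; it only illustrates the invariant that autoretry stays possible exactly as long as nothing has reached the wire.

```go
package main

import (
	"errors"
	"fmt"
)

// ResultSender is the closure pgwire would hand to the Executor; it pushes
// one result row toward the client.
type ResultSender func(row []string) error

// bufferingSender accumulates rows up to limit. While everything still fits
// in the buffer, a txn autoretry is safe: rewind() just discards the buffer.
// Once the buffer overflows, all rows are flushed to the client and the
// statement streams from then on, so autoretry is no longer possible.
type bufferingSender struct {
	limit   int
	buf     [][]string
	flushed bool         // true once any row has reached the wire
	send    ResultSender // wraps the actual pgwire connection
}

func (s *bufferingSender) Add(row []string) error {
	if s.flushed {
		return s.send(row) // already streaming
	}
	s.buf = append(s.buf, row)
	if len(s.buf) <= s.limit {
		return nil // still fully buffered; retry remains possible
	}
	// Capacity exceeded: flush everything and switch to streaming mode.
	for _, r := range s.buf {
		if err := s.send(r); err != nil {
			return err
		}
	}
	s.buf, s.flushed = nil, true
	return nil
}

// rewind is what the retry loop would call on a retryable txn error.
func (s *bufferingSender) rewind() error {
	if s.flushed {
		return errors.New("cannot autoretry: rows already sent to client")
	}
	s.buf = s.buf[:0]
	return nil
}

func main() {
	s := &bufferingSender{
		limit: 2,
		send:  func(row []string) error { fmt.Println(row); return nil },
	}
	for i := 0; i < 4; i++ {
		_ = s.Add([]string{fmt.Sprintf("row%d", i)})
	}
	fmt.Println("retry still possible:", s.rewind() == nil) // false: overflowed
}
```

The single flushed bit is what the retry loop would consult: rewind() succeeds, and the statement can be implicitly retried, only while the buffer still holds every row.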
Passing some sort of interface or closure to the Executor …
Why is this prohibitively complex? There's clear precedent for this kind of API in …

Passing an interface or closure to handle the rows is restrictive in its own way: the connection can't signal the executor that the client has gone away except when the executor has a row available. If the query is expensive (e.g. a SELECT that does a full table scan with a filter that discards nearly all rows), we'd like the pgwire connection to be able to tell the executor that the client has gone away, instead of letting it continue running the command.
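One conventional way to lift that restriction, sketched below in Go under invented names (serveConn, runQuery — none of this is the real pgwire code), is to pair the row closure with a cancellable context: a goroutine watching the connection cancels the context when the client disappears, and the executor checks it between rows, so even a scan that produces no rows stops promptly.

```go
package main

import (
	"context"
	"fmt"
	"net"
)

// runQuery stands in for the executor driving a plan; it checks ctx between
// rows so an expensive scan stops promptly when the client goes away.
func runQuery(ctx context.Context, send func(row string) error) error {
	for i := 0; i < 1000000; i++ { // pretend full table scan
		if err := ctx.Err(); err != nil {
			return err // client gone (or query cancelled): stop scanning
		}
		if i%100000 != 0 {
			continue // the filter discards nearly all rows
		}
		if err := send(fmt.Sprintf("row %d", i)); err != nil {
			return err
		}
	}
	return nil
}

// serveConn wires the connection to the executor: a goroutine watches the
// socket and cancels the context on any read error (EOF, reset, timeout).
func serveConn(conn net.Conn) error {
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	go func() {
		one := make([]byte, 1)
		if _, err := conn.Read(one); err != nil {
			cancel()
		}
	}()

	send := func(row string) error {
		_, err := fmt.Fprintln(conn, row)
		return err
	}
	return runQuery(ctx, send)
}

func main() {
	client, server := net.Pipe()
	go func() {
		buf := make([]byte, 64)
		client.Read(buf) // receive one row...
		client.Close()   // ...then hang up
	}()
	fmt.Println("query ended with:", serveConn(server))
}
```

With net.Pipe the example is self-contained: the simulated client reads one row and hangs up, and the query loop terminates via either the cancelled context or the write error, whichever fires first.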
@knz, assigning you. Please re-assign or remove the 1.0 milestone as you see fit.
New milestone as discussed with @cuongdo.
Now that @tristan-ohlson is on this, it will be part of the 1.1 release.
As discussed previously with @mjibson. Will help support work on #7572 and #7657.