[FlightSQL] Support `DoExchange` (in addition to `DoPut`) to bind parameters and execute prepared statements #37741

alamb · 2023-09-15T16:19:27Z

Describe the enhancement requested

As suggested by @kou on #37720 (comment)

Usecase

Reduce the number of messages round trips required to run a prepared statement via FlightSQL
Avoid the need to (potentially) serialize bind parameters in a stateless architecture (see [FlightSQL] Stateless prepared statement with parameter support #37720)

Background

Currently FlightSQL requires three messages to run a prepared statement. DoPut, then a GetFlightInfo and then a DoGet:

...
DoPut `CommandPreparedStatementQuery` 
GetFlightInfo `CommandPreparedStatementQuery`
DoGet 
...

Proposal

By supporting DoExchange instead of DoPut + GetFlightInfo + DoGet only a single round trip is needed:

...
DoExchange `CommandPreparedStatementQuery` 
...

Benefits:

With this approach, parameters aren't needed to send-back to client-side

Drawbacks:

the query execution result can be returned by only one server.

Component(s)

FlightRPC

The text was updated successfully, but these errors were encountered:

lidavidm · 2023-09-15T16:44:42Z

SGTM

We should add flags in (the awkwardly named) SqlInfo to indicate support for these as we did with other new features

zeroshade · 2023-09-18T14:46:07Z

+1 and I agree with @lidavidm that we should add flags to SqlInfo

suremarc · 2024-02-02T21:07:27Z

I have started looking into implementing this on the Go client side, and I'm running into some difficulties. Namely, the existing interface for PreparedStatement.Execute returns a *FlightInfo, but DoExchange skips GetFlightInfo altogether. The Rust client also presents a similar interface, and I would guess that most clients in other languages do the same.

Of course, one can always drop down to raw Flight/FlightSQL calls without using the abstraction of prepared statement "handles", but this just moves complexity to consumers of the library and makes me worry if the DoExchange protocol for prepared statements will actually be adopted in the ecosystem.

So we have a few options:

Change PreparedStatement.Execute to fetch the flight streams instead of just returning the FlightInfo
- Pros: simpler for users, and hides details like using DoPut/DoExchange
- Cons: disruptive breaking change, less control over consumption of the flight streams.
Add a new PreparedStatement.DoExchange method, separate from the existing Execute implementation that uses DoPut
- Pros: less disruptive
- Cons: dispatching to the correct implementation (aka whatever the server supports) is still forced on users of this library

@alamb do you have any thoughts on this?

alamb · 2024-02-04T14:10:22Z

@suremarc can you remind me why we would need to pass a FlightInfo? I think we could send the required information as part of the payload of the stream, right?

For example

DoExchange

arrow/format/Flight.proto

Line 127 in aded7bf

rpc DoExchange(stream FlightData) returns (stream FlightData) {}

Sends a stream of FlightData

arrow/format/Flight.proto

Lines 495 to 520 in aded7bf

    
           message FlightData { 
        
             /* 
        
              * The descriptor of the data. This is only relevant when a client is 
        
              * starting a new DoPut stream. 
        
              */ 
        
             FlightDescriptor flight_descriptor = 1; 
        
             /* 
        
              * Header for message data as described in Message.fbs::Message. 
        
              */ 
        
             bytes data_header = 2; 
        
             /* 
        
              * Application-defined metadata. 
        
              */ 
        
             bytes app_metadata = 3; 
        
             /* 
        
              * The actual batch of Arrow data. Preferably handled with minimal-copies 
        
              * coming last in the definition to help with sidecar patterns (it is 
        
              * expected that some implementations will fetch this field off the wire 
        
              * with specialized code to avoid extra memory copies). 
        
              */ 
        
             bytes data_body = 1000; 
        
           }

And the FlightData:: flight_descriptor field has the embedded cmd message (in which FlightSQL messages are embedded)

arrow/format/Flight.proto

Line 310 in aded7bf

bytes cmd = 2;

FYI @kallisti-dev who is also working on stateless prepared statement execution

suremarc · 2024-02-04T18:12:06Z

@alamb I think we are on the same page that DoExchange does not require passing a FlightInfo — the problem is that the existing prepared statement interfaces do require returning a FlightInfo back to the client. Introducing the DoExchange protocol breaks this assumption.

For example, see the Go prepared statement implementation: PreparedStatement.Execute, which returns a FlightInfo. The Rust FlightSQL client does this too.

kou · 2024-02-05T09:07:35Z

How about just returning null or an empty FlightInfo (no endpoint)?

kou · 2024-02-05T09:12:36Z

Ah, we may want to return *flight.Reader like Client.DoGet() for DoExchange version.

alamb · 2024-02-05T10:05:24Z

@alamb I think we are on the same page that DoExchange does not require passing a FlightInfo — the problem is that the existing prepared statement interfaces do require returning a FlightInfo back to the client. Introducing the DoExchange protocol breaks this assumption.

I always think of FlightInfo as a source of potential indirection (as it can contain endpoint information so the subsequent call may be to a different endpoint / hostname)

Thus if there is no subsequent calls (just DoExchange instead of DoPut/GetFlightInfo/DoGet) the lack of FlightInfo makes sense to me (as DoExchange starts feeding back data from whatever endpoint you sent data to)

alamb added the Type: enhancement label Sep 15, 2023

github-actions bot added the Component: FlightRPC label Sep 15, 2023

alamb mentioned this issue Sep 15, 2023

[FlightSQL] Stateless prepared statement with parameter support #37720

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FlightSQL] Support `DoExchange` (in addition to `DoPut`) to bind parameters and execute prepared statements #37741

[FlightSQL] Support `DoExchange` (in addition to `DoPut`) to bind parameters and execute prepared statements #37741

alamb commented Sep 15, 2023

lidavidm commented Sep 15, 2023

zeroshade commented Sep 18, 2023

suremarc commented Feb 2, 2024 •

edited

Loading

alamb commented Feb 4, 2024

suremarc commented Feb 4, 2024 •

edited

Loading

kou commented Feb 5, 2024

kou commented Feb 5, 2024

alamb commented Feb 5, 2024

[FlightSQL] Support DoExchange (in addition to DoPut) to bind parameters and execute prepared statements #37741

[FlightSQL] Support DoExchange (in addition to DoPut) to bind parameters and execute prepared statements #37741

Comments

alamb commented Sep 15, 2023

Describe the enhancement requested

Usecase

Background

Proposal

Component(s)

lidavidm commented Sep 15, 2023

zeroshade commented Sep 18, 2023

suremarc commented Feb 2, 2024 • edited Loading

alamb commented Feb 4, 2024

suremarc commented Feb 4, 2024 • edited Loading

kou commented Feb 5, 2024

kou commented Feb 5, 2024

alamb commented Feb 5, 2024

[FlightSQL] Support `DoExchange` (in addition to `DoPut`) to bind parameters and execute prepared statements #37741

[FlightSQL] Support `DoExchange` (in addition to `DoPut`) to bind parameters and execute prepared statements #37741

suremarc commented Feb 2, 2024 •

edited

Loading

suremarc commented Feb 4, 2024 •

edited

Loading