feat: Improved latency for executeStreaminSql calls #7254

saranshdhingra · 2024-04-24T13:19:23Z

Earlier the encodeMessage was used to convert the PartialResultSet proto to an array. Now, with googleapis/gax-php#554 in GAX, we can provide a way to convert the proto to an array which gets us this huge improvement in the conversion. The improvements are visible specially when the protobuf PECL extension is enabled.

A few tests that I ran:

On my local mac (Protobuf extension enabled):

Test 1:

Load a proto with JSON size ~ 1MB

Time to serialize the proto into an array:
Without this fix: 810 ms
With this fix: 28 ms

Test 2:

Call a SELECT * FROM <table> LIMIT 100 (23 column table) using $database->execute twice. The time recorded is end-to-end, including looping through the results. Please note that, the first query can often be slower due to many network/GRPC related reasons.

Without this fix:
Iteration 1 completed in 110.547951 ms
Iteration 2 completed in 80.933988 ms

With this fix:
Iteration 1 completed in 76.404962 ms
Iteration 2 completed in 57.99326 ms

Test 3:

Call a SELECT * FROM <table> LIMIT 10000 (23 column table) using $database->execute twice. The time recorded is end-to-end, including looping through the results. Please note that, the first query can often be slower due to many network/GRPC related reasons.

Without this fix:
Iteration 1 completed in 4.09 s
Iteration 2 completed in 3.37 s

With this fix:
Iteration 1 completed in 706.560961 ms
Iteration 2 completed in 321.485171 ms

On a high power compute VM (protobuf enabled)

Test 1:

Load a proto with JSON size ~ 1MB

Time to serialize the proto into an array:
Without this fix: 345 ms
With this fix: 42 ms

Test 2:

Call a SELECT * FROM <table> LIMIT 100 (23 column table) using $database->execute twice. The time recorded is end-to-end, including looping through the results. Please note that, the first query can often be slower due to many network/GRPC related reasons.

Without this fix:
Iteration 1 completed in 31 ms
Iteration 2 completed in 26 ms

With this fix:
Iteration 1 completed in 16 ms
Iteration 2 completed in 15 ms

Test 3:

Call a SELECT * FROM <table> LIMIT 10000 (23 column table) using $database->execute twice. The time recorded is end-to-end, including looping through the results. Please note that, the first query can often be slower due to many network/GRPC related reasons.

Without this fix:
Iteration 1 completed in 1.83 s
Iteration 2 completed in 1.78 s

With this fix:
Iteration 1 completed in 204 ms
Iteration 2 completed in 185 ms

It's important to note that in tests 2 and 3, there are many more factors involved like the regions of the application and Spanner instance, internet speed, machine capabilities.

Spanner/composer.json

composer.json

Spanner/tests/Unit/Connection/GrpcTest.php

Update GrpcTest Co-authored-by: Brent Shaffer <[email protected]>

ser-sergeev · 2024-05-15T09:57:18Z

@saranshdhingra @bshaffer
I think you break the library somehow. I'm not sure how, but 1.76.0 and 1.76.1 doesn't return all rows.
Here is an information.
#7311

Improved latency for executeStreaminSql calls

ba190b9

saranshdhingra requested review from a team as code owners April 24, 2024 13:19

saranshdhingra and others added 5 commits April 24, 2024 19:41

Modified GAX version for tests

d9322ca

Update GAX dependency to dev-main

0a8c417

Increasing GAX version to run tests

fdda755

Update composer.json

babd51a

Merge branch 'main' into spanner-spedify-partialresultset

0364373

bshaffer reviewed Apr 25, 2024

View reviewed changes

Spanner/composer.json Outdated Show resolved Hide resolved

composer.json Outdated Show resolved Hide resolved

update to latest GAX (1.32)

03e905d

bshaffer requested changes Apr 25, 2024

View reviewed changes

Spanner/tests/Unit/Connection/GrpcTest.php Show resolved Hide resolved

saranshdhingra and others added 2 commits April 26, 2024 13:13

Update Spanner/tests/Unit/Connection/GrpcTest.php

d8f4145

Update GrpcTest Co-authored-by: Brent Shaffer <[email protected]>

Addressed comments about GrpcTest

79b610d

bshaffer approved these changes Apr 26, 2024

View reviewed changes

bshaffer merged commit 6d164d9 into main Apr 26, 2024
27 checks passed

bshaffer deleted the spanner-spedify-partialresultset branch April 26, 2024 14:45

release-please bot mentioned this pull request Apr 26, 2024

chore(main): release 0.243.0 #7255

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Improved latency for executeStreaminSql calls #7254

feat: Improved latency for executeStreaminSql calls #7254

saranshdhingra commented Apr 24, 2024 •

edited

Loading

ser-sergeev commented May 15, 2024

feat: Improved latency for executeStreaminSql calls #7254

feat: Improved latency for executeStreaminSql calls #7254

Conversation

saranshdhingra commented Apr 24, 2024 • edited Loading

On my local mac (Protobuf extension enabled):

Test 1:

Test 2:

Test 3:

On a high power compute VM (protobuf enabled)

Test 1:

Test 2:

Test 3:

ser-sergeev commented May 15, 2024

saranshdhingra commented Apr 24, 2024 •

edited

Loading