Investigate throughput for multiple tables in state store committer lambda #3117

patchwork01 · 2024-08-22T07:53:41Z

Background

Related to:

When we added system tests for throughput of the state store committer in StateStoreCommitterThroughputST, we found that throughput is considerably reduced when we have 10 Sleeper tables.

Description

We'd like to find out why throughput is reduced so much for multiple tables, and see how we can minimise the reduction.

Analysis

We can gather more detail on how throughput behaves in the lambda, since right now we only get the raw throughput per table. We can also look at possible causes.

There are a number of possible causes for this:

Throughput is limited for the FIFO queue
Throughput is limited for the Lambda event source
Multiple tables are processed by a single lambda invocation too often
Lambda instances spend time coming up to date as tables are swapped between them
Lambda instances spend time waiting for a retry adding a transaction as tables are swapped between them

Throughput limited

We could enable high throughput for SQS FIFO:

https://docs.aws.amazon.com/AWSSimpleQueueService/latest/SQSDeveloperGuide/high-throughput-fifo.html
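As a rough sketch of what enabling this involves, the AWS guide linked above describes two queue attributes, DeduplicationScope and FifoThroughputLimit, that switch a FIFO queue to high throughput mode. The class and method names below are hypothetical and only build the attribute map; how the map would be wired into Sleeper's deployment is not shown. Note that the per-group limit only helps if each Sleeper table maps to its own message group ID, which is an assumption here.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Sleeper: collects the SQS queue attributes
// that the AWS documentation lists for high throughput FIFO mode.
class HighThroughputFifoAttributes {

    /**
     * Builds queue attributes that could be passed to an SQS CreateQueue request.
     * FifoQueue can only be set when the queue is created.
     */
    static Map<String, String> highThroughputFifoAttributes() {
        Map<String, String> attributes = new LinkedHashMap<>();
        attributes.put("FifoQueue", "true");
        // Deduplicate within each message group rather than across the whole queue
        attributes.put("DeduplicationScope", "messageGroup");
        // Apply the throughput quota per message group ID rather than per queue
        attributes.put("FifoThroughputLimit", "perMessageGroupId");
        return attributes;
    }

    public static void main(String[] args) {
        highThroughputFifoAttributes()
                .forEach((name, value) -> System.out.println(name + "=" + value));
    }
}
```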

We've split out this issue:

High throughput FIFO queue for state store committer #3134

Multiple tables processed by a single lambda invocation

We could log out statistics for how many tables were processed by a lambda invocation, since we can already get that information from QueryStateStoreCommitterLogs and StateStoreCommitterRuns. We could also assert on that in StateStoreCommitterThroughputST.
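A minimal sketch of the kind of statistic this could produce, assuming each commit found in the logs can be reduced to an invocation ID and a table ID. The CommitEntry class and both methods are hypothetical and do not reflect the real model in StateStoreCommitterRuns.

```java
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch, not the real Sleeper classes.
class TablesPerInvocation {

    /** One commit seen in the logs: which invocation applied it, and to which table. */
    static final class CommitEntry {
        final String invocationId;
        final String tableId;

        CommitEntry(String invocationId, String tableId) {
            this.invocationId = invocationId;
            this.tableId = tableId;
        }
    }

    /** Groups commits by invocation and collects the distinct tables each invocation handled. */
    static Map<String, Set<String>> tablesByInvocation(List<CommitEntry> commits) {
        Map<String, Set<String>> tables = new TreeMap<>();
        for (CommitEntry commit : commits) {
            tables.computeIfAbsent(commit.invocationId, id -> new TreeSet<>()).add(commit.tableId);
        }
        return tables;
    }

    /** Counts invocations that handled more than one table. */
    static long countMultiTableInvocations(List<CommitEntry> commits) {
        return tablesByInvocation(commits).values().stream()
                .filter(tableIds -> tableIds.size() > 1)
                .count();
    }

    public static void main(String[] args) {
        List<CommitEntry> commits = List.of(
                new CommitEntry("invocation-1", "table-a"),
                new CommitEntry("invocation-1", "table-b"),
                new CommitEntry("invocation-2", "table-a"));
        System.out.println(tablesByInvocation(commits));
        System.out.println(countMultiTableInvocations(commits));
    }
}
```

A system test could then assert an upper bound on the multi-table count, although what bound is reasonable would need to come from measurements.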

We've split out this issue:

Track multiple Sleeper tables in a single invocation of state store committer #3135

Spending extra time when tables are swapped

We already log out how much time is spent coming up to date in TransactionLogHead. We could add processing of that in QueryStateStoreCommitterLogs and ReadStateStoreCommitterLogs. We could also log more information about the time spent waiting.
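One way to quantify swapping, sketched under the assumption that each commit in the logs can be attributed to a Lambda instance (for example by its log stream) and that commits are read in time order. The Commit class and countSwapsByTable method are hypothetical, not existing Sleeper code.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch, not the real Sleeper classes.
class TableSwaps {

    /** One commit seen in the logs: the table it applied to and the Lambda instance that ran it. */
    static final class Commit {
        final String tableId;
        final String instanceId;

        Commit(String tableId, String instanceId) {
            this.tableId = tableId;
            this.instanceId = instanceId;
        }
    }

    /**
     * Counts, for each table, how many times consecutive commits to that table
     * came from different Lambda instances. Commits must be in time order.
     */
    static Map<String, Integer> countSwapsByTable(List<Commit> commitsInTimeOrder) {
        Map<String, String> lastInstanceByTable = new HashMap<>();
        Map<String, Integer> swapsByTable = new TreeMap<>();
        for (Commit commit : commitsInTimeOrder) {
            // put returns the instance that previously handled this table, or null if none
            String previous = lastInstanceByTable.put(commit.tableId, commit.instanceId);
            swapsByTable.putIfAbsent(commit.tableId, 0);
            if (previous != null && !previous.equals(commit.instanceId)) {
                swapsByTable.merge(commit.tableId, 1, Integer::sum);
            }
        }
        return swapsByTable;
    }

    public static void main(String[] args) {
        List<Commit> commits = List.of(
                new Commit("table-a", "instance-1"),
                new Commit("table-a", "instance-2"),
                new Commit("table-b", "instance-2"));
        System.out.println(countSwapsByTable(commits));
    }
}
```

Comparing the swap count against the time logged for coming up to date would show whether swaps actually account for the lost throughput.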

We've split out this issue:

Track Sleeper tables swapping between state store committer Lambda instances #3136

This is also related:

Chart throughput over time in state store throughput test #3098

patchwork01 added the statestore-module label Aug 22, 2024

patchwork01 added this to the 0.25.0 milestone Aug 22, 2024

patchwork01 mentioned this issue Aug 22, 2024

Support asynchronous commit of all state store updates #2934

Closed

patchwork01 added the parent-issue label and removed the statestore-module label Aug 27, 2024

patchwork01 removed this from the 0.25.0 milestone Aug 27, 2024

This was referenced Aug 27, 2024

Merge transactions together in an invocation of state store committer lambda #2992

Open

Optimise job tracker updates on state store commits #3121

Open

patchwork01 mentioned this issue Nov 8, 2024

Improve compaction job throughput #3643

Open

patchwork01 changed the title ~~Throughput for multiple tables in state store committer~~ Investigate throughput for multiple tables in state store committer Nov 12, 2024

patchwork01 changed the title ~~Investigate throughput for multiple tables in state store committer~~ Investigate throughput for multiple tables in state store committer lambda Nov 12, 2024
