feat: Pre-Fetch Streamer Messages #269

darunrs · 2023-10-05T00:54:06Z

Historical streamer messages are not fetched in coordinator prior to the IndexerRunner call. As a result, we cannot apply the latency saving benefits of having coordinator cache the streamer message for use by runner. Instead, we want to pre fetch from S3 so that runner invocations don't have to wait for the streamer message to be retrieved from S3.

In addition, it's possible for real-time messages to backup temporarily preventing the cached message from being used. So, we also want to prefetch any messages which aren't found in the cache.

The new workflow works by having two loops for each worker thread: a producer and a consumer. The producer loads a promise for fetching the block (either from cache or S3) into an array. The consumer then removes the first element from the array and processes it, deleting the streamer message upon success. While one block is being processed, the other blocks are being fetched. This ensures that wait time is minimal. The producer loop attempts to keep the array as close to full as possible.

darunrs · 2023-10-05T18:50:21Z

#264

morgsmccauley

Really great start on this - left a couple comments :)

runner/src/lake-client/lake-client.ts

runner/src/metrics.ts

runner/src/lake-client/lake-client.ts

runner/src/stream-handler/worker.ts

darunrs · 2023-11-06T21:35:00Z

I'm still experimenting with the blocking values. In the meantime, I released the changes I've made so far. Let me know what you think!

darunrs · 2023-11-06T23:21:04Z

I ran roughly 5000 messages of sweat blockheight before and after the blocking commit and I can visually see an improvement. Left is with the commit and right is without it. Block wait time is lower and less spread out. Although, it was already low enough to not impact much anymore anyway. Execution duration and overhead latency spread is tighter too. The BPS curve is more linear and the peak was higher too. In any case, it seems the commit definitely helped.

runner/src/lake-client/lake-client.ts

runner/src/redis-client/redis-client.ts

runner/src/stream-handler/worker.ts

darunrs · 2023-11-08T21:02:59Z

I did some more experimentation and found that blocking does in fact block both loops, not just the producer loop. As a result, once no stream messages are present, execution of the functions becomes slow as the xread blocks the whole thread for the blocking duration. Removing it returned the speed improvements.

In addition, I ran a test against sweat_blockheight where I appended the block-height of each stream message to one file and then appended the block height to a second file after runFunction is called. I diff'd the two files and found no differences. This was enough to verify that every message in the stream is seen and processed exactly once.

morgsmccauley

LGTM - great work :)

morgsmccauley · 2023-11-08T21:22:01Z

Just need to fix failing tests

This reverts commit 262b183.

Reverts #269

Reverts #377 - Merging #269 back in

darunrs linked an issue Oct 5, 2023 that may be closed by this pull request

Pre-Fetch Streamer Messages #264

Closed

darunrs changed the base branch from main to cacheStreamer October 5, 2023 03:48

darunrs force-pushed the cacheStreamer branch from 698440a to 498ff1f Compare October 5, 2023 18:22

darunrs force-pushed the cacheHistorical branch from e0604e0 to 0024859 Compare October 5, 2023 18:31

darunrs force-pushed the cacheStreamer branch from cc018d0 to 4f97940 Compare October 5, 2023 23:42

Base automatically changed from cacheStreamer to main October 5, 2023 23:50

darunrs mentioned this pull request Oct 5, 2023

Optimize Runner Streamer Message Acquisition #204

Closed

darunrs force-pushed the cacheHistorical branch 3 times, most recently from f71a38d to 4e10f80 Compare November 2, 2023 01:12

darunrs marked this pull request as ready for review November 2, 2023 02:18

darunrs requested a review from a team as a code owner November 2, 2023 02:18

darunrs requested a review from morgsmccauley November 2, 2023 02:18

darunrs changed the title ~~Pre-Fetch Historical Streamer Messages~~ feat: Pre-Fetch Historical Streamer Messages Nov 2, 2023

darunrs changed the title ~~feat: Pre-Fetch Historical Streamer Messages~~ feat: Pre-Fetch Streamer Messages Nov 2, 2023

morgsmccauley requested changes Nov 3, 2023

View reviewed changes

darunrs requested a review from morgsmccauley November 6, 2023 21:35

morgsmccauley reviewed Nov 7, 2023

View reviewed changes

darunrs added 9 commits November 8, 2023 10:55

MVP of Pre Fetching Historical

bddfd0e

Implement producer and consumer loops

1160091

Updates to allow for test metrics without deleting messages

a555058

Add some metrics for more testing

79374f9

Migrated S3 code to new class

c897125

Add more Instrumentation

cd0d2e5

Clean up code

9d17c22

Encapsulate streamer message builder function

b790af1

Ensure Unit Tests Pass

e3a3869

darunrs added 13 commits November 8, 2023 10:55

Undo local testing changes

c32290f

Address Offline Comments

4041869

Implement Morgan's Metrics Fix

607cd37

Prepare Code for PR

a0ab89b

Complete Lake Client Tests

044dd27

Finalize Metrics for PR

00132ab

Tune parameters for waiting

6321526

Address unit test failures

0e3e26e

Perform Block conversion in Lake Client and address cleanliness comments

b155dc3

Remove Unrelated Metrics

1aa3627

Eliminate Global Variables and Clean Up Code

b050e4c

Experiment with Blocking

7847eb0

Address More Comments

43911fd

darunrs force-pushed the cacheHistorical branch from ae1aef8 to 43911fd Compare November 8, 2023 18:56

Undo Blocking for XRead

19e9406

darunrs requested a review from morgsmccauley November 8, 2023 21:03

morgsmccauley approved these changes Nov 8, 2023

View reviewed changes

Fix failing tests

6cb07b1

darunrs merged commit 262b183 into main Nov 8, 2023
3 checks passed

darunrs deleted the cacheHistorical branch November 8, 2023 22:23

morgsmccauley mentioned this pull request Nov 9, 2023

Prod Release 09/11/23 #376

Merged

morgsmccauley added a commit that referenced this pull request Nov 9, 2023

Revert "feat: Pre-Fetch Streamer Messages (#269)"

8ed2277

This reverts commit 262b183.

morgsmccauley mentioned this pull request Nov 9, 2023

Revert "feat: Pre-Fetch Streamer Messages" #377

Merged

morgsmccauley added a commit that referenced this pull request Nov 9, 2023

Revert "feat: Pre-Fetch Streamer Messages" (#377)

91f7999

Reverts #269

morgsmccauley mentioned this pull request Nov 9, 2023

Revert "Revert "feat: Pre-Fetch Streamer Messages"" #378

Merged

morgsmccauley added a commit that referenced this pull request Nov 9, 2023

Revert "Revert "feat: Pre-Fetch Streamer Messages"" (#378)

2f5505b

Reverts #377 - Merging #269 back in

morgsmccauley mentioned this pull request Apr 22, 2024

test stable branch git fix up #687

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Pre-Fetch Streamer Messages #269

feat: Pre-Fetch Streamer Messages #269

darunrs commented Oct 5, 2023 •

edited

Loading

darunrs commented Oct 5, 2023

morgsmccauley left a comment

darunrs commented Nov 6, 2023

darunrs commented Nov 6, 2023 •

edited

Loading

darunrs commented Nov 8, 2023 •

edited

Loading

morgsmccauley left a comment

morgsmccauley commented Nov 8, 2023

feat: Pre-Fetch Streamer Messages #269

feat: Pre-Fetch Streamer Messages #269

Conversation

darunrs commented Oct 5, 2023 • edited Loading

darunrs commented Oct 5, 2023

morgsmccauley left a comment

Choose a reason for hiding this comment

darunrs commented Nov 6, 2023

darunrs commented Nov 6, 2023 • edited Loading

darunrs commented Nov 8, 2023 • edited Loading

morgsmccauley left a comment

Choose a reason for hiding this comment

morgsmccauley commented Nov 8, 2023

darunrs commented Oct 5, 2023 •

edited

Loading

darunrs commented Nov 6, 2023 •

edited

Loading

darunrs commented Nov 8, 2023 •

edited

Loading