Store Real Time Streamer Messages in Redis #241
Conversation
👍
Great first start :) I've left a few comments. Can you please also link the issue?
Reduces build and test runtime by 50%
Couple small comments but otherwise looks good!
LGTM!
The streamer message is used by both the coordinator and runner. However, both currently poll the message from S3, which incurs significant latency. To improve this, the streamer message will now be cached in Redis with a TTL and pulled by runner from Redis. Only on a cache miss will runner fall back to pulling from S3.
Pulling from S3 currently takes 200-500ms, which is roughly 80-85% of the overall execution time of a function in runner. With the message cached, a cache hit loads the data in 1-3ms in local testing, which corresponds to about 3-5% of the execution time, or an 1100% improvement in latency. Reducing network-related activity to such a low percentage of execution time also greatly reduces the variability of a function's execution time. Cache hits and misses will be logged so the TTL can be tuned further to reduce misses. In addition, processing the block takes around 1-3ms. This processing can be moved to before caching, saving 1-3ms each time that block is read from cache, which adds up over time. That improvement will matter more for historical backfill, which is planned to be cached soon.
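The read path described above is a standard cache-aside pattern. Here is a minimal TypeScript sketch of it; `RedisLike`, `S3Like`, `FakeRedis`, `fetchStreamerMessage`, the key format, and the 60-second TTL are all hypothetical stand-ins, not the actual interfaces or values used in this PR:

```typescript
// Hypothetical minimal interfaces standing in for the real Redis and S3 clients.
interface RedisLike {
  get(key: string): Promise<string | null>;
  set(key: string, value: string, ttlSeconds: number): Promise<void>;
}

interface S3Like {
  getObject(key: string): Promise<string>;
}

// In-memory stand-in so the sketch runs without a Redis server.
class FakeRedis implements RedisLike {
  private store = new Map<string, { value: string; expiresAt: number }>();
  async get(key: string): Promise<string | null> {
    const entry = this.store.get(key);
    if (!entry || entry.expiresAt < Date.now()) return null;
    return entry.value;
  }
  async set(key: string, value: string, ttlSeconds: number): Promise<void> {
    this.store.set(key, { value, expiresAt: Date.now() + ttlSeconds * 1000 });
  }
}

// Assumed TTL; the PR tunes the real value using the hit/miss logs.
const STREAMER_MESSAGE_TTL_SECONDS = 60;

// Placeholder for the 1-3ms block processing mentioned in the description.
function processBlock(raw: string): string {
  return raw.trim();
}

// Cache-aside fetch: try Redis first, fall back to S3 on a miss, and cache
// the already-processed message so later readers skip the processing cost.
async function fetchStreamerMessage(
  blockHeight: number,
  redis: RedisLike,
  s3: S3Like,
  log: (msg: string) => void = console.log,
): Promise<string> {
  const key = `streamer:message:${blockHeight}`;
  const cached = await redis.get(key);
  if (cached !== null) {
    log(`cache hit for block ${blockHeight}`); // logged for TTL tuning
    return cached;
  }
  log(`cache miss for block ${blockHeight}`);
  const raw = await s3.getObject(key);
  const processed = processBlock(raw); // process once, before caching
  await redis.set(key, processed, STREAMER_MESSAGE_TTL_SECONDS);
  return processed;
}
```

Processing before caching is what saves the extra 1-3ms per cached read: every cache hit returns a message that is already in its processed form.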
Tracking Issue: #262
Parent Issue: #204