
Characterize relayer throughput #185

Closed
cam-schultz opened this issue Feb 15, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request tests Improving test coverage through unit/e2e tests, etc.

Comments

@cam-schultz
Collaborator

cam-schultz commented Feb 15, 2024

Context and scope
The relayer processes Warp messages from separate chains concurrently, but within a single chain messages are processed serially (see #31). There is therefore a per-chain throughput limit that we should measure and characterize.

Discussion and alternatives

  • Throughput is very likely network bound, as the application side processing complexity is insignificant compared to the number of network round trips required to relay a message. We should aim to answer the following questions:
    • Is single message relaying in fact network bound?
    • What's the breakdown between network latency and application side processing latency for a single message?
    • What's the minimum and maximum number of sequential network round trips needed to relay a single message?

Open questions
At what load level do concurrent database writes become a bottleneck?

@cam-schultz cam-schultz added enhancement New feature or request tests Improving test coverage through unit/e2e tests, etc. labels Feb 15, 2024
@cam-schultz cam-schultz added this to the Post-Durango fast follows milestone Mar 8, 2024
@cam-schultz cam-schultz moved this from Backlog 🗄️ to In Progress 🏗 in Platform Engineering Group Jun 6, 2024
@cam-schultz cam-schultz self-assigned this Jun 6, 2024
@cam-schultz
Collaborator Author

Alongside implementing #31, load testing was performed using https://github.com/ava-labs/as-simulator. The tests were performed on Fuji, and measured on-chain end-to-end Teleporter message latency. At 10 TPS sustained, the measured average latency was ~2 seconds, which, considering the per-chain expected time to finality of ~1s, is close to optimal.

The observed bottleneck in raw throughput was a limit on the number of pending transactions from a single address that AvalancheGo nodes will keep in the mempool; further transactions beyond that limit are ejected. #256 addresses this corner case.

This round of testing provides sufficient evidence that the relayer's concurrency model scales well enough that it is unlikely to be the bottleneck in an end-to-end cross-chain system. Closing this ticket out as completed. Future profiling and optimization work will be tracked in new tickets with more focused target areas.

@github-project-automation github-project-automation bot moved this from In Progress 🏗 to Done ✅ in Platform Engineering Group Jun 17, 2024