Kinesis source improvements for larger deployments #99

Merged
merged 1 commit into develop from kinesis-source-improvements on Nov 26, 2024

Conversation

@istreeter (Contributor) commented on Nov 25, 2024

Three improvements in preparation for using common-streams on larger pods or with more shards.

  1. Configurable max leases to steal at one time. The KCL default is 1. To avoid latency during scale up/down and pod rotation, we want the app to be quick to acquire shard-leases to process. Bigger instances tend to hold more shard-leases per instance, so we increase how aggressively leases are acquired.
  2. Set KCL `maxPendingProcessRecordsInput` to the minimum allowed (1). This is a precaution to avoid potential OOMs in case a single pod subscribes to a very large number of shards. It shouldn't have a negative performance impact, because common-streams apps tend to manage their own pre-fetching. In fact, this makes the Kinesis source more similar to the other sources, which don't have built-in pre-fetching.
  3. Fixes a bug in which latency was set by the latest record in a batch, not the earliest (see the sketch after this list).
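For item 3, here is a minimal sketch of the intended behaviour. The names `KinesisRecord` and `arrivalTimestamp` are illustrative assumptions, not the library's actual types; the point is only that the latency metric should be driven by the earliest record in the batch.

```scala
import java.time.{Duration, Instant}

object LatencySketch {

  // Illustrative record type; common-streams wraps KCL records differently.
  final case class KinesisRecord(arrivalTimestamp: Instant)

  // Latency of a batch is measured from the *earliest* record, so the metric
  // reflects how long the oldest record in the batch has been waiting.
  def batchLatency(batch: List[KinesisRecord], now: Instant): Duration =
    batch
      .map(_.arrivalTimestamp)
      .reduceOption((a, b) => if (a.isBefore(b)) a else b) // earliest arrival
      .map(earliest => Duration.between(earliest, now))
      .getOrElse(Duration.ZERO)                            // empty batch => no latency
}
```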

@istreeter changed the base branch from main to develop on November 25, 2024
@istreeter force-pushed the kinesis-source-improvements branch 2 times, most recently from 52c5345 to b779ec3, on November 25, 2024
@istreeter force-pushed the kinesis-source-improvements branch from b779ec3 to f07cc4b on November 26, 2024
@istreeter changed the base branch from develop to main on November 26, 2024
@istreeter changed the base branch from main to develop on November 26, 2024
@@ -10,6 +10,7 @@ snowplow.defaults: {
     maxRecords: 1000
   }
   leaseDuration: "10 seconds"
+  maxLeasesToStealAtOneTimeFactor: 2.0
Contributor commented:

Curious that this is a decimal: you can't have a fraction of a lease, surely? It doesn't seem like a likely oversight, so I'm guessing there's a reason I don't see?

@istreeter (Contributor, Author) replied:

This number is first multiplied by the number of processors. It is helpful to make it a decimal: for example, if we want an 8-core instance to steal 2 leases at one time, we set this value to 0.25.

I've been using this pattern in lots of places recently. It's a good way to make the app auto-guess a good configuration, instead of putting the burden on the user to configure it appropriately for the instance's vertical size. And I consistently put Factor as a suffix on params that get multiplied by the number of processors.

Contributor replied:

Ah I see, interesting. A bit nuanced, but I get it: cores * this factor = max leases to steal at one time. I assume it rounds up (ceil), since it's a max. Cool, thanks for explaining!
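As a concrete illustration of the thread above, here is a minimal sketch of how the factor could be resolved into a whole number of leases. It assumes the value is scaled by the available processors and rounded up, as discussed in the review; the helper name and the exact rounding are assumptions, not the library's actual code.

```scala
object LeaseStealSketch {

  // Hypothetical helper: resolve a "*Factor" setting into a concrete KCL value.
  // Assumes the factor is multiplied by available cores and rounded up; the
  // real common-streams implementation may differ in detail.
  def maxLeasesToStealAtOneTime(factor: Double): Int = {
    val cores = Runtime.getRuntime.availableProcessors
    math.max(1, math.ceil(factor * cores).toInt)
  }

  // Examples:
  //   factor = 0.25 on an 8-core instance => steal up to 2 leases at a time
  //   factor = 2.0  on a 4-core instance  => steal up to 8 leases at a time
}
```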

@@ -16,6 +16,26 @@ import java.net.URI
 import java.time.Instant
 import scala.concurrent.duration.FiniteDuration

+/**
+ * Config to be supplied from the app's hocon
+ *
Contributor commented:

Love the addition of in-code documentation for the other parameters too.

@colmsnowplow (Contributor) left a comment:

LGTM!

@istreeter merged commit 38aaf95 into develop on Nov 26, 2024 (7 checks passed)
@istreeter deleted the kinesis-source-improvements branch on November 26, 2024
istreeter added a commit to snowplow-incubator/snowflake-loader that referenced this pull request Nov 26, 2024
See snowplow-incubator/common-streams#99 for the relevant change

This library upgrade brings improvements to the Kinesis source, which
should help on vertically larger instances.
istreeter added a commit to snowplow-incubator/snowflake-loader that referenced this pull request Nov 26, 2024
* Bump common-streams to 0.9.0

See snowplow-incubator/common-streams#99 for the relevant change

This library upgrade brings improvements to the Kinesis source, which
should help on vertically larger instances.

* Bump snowflake-ingest-sdk to 3.0.0