confluent-keda-poc

Description

This repo is to:

Demo Keda connection to confluent
Demo scaling capabilities of Keda

Set up

Kubernetes

Used Minikube To get endpoint for minikube

minikube service dep --url

Go

touch main.go
// Add in basic server codes
go mod init confluent-keda-poc
go mod tidy

Keda

Reference:

helm repo add kedacore https://kedacore.github.io/charts
helm repo update
kubectl create namespace keda
helm install keda kedacore/keda --namespace keda

Observations

Kafka

Spammed produce endpoint

http://127.0.0.1:56825/api/produce

Confluent Cloud Consumer Lag

Keda Consumer Lag Setting: lagThreshold: "50"

Scaling up
Scaling down
Scaling down to below original replicas of 3

Keda Consumer Config: minReplicaCount: 1

Kafka Total Lag

specified to a scaling target of 100 total consumer lag

Total Kafka consumer lag

Note that Specific topic scaling is not set for topic_2. This demo shows that KEDA is triggering based on total Kafka Consumer Lag

HPA Trigger Scale Up to 2 replicas

The name of the external metrics also points to the trigger for total Kafka Consumer Lag

2 Replicas

CPU

Hit 20% Average CPU, Scaling up

cpu scaler need HPAContainerMetrics feature enabled

Cron

Before Cron scaling
After Cron scaling
CPU Scaling during Cron scaling period

Custom KEDA Codes to exclude Partitions stuck due to error

Custom KEDA Codes

Github Link: https://github.com/JosephABC/keda

Changes are in kafka_scaler.go in getLagForPartition function

Demo

Consumer Lag remain the same due to being stuck
Custom KEDA code excludes Consumer Lag for these partitions, hence 0 consumer lag shown in HPA

KEDA does not trigger scaling of consumer deployment based on these stuck partitions

Issues to think about

Kafka-scaler

Message in a partition encounters error and is unable to be consumed and offset cannot be committed.
Partition Key specified for topic. Large consumer lag observed on one/few particular partitions. Scaling out will probably have less effect on performance
Metric watched is the total consumer lag for Topic or all topics subcribed by the consumer group

CPU-Scaler

containerName parameter requires Kubernetes cluster version 1.20 or higher with HPAContainerMetrics feature enabled.

Others

To Observe and kill process on local

netstat -anop | grep -i 5000
pkill <PID>
kill -9 <PID>

To Deploy KEDA to Cluster

IMAGE_REGISTRY=docker.io IMAGE_REPO=josephangbc make publish
IMAGE_REGISTRY=docker.io IMAGE_REPO=josephangbc make deploy

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
controllers		controllers
images		images
manifests		manifests
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

confluent-keda-poc

Description

Set up

Kubernetes

Go

Keda

Observations

Kafka

Kafka Total Lag

CPU

Cron

Custom KEDA Codes to exclude Partitions stuck due to error

Custom KEDA Codes

Demo

Issues to think about

Kafka-scaler

CPU-Scaler

Others

To Deploy KEDA to Cluster

About

Releases

Packages

Languages

josephangbc/confluent-keda-poc

Folders and files

Latest commit

History

Repository files navigation

confluent-keda-poc

Description

Set up

Kubernetes

Go

Keda

Observations

Kafka

Kafka Total Lag

CPU

Cron

Custom KEDA Codes to exclude Partitions stuck due to error

Custom KEDA Codes

Demo

Issues to think about

Kafka-scaler

CPU-Scaler

Others

To Deploy KEDA to Cluster

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages