[Spike] Research and confirm which Epiphany components scale down correctly/incorrectly #1497
Related to #1051.
Results are described in the next comment.
Scaling status
This document describes results of upscale/downscale processes for main Epiphany components.

Prerequisites

Kubernetes master
Upscale
Supported.
Downscale
Not supported with a log entry:
Summary
Need to add support for downscale. Tested scheme: 1->3.

Kubernetes node
Upscale
Upscale is ok, checked nodes list and status via
Downscale
Summary
Need to add support for downscale. Tested scheme: 1->3->0.

Logging
Upscale
As expected, but sometimes the number of retries is not enough for
Downscale
Summary
Step See As in other components, disks should be removed after downscale. Tested scheme: 1->2->3->1->0.

Monitoring
Upscale
Supported, it was verified that
Downscale
Disks are not removed.
Summary
Disks should be removed after downscale. Tested scheme: 1->3->2->0.

Kafka
Verification commands
systemctl status kafka
systemctl status zookeeper
# list brokers
echo dump | nc localhost 2181  # check available brokers
/opt/zookeeper-3.4.12/bin/zkCli.sh -server localhost:2181 ls /brokers/ids
# list topics
/opt/kafka_2.12-2.3.1/bin/kafka-topics.sh --zookeeper localhost:2181 --list
# create a topic
/opt/kafka_2.12-2.3.1/bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic topic1
Set the replication factor greater than the number of available brokers to see how many brokers are available: topic creation then fails with an error that reports the broker count.
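When repeating a scaling test, it can help to turn the broker listing into a single number that is easy to compare with the expected cluster size. The following is a minimal sketch, not part of Epiphany: `count_broker_ids` is a hypothetical helper, and it assumes that `zkCli.sh ... ls /brokers/ids` prints the broker IDs on a line of their own in the form `[0, 1, 2]`, which may differ between ZooKeeper versions.

```shell
#!/bin/sh
# Hypothetical helper (not part of Epiphany): counts broker IDs read from stdin.
# Assumption: the IDs appear on a line of their own in the form "[0, 1, 2]".
#
# Possible usage on a Kafka host:
#   /opt/zookeeper-3.4.12/bin/zkCli.sh -server localhost:2181 ls /brokers/ids | count_broker_ids
count_broker_ids() {
    awk '
        # Remember the last line that consists only of a bracketed ID list.
        /^\[[0-9, ]*\]$/ { list = $0 }
        END {
            # Strip brackets and spaces, leaving a comma-separated list.
            gsub(/\[/, "", list)
            gsub(/\]/, "", list)
            gsub(/ /, "", list)
            # No list found, or an empty list: report zero brokers.
            if (list == "") { print 0; exit }
            print split(list, ids, ",")
        }
    '
}
```

The function only parses text, so it can be tried without a running cluster by piping a sample line into it. If the output format on a given ZooKeeper version differs from the assumed one, the pattern in the `awk` script would need adjusting.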
Configuration files
Results of
Is your feature request related to a problem? Please describe.
We don't really know which Epiphany components scale down correctly and which do not. It's not documented anywhere.
Describe the solution you'd like:
This spike should help us provide well-described CLUSTER documentation. We should also know what has to be done to provide full autoscaling.
After the spike, we should create new issues to improve/implement autoscaling.