
AKS cluster autoscaling tests #1441

Closed
mkyc opened this issue Jul 13, 2020 · 3 comments

mkyc (Contributor) commented Jul 13, 2020

Prepare manual load tests to ensure that the cluster scales up and down as expected. The load tests will be incorporated into the epiphany code later, not at this point.

Is your feature request related to a problem? Please describe.
The AKS clusters we will provide in epiphany need to have horizontal (node) autoscaling enabled and tested.

Describe the solution you'd like
There should be a Kubernetes Deployment defined with images that apply a high load on the cluster. With this Deployment we should be able to scale the number of replicas up and down and check whether the cluster autoscaler responds accordingly by scaling nodes up and down.

The Deployment and a description of the commands should be committed to the sub-directory containing the Terraform scripts of the AKS cluster. There is no need to integrate it with any test pipeline at this point.

Describe alternatives you've considered
Unknown.

Additional context
No.

mkyc (Contributor, Author) commented Jul 16, 2020

This task is in fact very similar to #1443 and should be synchronised with it.

@ghost ghost removed the status/grooming-needed label Jul 17, 2020
@mkyc mkyc added this to the S20200729 milestone Jul 17, 2020
@mkyc mkyc modified the milestones: S20200729, S20200813 Jul 29, 2020
rpudlowski93 (Contributor) commented Aug 10, 2020

Autoscaling tests are done. To reproduce the manual load test, follow these steps:

  1. Create an AKS cluster with the option enable_auto_scaling = true, a min and max node count, and any other custom parameters (see the Terraform sketch after this list).
  2. Apply a deployment with the simple php-apache image:
    kubectl apply -f https://k8s.io/examples/application/php-apache.yaml
  3. Create a Horizontal Pod Autoscaler based on CPU/memory or other metrics (e.g. from Prometheus). The Metrics Server is installed by default in AKS versions higher than 1.10:
    kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=50
  4. Create a deployment with a container that increases the load on the nodes by running an infinite loop of requests to the php-apache service created in the previous step:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: load-generator
spec:
  selector:
    matchLabels:
      run: load-generator
  replicas: 1
  template:
    metadata:
      labels:
        run: load-generator
    spec:
      containers:
      - name: load-generator
        image: busybox
        args:
        - /bin/sh
        - "-c"
        - "while true; do wget -q -O- http://php-apache; done" 
  5. Increase the load as needed:
    kubectl scale deployment load-generator --replicas=5
  6. Check the effect using monitoring tools:
    kubectl get hpa
    or Azure Insights (attached screenshot: autoscaling_nodes_cpu.png, node CPU usage).
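
For step 1, a minimal Terraform sketch of how the autoscaling options can be set on the AKS node pool, assuming the azurerm provider; the resource names, the reference to azurerm_resource_group.rg, the VM size and the node counts are illustrative, not the actual epiphany module values:

resource "azurerm_kubernetes_cluster" "aks" {
  name                = "aks-autoscaling-test"              # illustrative name
  location            = azurerm_resource_group.rg.location  # assumes an existing resource group
  resource_group_name = azurerm_resource_group.rg.name
  dns_prefix          = "aks-autoscaling-test"

  default_node_pool {
    name                = "default"
    vm_size             = "Standard_DS2_v2"  # example VM size
    enable_auto_scaling = true               # turn on the cluster autoscaler for this pool
    min_count           = 1                  # lower bound of nodes
    max_count           = 5                  # upper bound of nodes
  }

  identity {
    type = "SystemAssigned"
  }
}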

Summing up: after some tests and observations, autoscaling works fine, and after applying a high load the cluster scales up and down as expected. By default, scale up works very fast and new nodes are created almost immediately once the scale-up condition is met, while scale down is by default executed only after the scale-down condition has held for about 10 minutes (with the 50% CPU target in this case).
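
The roughly 10-minute scale-down delay matches the cluster autoscaler's default "unneeded" time. If it ever needed tuning, the azurerm provider exposes it through an auto_scaler_profile block on the cluster resource; a hedged sketch below, with the values shown being the documented defaults and the rest of the cluster definition omitted:

resource "azurerm_kubernetes_cluster" "aks" {
  # ... name, node pool and identity as in the sketch above ...

  auto_scaler_profile {
    scan_interval              = "10s"  # how often the autoscaler re-evaluates the cluster
    scale_down_delay_after_add = "10m"  # wait after a scale up before considering scale down
    scale_down_unneeded        = "10m"  # how long a node must be unneeded before removal
  }
}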

@mkyc mkyc modified the milestones: S20200813, S20200827 Aug 13, 2020
@przemyslavic przemyslavic self-assigned this Aug 18, 2020
przemyslavic (Collaborator) commented

Autoscaling has been tested. The number of pods and nodes varied depending on the load.

@mkyc mkyc closed this as completed Aug 25, 2020