
determine and document resource requirements #485

Open
BenTheElder opened this issue May 5, 2019 · 21 comments
Labels
good first issue Denotes an issue ready for a new contributor, according to the "help wanted" guidelines. help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. kind/documentation Categorizes issue or PR as related to documentation. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness.

Comments

@BenTheElder
Member

What would you like to be documented: A more accurate lower bound on resources when using kind with Docker Desktop. Currently we suggest 4 GB / 4 CPU, which, while probably accurate for building Kubernetes, should be more than we need to run a node.

https://kind.sigs.k8s.io/docs/user/quick-start/#creating-a-cluster

Why is this needed: We don't want to overstate requirements dramatically and scare off potential users :-)

We'll need to do some testing to determine the threshold. It should be lower at HEAD; it might also be interesting to check if that is true 🙃

@BenTheElder BenTheElder added the kind/documentation Categorizes issue or PR as related to documentation. label May 5, 2019
@BenTheElder
Member Author

/assign

@nickolaev

I am trying kind in the CircleCI machine executor which seems to be perfectly fine with its 2 CPU cores. Hope this helps.

@BenTheElder
Member Author

thanks!

can confirm that at HEAD, even the smallest settings Docker Desktop currently offers work fine for kind create cluster:
[Screenshot: Docker Desktop resource settings at their minimum values]

$ kind create cluster
Creating cluster "kind" ...
 ✓ Ensuring node image (kindest/node:v1.14.1) 🖼
 ✓ Preparing nodes 📦 
 ✓ Creating kubeadm config 📜 
 ✓ Starting control-plane 🕹️ 
 ✓ Installing CNI 🔌 
 ✓ Installing StorageClass 💾 
Cluster creation complete. You can now use the cluster with:

export KUBECONFIG="$(kind get kubeconfig-path --name="kind")"
kubectl cluster-info

$ kubectl get no
NAME                 STATUS   ROLES    AGE   VERSION
kind-control-plane   Ready    master   20s   v1.14.1

$ kubectl get po --all-namespaces
NAMESPACE     NAME                      READY   STATUS    RESTARTS   AGE
kube-system   coredns-fb8b8dccf-qhm47   1/1     Running   0          23s
kube-system   coredns-fb8b8dccf-rc56v   1/1     Running   0          23s
kube-system   kube-proxy-ksw7d          1/1     Running   0          23s
kube-system   weave-net-dqz4p           2/2     Running   0          23s

@BenTheElder
Member Author

Will actually have to follow up and sample the usage over time; the lowest settings on Docker Desktop / macOS appear to be above our lower bound 🙃
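A minimal way to do that sampling, as a sketch (assumptions: POSIX shell with awk, node containers carrying kind's default "kind" name prefix, and the current `docker stats` `--format` placeholders):

```shell
# Hedged sketch: sample kind node container resource usage with `docker stats`.
# Assumption: node containers are named with the default "kind" prefix.

# Convert a docker-stats memory figure (e.g. "512MiB", "1.5GiB") to whole MiB.
to_mib() {
  case "$1" in
    *GiB) printf '%s\n' "${1%GiB}" | awk '{printf "%.0f\n", $1 * 1024}' ;;
    *MiB) printf '%s\n' "${1%MiB}" | awk '{printf "%.0f\n", $1}' ;;
    *KiB) printf '%s\n' "${1%KiB}" | awk '{printf "%.0f\n", $1 / 1024}' ;;
    *)    echo 0 ;;
  esac
}

# One-shot sample of CPU% and memory for each kind node container;
# loop this (e.g. every 10 seconds) to sample usage over time.
sample_kind_usage() {
  docker stats --no-stream --format '{{.Name}} {{.CPUPerc}} {{.MemUsage}}' \
    | grep '^kind'
}
```

`sample_kind_usage` is just a one-shot reading; wrapping it in a loop (or `watch`) gives the over-time view mentioned above.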

@costinm

costinm commented May 9, 2019

We are using both CircleCI machine and remote_docker environments, which are 2 CPU, to test Istio - it seems to be working quite well. Since Circle doesn't allow higher CPU, it would be good to keep this as a baseline.

@costinm

costinm commented May 9, 2019

Besides - if the ARM64 bugs are fixed, I would hope Kind will run on Raspberry Pi - k8s can run just fine.

@BenTheElder
Member Author

We are using both CircleCI machine and remote_docker environments, which are 2 CPU, to test Istio - it seems to be working quite well. Since Circle doesn't allow higher CPU, it would be good to keep this as a baseline.

A single node should work with considerably less than this; however, the rest of what we can do performance-wise is mostly bound by Kubernetes / CRI / ... CNI is probably the last place where we have room to squeeze this lower, and we're working on that.

It may regress some in the future due to the components we don't control, but keeping everything as low as we can is a high priority 👍

Besides - if the ARM64 bugs are fixed, I would hope Kind will run on Raspberry Pi - k8s can run just fine.

As far as I know, ARM64 works but requires building images yourself. Currently it will be painful to cross-build those because of getting Kubernetes loaded, but that is being worked on at low priority.

There is some limited ARM64 CI working now from the openlab folks.

@BenTheElder
Member Author

These will shift a bit with the updated CNI configuration (they should be lower), but I will remeasure.

Still need to document as well.

@BenTheElder
Member Author

since we've approached the limits of what we can reduce from kind's end alone, some experimentation with making upstream Kubernetes lighter: https://github.com/BenTheElder/kubernetes/tree/experiment

if we go forward with this change upstream then kind will support leveraging it immediately.

@BenTheElder
Member Author

I've improved that prototype with the goals of:

  • being able to test / use out of tree cloud providers without in-tree code well ahead of the in-tree removal
  • being able to ship kind with lighter node images

So far that more or less works and I've created a provisional PR upstream.

At this point I think the next step is a KEP, expect more on this in the near future :-)

We may need to slightly adjust what else we ship, though (e.g. currently we are missing the metrics APIs), but we will continue to push for lightweight clusters overall. I think we can lighten some other things at the same time to make room without adding much overhead.

@BenTheElder
Member Author

#932 + recent containerd build infra and upgrades should reduce the memory overhead per pod.

@BenTheElder
Member Author

/help

@k8s-ci-robot
Contributor

@BenTheElder:
This request has been marked as needing help from a contributor.

Please ensure the request meets the requirements listed here.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-help command.

In response to this:

/help

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Nov 7, 2019
@BenTheElder BenTheElder added the good first issue Denotes an issue ready for a new contributor, according to the "help wanted" guidelines. label Nov 19, 2019
@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 17, 2020
@kubernetes-sigs kubernetes-sigs deleted a comment from fejta-bot Feb 18, 2020
@BenTheElder BenTheElder added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Feb 18, 2020
@shreyasbapat

Hi @BenTheElder,
I wish to work on this issue and get started. Can you confirm whether only the image of the Docker Desktop interface needs to be replaced with the one you posted above?

@BenTheElder
Member Author

The image can actually be left as-is, since it refers to building the Kubernetes image, which takes more resources.

it would be helpful to determine exactly how much resources a typical kind cluster uses in a repeatable fashion and keep this documented.
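One possible repeatable procedure, as a hedged sketch (assumes a default single-node cluster, so the node container is named `kind-control-plane`; the 10-second interval and 30-sample window are arbitrary choices, not a prescribed method):

```shell
# Hedged sketch: record peak control-plane memory over a fixed window so the
# procedure (and its result) can be written down and re-run later.
# Assumption: cluster created with defaults, so the node container is
# named "kind-control-plane".

# Return the largest of a list of integer samples (in MiB).
max_of() { printf '%s\n' "$@" | sort -n | tail -n 1; }

measure_peak_mib() {
  samples=""
  for _ in $(seq 1 30); do              # ~5 minutes at 10-second intervals
    mem=$(docker stats --no-stream --format '{{.MemUsage}}' kind-control-plane |
      awk '{ v = $1
             if (v ~ /GiB/) { sub(/GiB/, "", v); v *= 1024 }
             else           { sub(/MiB/, "", v) }
             printf "%.0f\n", v }')     # normalize the reading to MiB
    samples="$samples $mem"
    sleep 10
  done
  max_of $samples
}
```

Writing the interval, window, and kind/Kubernetes versions down alongside the resulting peak is what makes the number verifiable later.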

@shreyasbapat

it would be helpful to determine exactly how much resources a typical kind cluster uses in a repeatable fashion and keep this documented.

And for that, I will have to perform some experiments and report back, right? Do you suggest I check multiple times?

@BenTheElder
Member Author

that's a good idea, I think the most important thing is that we write down how we determined this somewhere so we can come back and verify what it's currently at :-)

@shreyasbapat

that's a good idea, I think the most important thing is that we write down how we determined this somewhere so we can come back and verify what it's currently at :-)

On it. Will notify once I am done

@BenTheElder BenTheElder removed their assignment Jun 23, 2020
@shekhar-rajak

it would be helpful to determine exactly how much resources a typical kind cluster uses in a repeatable fashion and keep this documented.

Different hardware and system configs can behave differently; I'm not sure where to benchmark memory & time.

@jayunit100
Contributor

FWIW I ran an experiment on my kid's 2-core i5 with 8 GB of RAM dedicated to Docker and was able to run a four-node kind cluster with no issues, including scheduling of 10+ pods (14 if you include CNI) and sonobuoy.

Meanwhile, pushing to ten nodes on a massive server with 48 cores failed because of etcd.

So it sounds like the most important tweak for kind may be running etcd in memory when running a large number of nodes.
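A hedged sketch of what "etcd in memory" could look like using kind's existing `extraMounts` cluster config: the hostPath below is hypothetical, it assumes `/dev/shm` is tmpfs-backed on the host, and whether this actually fixes the ten-node case is exactly what would need testing.

```yaml
# kind-etcd-tmpfs.yaml -- a sketch, not a verified recipe.
# Mount a tmpfs-backed host directory over etcd's data dir so its
# writes never hit disk.
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
- role: control-plane
  extraMounts:
  - hostPath: /dev/shm/kind-etcd      # hypothetical path; must exist beforehand
    containerPath: /var/lib/etcd
```

Usage would be roughly: `mkdir -p /dev/shm/kind-etcd && kind create cluster --config kind-etcd-tmpfs.yaml`. The trade-off is that etcd state is lost on host reboot, which is usually fine for throwaway test clusters.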

stg-0 pushed a commit to stg-0/kind that referenced this issue Mar 4, 2024
@williscool

Just wanted to share that AWS publishes the maximum number of pods you can run on an EC2 instance for EKS, which you can roughly map to an amount of CPU and memory per pod:

https://github.com/awslabs/amazon-eks-ami/blob/main/templates/shared/runtime/eni-max-pods.txt

Found that here:

https://stackoverflow.com/questions/57970896/pod-limit-on-node-aws-eks

Not sure if it's helpful for finding the right numbers for kind, but figured why not share.
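For anyone who wants to poke at those numbers, a small sketch for looking up an instance type in a downloaded copy of that file (format assumption: one `<instance-type> <max-pods>` pair per line, with `#` comment lines):

```shell
# Hedged sketch: look up the EKS max-pods figure for an instance type in a
# local copy of eni-max-pods.txt. The file path argument is optional and
# defaults to a hypothetical ./eni-max-pods.txt.
max_pods() {
  awk -v t="$1" '!/^#/ && $1 == t { print $2 }' "${2:-eni-max-pods.txt}"
}
```

E.g. `max_pods m5.large` would print that instance type's published pod limit, which could then be divided into its vCPU/memory figures to estimate per-pod overhead.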
