Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change instance type for GKE build clusters #2438

Open
ameukam opened this issue Jul 30, 2021 · 13 comments
Open

Change instance type for GKE build clusters #2438

ameukam opened this issue Jul 30, 2021 · 13 comments
Labels
area/infra Infrastructure management, infrastructure design, code in infra/ lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra.
Milestone

Comments

@ameukam
Copy link
Member

ameukam commented Jul 30, 2021

Prow build clusters currently use instances of the n1 machine family. We could potentially migrate to the:

This is purely a financially suggestion. Change to machine type family will help in cost savings of resource consumption.

/milestone v1.24

@k8s-ci-robot k8s-ci-robot added this to the v1.24 milestone Jul 30, 2021
@ameukam ameukam added the area/infra Infrastructure management, infrastructure design, code in infra/ label Jul 30, 2021
@spiffxp
Copy link
Member

spiffxp commented Aug 6, 2021

I very much want to try out local SSD before we decide whether the e2 family makes sense.

My quick glance is N2's run us more than N1's, I'd be curious to understand what benefit we think we'd be getting with N2D's

@ameukam
Copy link
Member Author

ameukam commented Aug 6, 2021

I very much want to try out local SSD before we decide whether the e2 family makes sense.

My quick glance is N2's run us more than N1's, I'd be curious to understand what benefit we think we'd be getting with N2D's

N2D (specially the n2d-highmem-8) have memory than the n1 instance. This could help increase individual node density and reduce cluster size. But this is pure speculation. I need to get my calculations. I also want to explore all possible options to improve capacity planning.

@spiffxp
Copy link
Member

spiffxp commented Sep 2, 2021

/priority important-longterm

@k8s-ci-robot k8s-ci-robot added the priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. label Sep 2, 2021
@k8s-ci-robot k8s-ci-robot added sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra. and removed wg/k8s-infra labels Sep 29, 2021
@spiffxp
Copy link
Member

spiffxp commented Oct 1, 2021

An update from #1187. We are now running instances with local SSDs for ephemeral storage. It's not clear whether they've substantively improved our build performance, but since e2's don't support local SSD, I would likely not consider e2's unless there's a really compelling reason to.

@ameukam
Copy link
Member Author

ameukam commented Nov 3, 2021

After a quick review of the GCP compute pricing, change the instances type will be the opposite of what I expect. I still believe the pricing will evolve over the next years and allow us to make the change.

/remove-milestone
/priority backlog
/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/backlog Higher priority than priority/awaiting-more-evidence. labels Nov 3, 2021
@ameukam
Copy link
Member Author

ameukam commented Nov 3, 2021

/milestone clear

@k8s-ci-robot k8s-ci-robot removed this from the v1.24 milestone Nov 3, 2021
@ameukam
Copy link
Member Author

ameukam commented Aug 26, 2022

/milestone v1.26
/remove-lifecycle frozen
/remove-priority backlog

@k8s-ci-robot k8s-ci-robot removed lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/backlog Higher priority than priority/awaiting-more-evidence. labels Aug 26, 2022
@k8s-ci-robot k8s-ci-robot added this to the v1.26 milestone Aug 26, 2022
@k8s-triage-robot
Copy link

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 24, 2022
@ameukam
Copy link
Member Author

ameukam commented Nov 24, 2022

/remove-lifecycle stale
/milestone clear
/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Nov 24, 2022
@k8s-ci-robot k8s-ci-robot removed this from the v1.26 milestone Nov 24, 2022
@ameukam
Copy link
Member Author

ameukam commented Mar 3, 2024

We will use the N2 family once we are done with the migration prow.k8s.io

@ameukam ameukam changed the title Change instance type for build clusters Change instance type for GKE build clusters Jul 25, 2024
@ameukam
Copy link
Member Author

ameukam commented Dec 6, 2024

More instance families have been until now: https://cloud.google.com/compute/docs/general-purpose-machines#c4_series

We should probably go with the C4 machines:

  • Additional 10 GB in memory
  • More recent CPU
  • same CPU/memory ratio
  • support of local SSD

@ameukam
Copy link
Member Author

ameukam commented Dec 6, 2024

/milestone v1.33

@k8s-ci-robot k8s-ci-robot added this to the v1.33 milestone Dec 6, 2024
@ameukam ameukam moved this to Backlog in SIG K8S Infra Dec 6, 2024
@ameukam
Copy link
Member Author

ameukam commented Dec 6, 2024

/remove-lifecycle

@ameukam ameukam moved this from Backlog to In Progress in SIG K8S Infra Dec 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/infra Infrastructure management, infrastructure design, code in infra/ lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra.
Projects
Status: In Progress
Development

No branches or pull requests

4 participants