🪵 Add Karpenter to Airflow #4757

jacobwoffenden · 2024-07-24T15:01:31Z

User Story

As an Analytical Platform engineer
I want to offer various Karpenter profiles to Airflow customers
So that they have flexibility in what instance type and size they use

Value / Purpose

The current Airflow compute types are:

general (t3.xlarge 4 vCPU, 16 GiB)
high-memory (r6i.8xlarge 32 vCPU, 256 GiB)

general is an EKS MNG with a desired size of 3 but can scale to 10

high-memory is an EKS MNG with a desired size of 0 but can scale to 1, if multiple high-memory jobs are scheduled they will share this instance, and if non can fit, presumably they will fail

In Analytical Platform Compute, we have Karpenter deployed and can create node pools that can provide different types of compute

Useful Contacts

@jacobwoffenden

User Types

No response

Hypothesis

If we create imports that create resources (requests and limits, tolerations and affinity
Then users can use them to utilise our compute platform better

Proposal

We already have general-on-demand, general-spot and gpu-on-demand, so we should enhance this offering, and have a bigger mix. Also introduce resource limits and requests profiles so Karpenter can find an appropriate instance

Additional Information

No response

Definition of Done

Karpenter node pools diversified
Import for resource limits and requests created
User docs have been updated
Another team member has reviewed
Tests are green

The text was updated successfully, but these errors were encountered:

jacobwoffenden added the story label Jul 24, 2024

github-project-automation bot added this to Analytical Platform Jul 24, 2024

github-project-automation bot moved this to 👀 TODO in Analytical Platform Jul 24, 2024

jacobwoffenden mentioned this issue Jul 24, 2024

🪵 Update APC Karpenter GPU profiles ministryofjustice/modernisation-platform-environments#7275

Closed

github-actions bot mentioned this issue Aug 1, 2024

Monthly issue metrics report #4823

Closed

jacobwoffenden mentioned this issue Aug 1, 2024

🪵 Add GPU Spot NodePool to APC ministryofjustice/modernisation-platform-environments#7350

Merged

jacobwoffenden self-assigned this Aug 1, 2024

jacobwoffenden closed this as completed Aug 20, 2024

github-project-automation bot moved this from 🚀 In Progress to 🎉 Done in Analytical Platform Aug 20, 2024

jacobwoffenden reopened this Aug 20, 2024

github-project-automation bot moved this from 🎉 Done to 🚀 In Progress in Analytical Platform Aug 20, 2024

jacobwoffenden moved this from 🚀 In Progress to 🛂 In Review in Analytical Platform Aug 20, 2024

jacobwoffenden closed this as completed Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🪵 Add Karpenter to Airflow #4757

🪵 Add Karpenter to Airflow #4757

jacobwoffenden commented Jul 24, 2024 •

edited

Loading

🪵 Add Karpenter to Airflow #4757

🪵 Add Karpenter to Airflow #4757

Comments

jacobwoffenden commented Jul 24, 2024 • edited Loading

User Story

Value / Purpose

Useful Contacts

User Types

Hypothesis

Proposal

Additional Information

Definition of Done

jacobwoffenden commented Jul 24, 2024 •

edited

Loading