Use Elastic Inference for GPU-powered clusters #643

Closed
arun-gupta opened this issue Mar 18, 2019 · 10 comments
Labels
area/managed-nodegroup EKS Managed Nodegroups area/nodegroup priority/backlog Not staffed at the moment. Help wanted. stale

Comments

@arun-gupta

Why do you want this feature?

Elastic Inference adds GPU acceleration to any Amazon EC2 instance for faster inference at much lower cost (up to 75% savings). EKS customers could save substantially by provisioning a cluster with Elastic Inference instead of full GPU instances.

Not sure if this even makes sense.

What feature/behavior/change do you want?

A GPU-powered cluster is created as:

   eksctl create cluster eks-gpu \
      --node-type=p3.8xlarge \
      --nodes=2 \
      --region=us-west-2 \
      --ssh-access \
      --ssh-public-key=arun-us-west2

A CLI using Elastic Inference could be:

   eksctl create cluster eks-gpu \
      --nodes=2 \
      --region=us-west-2 \
      --vpc-elastic-inference \
      --ssh-public-key=arun-us-west2
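
To make the proposal easier to discuss without touching a real cluster, here is a minimal dry-run sketch that only prints the command line being proposed. It assumes nothing about eksctl internals: `--vpc-elastic-inference` is the flag suggested in this issue and is hypothetical, not an existing eksctl option, and `build_ei_cluster_cmd` is an illustrative helper name.

```shell
#!/bin/sh
# Dry-run sketch: prints the proposed command instead of running it.
# NOTE: --vpc-elastic-inference is the flag proposed in this issue; it is
# hypothetical and not an existing eksctl option.
build_ei_cluster_cmd() {
  name="$1"    # cluster name
  region="$2"  # AWS region
  key="$3"     # name of the SSH public key
  printf '%s\n' "eksctl create cluster ${name} --nodes=2 --region=${region} --vpc-elastic-inference --ssh-public-key=${key}"
}

build_ei_cluster_cmd eks-gpu us-west-2 arun-us-west2
```

Compared with the GPU example above, the only differences are that `--node-type=p3.8xlarge` is dropped and the proposed flag is added.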

Do not hesitate, when appropriate, to share the exact commands or API you would like, and/or to share a diagram (e.g.: asciiflow.com): "a picture is worth a thousand words".

@errordeveloper
Contributor

Could you please share the docs for CloudFormation along with reference docs?

@aflc

aflc commented Aug 1, 2019

@mumoshu
Contributor

mumoshu commented Aug 1, 2019

Do we have anything other than the ones listed below to be addressed for this?

@rbrigden

+1, this is critical for economical inference workloads

@github-actions
Contributor

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

@github-actions github-actions bot added the stale label Jan 27, 2021
@michaelbeaumont michaelbeaumont added area/managed-nodegroup EKS Managed Nodegroups area/nodegroup priority/backlog Not staffed at the moment. Help wanted. and removed stale labels Jan 28, 2021
@github-actions
Contributor

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

@github-actions github-actions bot added the stale label Feb 28, 2021
@github-actions
Contributor

github-actions bot commented Mar 5, 2021

This issue was closed because it has been stalled for 5 days with no activity.

@github-actions github-actions bot closed this as completed Mar 5, 2021
@deliahu

deliahu commented Apr 7, 2021

@michaelbeaumont I was just wondering if this feature is still on the roadmap, as supporting Elastic Inference is a common request among cortex users (we use eksctl to help configure users' clusters).

@RobertLucian

@michaelbeaumont we're very interested in this feature. Any updates on this one?

@deliahu

deliahu commented Jun 8, 2021

@mumoshu @errordeveloper I was just wondering if there were any updates on this. Thank you!


8 participants