Use Elastic Inference for GPU-powered clusters #643
Comments
Could you please share the docs for CloudFormation along with reference docs?
+1
Do we have anything other than the ones listed below to be addressed for this?
+1 this is critical for economic inference workloads
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
This issue was closed because it has been stalled for 5 days with no activity.
@michaelbeaumont I was just wondering if this feature is still on the roadmap, as supporting Elastic Inference is a common request among cortex users (we use
@michaelbeaumont we're very interested in this feature. Any updates on this one?
@mumoshu @errordeveloper I was just wondering if there were any updates on this. Thank you!
Why do you want this feature?
Elastic Inference adds GPU acceleration to any Amazon EC2 instance for faster inference at much lower cost (up to 75% savings). EKS customers could save substantially by provisioning a cluster with Elastic Inference accelerators instead of full GPU instances.
Not sure if this even makes sense.
What feature/behavior/change do you want?
A GPU-powered cluster is created as:
A CLI using Elastic Inference could be:
Do not hesitate, when appropriate, to share the exact commands or API you would like, and/or to share a diagram (e.g.: asciiflow.com): "a picture is worth a thousand words".