Standardize on container images instead of machine images #146
Comments
If future versions of CRIU support loading/restoring the internal state of CUDA devices, standardizing on containers could have the additional advantage of allowing us to perform live migrations between spot instances. The advantages versus data-based checkpoints aren't especially obvious, but it looks like the next cool technology. 😄 See also #176 (comment)
Blockers for containerized
Moved from the experimental XPD library.
do you mean allow |
This issue predates the |
🪓
terraform {
  required_providers {
    iterative = { source = "iterative/iterative" }
  }
}
provider "iterative" {}
resource "iterative_task" "example" {
  cloud   = "aws"
  image   = "nvidia"
  machine = "g4dn.xlarge"
  script  = <<-END
    #!/usr/bin/env -S sh -c 'docker run --rm -iv "$(realpath "$0"):/file" alpine sh /file'
    cat /etc/alpine-release
  END
}
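The shebang in the snippet above works because `env -S` splits its single argument into a full command line, and `sh -c CMD ARG0` binds the script's own path to `$0`, which the wrapper then mounts into the container. Below is a minimal sketch of that mechanism, with plain `sh` standing in for `docker run` so it can be tried without Docker. The file name `task.sh` and the echoed messages are made up for illustration.

```shell
#!/bin/sh
# Sketch only: emulates how the shebang hands the script path to a wrapper.
# Plain `sh` stands in for `docker run ... alpine sh /file` (an assumption
# made so the example runs without a container runtime).
set -eu

workdir="$(mktemp -d)"
trap 'rm -rf "$workdir"' EXIT

# The body of the task script: what would run inside the container.
printf '%s\n' 'echo "hello from the inner script"' > "$workdir/task.sh"

# `sh -c CMD ARG0` sets $0 to ARG0. With the real shebang, the kernel appends
# the script path after the `env -S ...` arguments, so it lands in $0 too.
output="$(sh -c 'echo "wrapper got: $(basename "$0")"; sh "$0"' "$workdir/task.sh")"

echo "$output"
```

Running it should print `wrapper got: task.sh` followed by `hello from the inner script`. In the real task, the second step is the `docker run` invocation, which executes the same file inside the `alpine` container.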
Follow-up of #127 (comment)
It would be nice to offer a single, consistent environment on every platform, and we can ship default container images as part of the machine images to avoid pull delays and costs.
This proposal assumes that:
I'm inclined to think that those assumptions are pretty reasonable, and a good compromise between impact and effort on our side.