-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
guidance for updates on jobs with multiple volumes #8058
Comments
Example of using HCL2 to accomplish this is in #9449 |
Not that I can think of currently. You'd need to have a pool of volumes ready-to-go and unclaimed, because Nomad can't create volumes on its own (see #8212). And even then, you'd need to be able to interpolate the volume |
I'm late to the party, but there's a slightly related scenario that I at least wanted to document: using Using Nomad Pack, we are able to create multiple instances of the same job with a slightly different name so they can be independently managed to protect the state's integrity.
When this is rendered using Nomad Pack, I can now control which job I am interacting with by specifying the So I'm late to the party and this uses an additional tool, but it's one more potential way to deal with the complexities of stateful applications. |
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues. |
In #8045 we encountered a common use case that was a bit underdesigned in the CSI/host volumes implementation.
Suppose you want to run a multiple-node database (ex. MySQL with replication). You need to specify a specific volume to mount to each task. Your options are:
update
stanza, which means when you update the job you can have all 3 nodes stop at the same time and end up taking an outage.count = n
. But then you need some kind of consistent metadata for the tasks to be interpolated in the volume ID. You can't useNOMAD_ALLOC_INDEX
because there's no guarantee that(job_version=1, alloc_index=1)
is going to be stopped and have its volumes released before(job_version=2, alloc_index=1)
.We need to provide some documentation guidance for operators on this, and/or come up with a better way to orchestrate tracking resources across job versions.
The text was updated successfully, but these errors were encountered: