guidance for updates on jobs with multiple volumes #8058

Closed
tgross opened this issue May 27, 2020 · 5 comments · Fixed by #9449
Labels
stage/needs-discussion · theme/docs · theme/storage
Comments


tgross commented May 27, 2020

In #8045 we encountered a common use case that was a bit underdesigned in the CSI/host volumes implementation.

Suppose you want to run a multi-node database (e.g. MySQL with replication), where each task needs a specific volume mounted to it. Your options are:

  • Run each DB node as its own job, orchestrate the deployment outside of Nomad, and lose features like constraints.
  • Run each DB node as its own task group within a single job (sketched below), but lose the ability to control concurrent updates via the update stanza, which means that when you update the job all 3 nodes can stop at the same time and you end up taking an outage.
  • Run each DB node as an allocation of a single task group with count = n. But then you need some kind of consistent per-allocation metadata to interpolate into the volume ID. You can't use NOMAD_ALLOC_INDEX because there's no guarantee that (job_version=1, alloc_index=1) will be stopped and have its volumes released before (job_version=2, alloc_index=1) starts.

We need to provide some documentation guidance for operators on this, and/or come up with a better way to track resources across job versions.
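For illustration, the second option above looks something like the following sketch: one task group per DB node, each requesting its own host volume. The job name, volume names, image, and mount path are illustrative assumptions rather than anything from #8045.

job "mysql" {
  datacenters = ["dc1"]

  group "mysql-0" {
    volume "data" {
      type   = "host"
      source = "mysql-data-0"   # host volume exposed by one client node
    }

    task "mysql" {
      driver = "docker"

      volume_mount {
        volume      = "data"
        destination = "/var/lib/mysql"
      }

      config {
        image = "mysql:8"
      }
    }
  }

  group "mysql-1" {
    volume "data" {
      type   = "host"
      source = "mysql-data-1"   # a different volume for the second node
    }

    task "mysql" {
      driver = "docker"

      volume_mount {
        volume      = "data"
        destination = "/var/lib/mysql"
      }

      config {
        image = "mysql:8"
      }
    }
  }

  # ...one group per DB node. Each group deploys its single allocation on its
  # own, so the update stanza can't serialize the rollout across the nodes.
}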


tgross commented Nov 25, 2020

An example of using HCL2 to accomplish this is in #9449.
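As a rough sketch (not the contents of #9449; names and values are illustrative assumptions), the HCL2 approach uses a variable plus a dynamic "group" block to stamp out one task group per volume:

variable "db_nodes" {
  type    = list(string)
  default = ["0", "1", "2"]
}

job "mysql" {
  datacenters = ["dc1"]

  # Generate one group per entry in var.db_nodes, each bound to its own volume.
  dynamic "group" {
    for_each = var.db_nodes
    labels   = ["mysql-${group.value}"]

    content {
      volume "data" {
        type   = "host"
        source = "mysql-data-${group.value}"
      }

      task "mysql" {
        driver = "docker"

        volume_mount {
          volume      = "data"
          destination = "/var/lib/mysql"
        }

        config {
          image = "mysql:8"
        }
      }
    }
  }
}

Because the variable is expanded when the job file is parsed, each generated group ends up with a fixed volume source that doesn't depend on NOMAD_ALLOC_INDEX or the job version.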

tgross closed this as completed Nov 30, 2020
tgross reopened this Nov 30, 2020

ayZagen commented Jan 7, 2021

@tgross I came here from #8045. The HCL2 example doesn't play well with the Nomad Autoscaler, since the autoscaler will increase the count. Is there any way or workaround to apply this requirement together with the autoscaler?


tgross commented Jan 8, 2021

Not that I can think of currently. You'd need to have a pool of volumes ready-to-go and unclaimed, because Nomad can't create volumes on its own (see #8212). And even then, you'd need to be able to interpolate the volume source, which is #7877. I've been working on a design this week for both #8212 and #7877, but I don't have a timeline on when that work would be complete.
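To make the "pool of volumes" idea concrete, here is a minimal sketch of pre-registering one CSI volume per expected allocation with nomad volume register. The plugin ID, external ID, and naming scheme are illustrative assumptions (an AWS EBS plugin and volumes created out of band), since Nomad won't create them for you.

# mysql-data-0.volume.hcl (register with: nomad volume register mysql-data-0.volume.hcl)
# Repeat for mysql-data-1, mysql-data-2, ... so unclaimed volumes are waiting
# before the job (or the autoscaler) asks for them.
id              = "mysql-data-0"
name            = "mysql-data-0"
type            = "csi"
external_id     = "vol-0123456789abcdef0"   # created out of band in the storage provider
plugin_id       = "aws-ebs0"
access_mode     = "single-node-writer"
attachment_mode = "file-system"

Even with such a pool, the job spec still has to name each volume explicitly until volume source interpolation (#7877) exists, which is what the HCL2 per-group layout above works around.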


joliver commented Jun 9, 2022

I'm late to the party, but there's a slightly related scenario that I at least wanted to document: using host_volume rather than the CSI plugin, it's not obvious how to assign a particular allocation (e.g. 0, 1, 2, etc.) to a particular node. With stateful workloads like MySQL, PostgreSQL, etc., pinning an allocation to a particular node makes operations easier and reduces the potential for changes to cause problems in a stateful application.

Using Nomad Pack, we are able to create multiple instances of the same job, each with a slightly different name, so they can be managed independently to protect the integrity of their state.

job "mysql-[[ .my.instance_name ]]" {
 ...
  group "default" {
    count = 1 # instances are individually controlled and thus there's only ever a single instance per job, but multiple jobs
    constraint {
      attribute = meta.tags
      operator  = "set_contains"
      value     = "mysql-[[ .my.instance_name ]]" # a constraint that ties the name of the job ("-a", "-b" or "-1", "-2" to a node)
    }
 ...
 }
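For that constraint to match, each client node needs the corresponding meta tag and, since this uses host_volume, the volume itself. A minimal sketch of the client agent configuration, with the tag value and data path as illustrative assumptions:

client {
  enabled = true

  # Comma-separated list consumed by the set_contains constraint above.
  meta {
    "tags" = "mysql-a"
  }

  # The host volume this instance's data lives on.
  host_volume "mysql-a" {
    path      = "/opt/mysql/data"
    read_only = false
  }
}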

When this is rendered using Nomad Pack, I can control which job I am interacting with by specifying the instance_name variable, which only ever manages that particular job instance.
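For example, each instance might be driven separately along these lines (a sketch: the pack name "my" is inferred from the [[ .my.instance_name ]] references above, and the exact variable addressing can differ between nomad-pack versions):

nomad-pack plan my --var instance_name=a
nomad-pack run  my --var instance_name=a
nomad-pack run  my --var instance_name=b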

Again, I'm late to the party and this uses an additional tool, but it's one more potential way to deal with the complexities of stateful applications.


github-actions bot commented Oct 8, 2022

I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

github-actions bot locked as resolved and limited conversation to collaborators Oct 8, 2022