
Feature request: support PreserveCounts for resource nomad_job #420

Open
Jamesits opened this issue Jan 12, 2024 · 5 comments

@Jamesits

Jamesits commented Jan 12, 2024

Currently, when I deploy a Nomad job that has a scaling {} block, the new job version is automatically scaled back to the count in the jobspec (which might be a very small value). This makes rolling upgrades of a busy job very dangerous.

Could the provider support the PreserveCounts argument during job deployment, so that Terraform-based job deployments become less painful?

(Related: hashicorp/nomad#9839 hashicorp/nomad#9843)
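
For illustration, something like the sketch below is what I have in mind. The preserve_counts attribute is hypothetical (the provider does not expose it today); the idea is that it would map to PreserveCounts on the job register API, i.e. the same behaviour as nomad job run -preserve-counts:

resource "nomad_job" "app" {
  jobspec = file("${path.module}/app.nomad.hcl")

  # Hypothetical attribute, not currently supported by the provider.
  # If it existed, it would register the job with PreserveCounts so that
  # group counts changed at runtime (e.g. by the autoscaler) are kept.
  preserve_counts = true
}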

@lgfa29
Contributor

lgfa29 commented Jan 29, 2024

Thanks for the suggestion @Jamesits!

I tried to quickly add this, but unfortunately it requires a bit more work than just adding a new flag. The first problem is that the job plan endpoint does not support PreserveCounts, so the Terraform plan will contain a diff even if preserve_counts = true. I opened hashicorp/nomad#19845 to track this.

But even with that change implemented, I suspect the provider itself will need changes. When computing the diff, the provider compares the value in Nomad with the jobspec directly:

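// The planned task_groups (including each group's count) are taken directly
// from the jobspec, so a count changed outside Terraform (e.g. by the
// autoscaler) always shows up as a diff: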
d.SetNew("task_groups", jobTaskGroupsRaw(job.TaskGroups))

I will try to get to these when I have some extra time.

@lattwood

Any thoughts on a workaround in the meantime?

@LKNSI

LKNSI commented Jul 20, 2024

@lgfa29 hey! We would love this as well. Our biggest concern is that during job updates, when using nomad-autoscaler, not preserving counts leads to a situation where we scale jobs down simply because of the job update itself.

To echo @lattwood's point, do you have any workarounds you could suggest in the meantime?

Additionally, we do not include a count attribute in our job files; we just let the Nomad Autoscaler handle it instead.

Sample stanza:

job "api_server_${template_job_name}" {
  datacenters = ["${template_datacenter}"]
  region = "${template_region}"

  spread {
    attribute = "$${node.datacenter}"
  }

  group "server" {
    scaling {
      enabled = true
      min = ${template_min_scaling_size}
      max = ${template_max_scaling_size}
      policy {
        cooldown = "3m"
        evaluation_interval = "1m"
        check "avg_cpu" {
          source = "nomad-apm"
          query = "avg_cpu-allocated"
          query_window = "3m"
          strategy "target-value" {
            # Test value, to force the autoscaler for this issue ^^
            target = 1
          }
        }
        check "avg_memory" {
          source = "nomad-apm"
          query = "avg_memory-allocated"
          query_window = "3m"
          strategy "target-value" {
            # Test value, to force the autoscaler for this issue ^^
            target = 1
          }
        }
      }
    }
    ...

@lgfa29
Contributor

lgfa29 commented Jul 22, 2024

Hi @lattwood and @LKNSI 👋

Apologies for the delay in getting back to you, but I no longer work at HashiCorp and I wasn't able to solve this issue before I left.

As a workaround (I haven't tested it myself), I wonder if the ignore_changes lifecycle rule could help.
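
Something along these lines, assuming the drift shows up on the computed task_groups attribute (untested, so treat it as a sketch rather than a confirmed fix):

resource "nomad_job" "app" {
  jobspec = file("${path.module}/app.nomad.hcl")

  lifecycle {
    # Untested idea: ignore drift in task_groups so that count changes made
    # outside Terraform (e.g. by the Nomad Autoscaler) do not produce a diff.
    ignore_changes = [task_groups]
  }
}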

@LKNSI

LKNSI commented Jul 22, 2024

@lgfa29 no worries, thanks for getting back to us on this count (no pun intended).
