
Invalid count argument #690

Open
tvvignesh opened this issue Sep 26, 2020 · 19 comments
Assignees
Labels
bug (Something isn't working), triaged (Scoped and ready for work), upstream (Work required on Terraform core or provider), v0.13 (Terraform v0.13 issue)

Comments

@tvvignesh

Hi. I tried setting up a GKE private cluster (safer-cluster-update-variant), and whenever I make a mistake (accidentally giving the wrong image name or machine type, and so on), the apply fails (not detected in plan), which is understandable.

But if I fix the issue and run plan and apply again, I get this:

(screenshot of the "Error: Invalid count argument" output omitted)

It has been discussed here:
hashicorp/terraform#21450
hashicorp/terraform#12570

but I am not able to work out how to get past this.

I do understand that it is happening because Terraform is not able to find any node pool in the cluster from which it can determine the count. If I go to .terraform/modules/global_gke.gke.gcloud_wait_for_cluster/main.tf I can see the block where the issue is:

resource "null_resource" "module_depends_on" {
  count = length(var.module_depends_on) > 0 ? 1 : 0

  triggers = {
    value = length(var.module_depends_on)
  }
}

Currently I am deleting the cluster every time and re-creating it from scratch. How can I avoid doing that and just fix this issue? Thanks.

@bharathkkb
Member

Hi @tvvignesh
Could you let me know which versions of Terraform and the GKE module you are running? If you are not on version = "~> 11.1.0", could you try that?

@tvvignesh
Author

@bharathkkb Hi. Running Terraform v0.13.2, GKE 1.18.6-gke.4801, and the latest version of this module.

@bharathkkb
Member

@tvvignesh could you provide your config? I can try to reproduce it.

@tvvignesh
Author

@bharathkkb Sure. This would be the relevant portion of the config. Kindly replace the vars where necessary.

module "global_gke" {
  source = "../modules/safer-cluster-update-variant"

  description                     = "My Cluster"
  project_id                      = module.global_enabled_google_apis.project_id
  name                            = var.global_cluster_name
  region                          = var.global_region
  network                         = module.global_vpc.network_name
  subnetwork                      = module.global_vpc.subnets_names[0]
  horizontal_pod_autoscaling      = true
  enable_vertical_pod_autoscaling = true
  enable_pod_security_policy      = true
  http_load_balancing             = true
  gce_pd_csi_driver               = true
  monitoring_service              = "none"
  logging_service                 = "none"
  release_channel                 = "RAPID"
  enable_shielded_nodes           = true
  ip_range_pods                   = module.global_vpc.subnets_secondary_ranges[0].*.range_name[0]
  ip_range_services               = module.global_vpc.subnets_secondary_ranges[0].*.range_name[1]
  master_authorized_networks = [{
    cidr_block   = "${module.global_bastion.ip_address}/32"
    display_name = "Global Bastion Host"
  }]
  grant_registry_access = true
  node_pools = [
    {
      name            = "global-pool-1"
      machine_type    = "n1-standard-4"
      min_count       = 1
      max_count       = 20
      local_ssd_count = 0
      disk_size_gb    = 30
      disk_type       = "pd-ssd"
      image_type      = "UBUNTU_CONTAINERD"
      auto_repair     = true
      auto_upgrade    = true
      node_metadata   = "GKE_METADATA_SERVER"
      service_account = "${var.global_sa}"
      preemptible     = false
    }
  ]
}

@halkyon

halkyon commented Sep 29, 2020

Having the exact same issue as well. Seems to only happen when you've made an error, and once it gets in this state you can't terraform destroy to start again either.

@morgante
Contributor

@halkyon What was the error you made? Reproducing this will likely require us to see your broken config.

@halkyon

halkyon commented Oct 1, 2020

@morgante Here you go: https://github.com/halkyon/gke-beta-private-cluster-example

Using Terraform v0.13.4.

Change the values in terraform.tfvars to your liking, and do a terraform init && terraform apply to provision a new cluster. Now change the machine_type value in the node_pools variable in terraform.tfvars to something invalid, then terraform apply again, and you'll get an error as expected. Now fix that back up to e2-medium or another valid type, and terraform apply again. This error is shown:

Error: Invalid count argument

  on .terraform/modules/gke.gcloud_delete_default_kube_dns_configmap/main.tf line 63, in resource "null_resource" "module_depends_on":
  63:   count = length(var.module_depends_on) > 0 ? 1 : 0

The "count" value depends on resource attributes that cannot be determined
until apply, so Terraform cannot predict how many instances will be created.
To work around this, use the -target argument to first apply only the
resources that the count depends on.

Hope this helps!
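
For reference, the tfvars edit that triggers the broken state is just something like this (pool name and attribute values are illustrative, not the exact contents of the repo above):

node_pools = [
  {
    name         = "default-node-pool"
    machine_type = "e2-medium-2" # invalid machine type: this apply fails
    min_count    = 1
    max_count    = 3
  }
]

Reverting machine_type to e2-medium and running terraform apply again is the step that produces the error above.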

@bharathkkb bharathkkb self-assigned this Oct 1, 2020
@mspinassi-medallia

Exact same issue here.

@bharathkkb
Member

I was able to reproduce this with 0.13.4; it seems that after the node pool config errors out, TF is unable to resolve [for pool in google_container_node_pool.pools : pool.name] at plan time. I'll do some more digging for a fix and see if it's just 0.13.4 or all of 0.13.x.

Works as intended with 0.12.29.
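
For context, the wiring inside the cluster module looks roughly like this (a trimmed paraphrase, not the exact source; the other arguments the gcloud helper module requires are omitted):

module "gcloud_wait_for_cluster" {
  source = "terraform-google-modules/gcloud/google"

  # After the failed node pool apply, these pool names are not known at plan
  # time, so the helper's count = length(var.module_depends_on) > 0 ? 1 : 0
  # cannot be evaluated during the refresh/plan, and 0.13.x errors out.
  module_depends_on = [for pool in google_container_node_pool.pools : pool.name]
}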

@bharathkkb bharathkkb added the v0.13 (Terraform v0.13 issue) label Oct 7, 2020
@innovia

innovia commented Oct 8, 2020

Any updates? This happens to me too with 0.13.4, after upgrading the node pool.

@innovia

innovia commented Oct 14, 2020

What's the status on this? It is easy to replicate: put in an invalid machine type, say e2-medium-2, and the apply fails; after that every run hits this error, as if the module is in a bad state.

Can you please fix this?

@morgante
Contributor

Since this is working in Terraform 0.12.x but not in 0.13.x, I'm inclined to believe this is a Terraform Core issue. We can attempt to work around it, but it's not a high priority when Core should be fixing it.

@bharathkkb
Member

I was able to create a minimal repro which works with 0.12.x but not with 0.13.4. I will open an issue in core.
A workaround seems to be to run terraform apply -refresh=false, which bypasses the initial refresh that throws this error.

@github-actions

github-actions bot commented Jan 5, 2021

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days

@github-actions github-actions bot added the Stale label Jan 5, 2021
@bharathkkb bharathkkb removed the Stale label Jan 7, 2021
@github-actions

github-actions bot commented Mar 8, 2021

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days

@github-actions github-actions bot added the Stale label Mar 8, 2021
@morgante morgante added the bug (Something isn't working) and triaged (Scoped and ready for work) labels and removed the Stale label Mar 8, 2021
@AlexBulankou
Contributor

I'm getting this issue with terraform:0.14.7, during the terraform plan phase:

Error: Invalid count argument

  on .terraform/modules/config_sync.configsync_operator.k8sop_manifest/main.tf line 57, in resource "random_id" "cache":
  57:   count = (! local.skip_download) ? 1 : 0

The "count" value depends on resource attributes that cannot be determined
until apply, so Terraform cannot predict how many instances will be created.
To work around this, use the -target argument to first apply only the
resources that the count depends on.

Any suggestions on the workaround?

@morgante
Contributor

@AlexBulankou Is this for a fresh deploy? What does your module configuration look like?

@AlexBulankou
Contributor

Yes, this is a fresh deploy: module config.

@AlexBulankou
Contributor

To follow up, the workaround for me was to go back to terraform:0.12.29.
