cadvisor metrics and problematic identification #17365
Background
Prometheus generally attaches useful labels based on the target it is scraping. For example, when scraping `frontend`, Prometheus reaches out to `frontend`, knows certain things about `frontend` (e.g. service name, pod, instance, etc.), and can attach those labels onto metrics exported by `frontend`.
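For illustration, here is a hypothetical sample as it might appear after a direct scrape of `frontend` (the metric name and label values are made up for this sketch, not taken from a real deployment):

```
# Hypothetical sample scraped directly from frontend: the job/instance/namespace/pod
# labels are attached by Prometheus based on the scrape target, so the series is
# trivially attributable to the frontend service.
src_frontend_example_metric{job="sourcegraph-frontend",instance="10.0.0.12:6060",namespace="default",pod="sourcegraph-frontend-7d5b8c-xk2lp"} 42
```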
cAdvisor exports metrics for other containers. So despite all cAdvisor metrics looking like they are coming from cAdvisor, they are actually for other containers.
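By contrast, here is a hypothetical cAdvisor-exported sample for that same `frontend` container (again, names and values are assumptions for illustration). The target labels point at cAdvisor itself; the workload the metric actually describes only shows up in the `name` label and the `container_label_*` labels:

```
# Hypothetical cAdvisor sample: job/instance identify the cAdvisor target that was
# scraped, not the workload. The workload is only identifiable via the name label and
# the container_label_io_kubernetes_* labels derived from io.kubernetes.* labels.
container_memory_usage_bytes{job="cadvisor",instance="node-1:48080",name="k8s_frontend_sourcegraph-frontend-7d5b8c-xk2lp_default_2a9c0f",container_label_io_kubernetes_container_name="frontend",container_label_io_kubernetes_pod_namespace="default"} 1.2e+08
```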
On some systems cAdvisor generates a `name` label that is some combination of fields that might make a target monitored by cAdvisor unique. This worked alright for a while, right up until we discovered it didn't: https://github.com/sourcegraph/sourcegraph/issues/17069, https://github.com/sourcegraph/sourcegraph/issues/17072

Problem
We need an effective way to identify Sourcegraph services inside cAdvisor metrics. The current strategy is outlined in our docs, but the approach is not perfect:
- `prometheus-to-*` exporters get picked up by the `prometheus` matcher (a sketch of this over-matching follows below).
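As a rough illustration (the regex here is made up, not the exact matcher from our docs), a name-based selector for Prometheus containers also matches the `prometheus-to-*` exporter containers:

```
# Illustrative PromQL only: a loose "prometheus" name matcher also selects the
# prometheus-to-* exporter containers, since their names contain "prometheus" as well.
sum by (name) (rate(container_cpu_usage_seconds_total{name=~".*prometheus.*"}[5m]))
```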
We are a bit hamstrung in that whatever name-labelling convention we have must also:

- work with the container labels cAdvisor surfaces in Kubernetes (`io.kubernetes.container.name`, `io.kubernetes.pod.name`, `io.kubernetes.pod.namespace`, `io.kubernetes.pod.uid`)
- account for the same service carrying different names across deployments (`sourcegraph-frontend` vs `frontend` vs `sourcegraph-frontend-internal`)
Docker-compose doesn't seem to be as much of an issue, since docker-compose deployments generally run on machines that do nothing other than serve Sourcegraph, but in Kubernetes there's no telling what else is on the nodes.
One approach attempted was to filter on namespace via `metric_relabel_configs` in k8s (sourcegraph/deploy-sourcegraph#1644); a sketch of that rule follows below. But due to the various ways a namespace can be applied when deploying, there's no guarantee that a customer won't forget to update the Prometheus relabel rule - more discussion in sourcegraph/deploy-sourcegraph#1578. Regardless, this is the change currently applied in Cloud and k8s.sgdev.org to resolve our issue.
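A minimal sketch of that namespace filter, assuming the Sourcegraph namespace is `ns-sourcegraph` (the namespace value, and whether this matches the exact rule merged in sourcegraph/deploy-sourcegraph#1644, are assumptions here):

```yaml
metric_relabel_configs:
  # Keep only cAdvisor container metrics whose pod namespace is the Sourcegraph one.
  # The ^$ alternative keeps series that have no namespace label at all, so metrics
  # that don't come from cAdvisor are not dropped by accident.
  - source_labels: [container_label_io_kubernetes_pod_namespace]
    regex: ^$|ns-sourcegraph # must match the namespace the deployment actually uses
    action: keep
```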
Possible solutions

⭐ Dhall saves us

Provide the metric relabel config in generated configuration (example above and in sourcegraph/deploy-sourcegraph#1644) based on …

Monitoring generator generates exceptions

The monitoring generator could generate regex excludes on a case-by-case basis. This is really gnarly, and probably not a great idea, since we could just as easily accidentally break things on another deployment method (which is what happened in https://github.com/sourcegraph/sourcegraph/issues/17072).

Namespace checking in …

Comments

Heads up @davejrt @ggilmore @dan-mckean @caugustus-sourcegraph @StephanX - the "team/delivery" label was applied to this issue.