
[stable/prometheus] support scraping containers by default #22899

Closed
pohly opened this issue Jun 22, 2020 · 2 comments
Labels
lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Comments


pohly commented Jun 22, 2020

Is your feature request related to a problem? Please describe.

As described in prometheus/prometheus#3756, it is sometimes useful to scrape not just one port per pod, but one port per container.

The upstream discussion has been going on for a while, and it is not clear how it will be resolved. Upstream has disabled the example config for pods (the one using the prometheus.io/scrape annotation), but the Helm chart still ships it and documents it as the way to get pods discovered on Kubernetes.

Describe the solution you'd like

A possible solution would be to scrape each container that has a container port with a fixed name, in pods that carry suitable annotations. Here's an implementation of that approach:

extraScrapeConfigs: |
  # Example scrape config for containers inside a pod.
  #
  # The relabeling allows the actual container scrape endpoints to be configured via the
  # following annotations:
  #
  # * `prometheus.io/scrape`: Only scrape containers in pods where this annotation
  #   has the value `containers`.
  # * `prometheus.io/path`: If the metrics path is not `/metrics`, override this.
  #   The path has to be the same for all containers because it can only be set
  #   per pod (containers and container ports have no annotations).
  #
  # The port for each container is specified with a container port that must have the name "metrics".

  - job_name: 'kubernetes-containers'

    kubernetes_sd_configs:
      - role: pod

    relabel_configs:
      - source_labels: [__meta_kubernetes_pod_annotation_prometheus_io_scrape]
        action: keep
        regex: containers
      - source_labels: [__meta_kubernetes_pod_container_port_name]
        action: keep
        regex: metrics
      - source_labels: [__meta_kubernetes_pod_annotation_prometheus_io_path]
        action: replace
        target_label: __metrics_path__
        regex: (.+)
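      # Combine the pod IP (already in __address__) with the declared container
      # port number to form the final scrape address.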
      - source_labels: [__address__, __meta_kubernetes_pod_container_port_number]
        action: replace
        regex: ([^:]+)(?::\d+)?;(\d+)
        replacement: $1:$2
        target_label: __address__
      - source_labels: [__meta_kubernetes_namespace]
        action: replace
        target_label: kubernetes_namespace
      - source_labels: [__meta_kubernetes_pod_name]
        action: replace
        target_label: kubernetes_pod_name
      - source_labels: [__meta_kubernetes_pod_container_name]
        action: replace
        target_label: kubernetes_pod_container_name
      - source_labels: [__meta_kubernetes_pod_node_name]
        action: replace
        target_label: kubernetes_pod_node_name # This is useful for daemon sets where pods typically have no meaningful name.
      - action: labelmap
        regex: __meta_kubernetes_pod_label_(.+)

This solution is limited to using the same path for all containers, but IMHO that is acceptable.
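
For illustration, a pod would opt in by setting the annotation and giving each container a port named "metrics". A minimal sketch of such a manifest (the names, images, and port numbers are hypothetical):

apiVersion: v1
kind: Pod
metadata:
  name: example-app                    # hypothetical
  annotations:
    prometheus.io/scrape: "containers"
    prometheus.io/path: "/metrics"     # optional, /metrics is the default anyway
spec:
  containers:
  - name: app
    image: example/app:1.0             # hypothetical
    ports:
    - name: metrics                    # the fixed name the job keys on
      containerPort: 8080
  - name: sidecar
    image: example/sidecar:1.0         # hypothetical
    ports:
    - name: metrics
      containerPort: 9090

The job above would then scrape <pod IP>:8080 and <pod IP>:9090 as two separate targets, each labeled with its container name.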

As it stands above, one can add that job with -f:

helm install prometheus stable/prometheus --set alertmanager.persistentVolume.enabled=false,server.persistentVolume.enabled=false -f kubernetes-container-ports.yaml

I think it would be useful to add it to the default values.yaml, including a "slow" variant.
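
For the slow variant, a sketch along the lines of the chart's existing "slow" jobs (the 5-minute interval, the scrape-slow annotation name, and the job name are assumptions here, not taken from the chart):

  # Hypothetical "slow" variant: identical discovery and relabeling, but scraped
  # every 5 minutes for containers that cannot handle the default interval.
  - job_name: 'kubernetes-containers-slow'

    scrape_interval: 5m
    scrape_timeout: 30s

    kubernetes_sd_configs:
      - role: pod

    relabel_configs:
      - source_labels: [__meta_kubernetes_pod_annotation_prometheus_io_scrape_slow]
        action: keep
        regex: containers
      # ... followed by the same relabeling rules as in the job above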

Describe alternatives you've considered

I tried the approach of merging metrics data with https://github.com/rebuy-de/exporter-merger, but ran into issues when different containers report the same metrics.

Users could be instructed to reconfigure Prometheus manually instead of changing the defaults, but given that nothing in the job above is specific to an application, that seems sub-optimal.

pohly added a commit to pohly/pmem-CSI that referenced this issue Jun 22, 2020
The Prometheus integration uses the approach from
helm/charts#22899:
- HTTP for metrics endpoints
- container ports tell Prometheus which containers to scrape
  and how

CSI call counts are the same as in the sidecars. This enables
correlating statistics and ensures that node-local operations are also
captured; kubelet doesn't seem to be instrumented.

TODO:
- documentation, including Prometheus+Grafana example
- decide whether this should be enabled by default and
  whether it should be configurable (also in the operator)

stale bot commented Jul 25, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Any further update will cause the issue/pull request to no longer be considered stale. Thank you for your contributions.

stale bot added the lifecycle/stale label on Jul 25, 2020

stale bot commented Aug 8, 2020

This issue is being automatically closed due to inactivity.

stale bot closed this as completed on Aug 8, 2020
pohly added further commits to pohly/pmem-CSI referencing this issue on Aug 21, Aug 26, Aug 27, Sep 10, and Sep 11, 2020, each carrying revisions of the same commit message; the final version follows.
pohly added a commit to pohly/pmem-CSI that referenced this issue Sep 18, 2020
The Prometheus integration uses the approach from
helm/charts#22899:
- HTTP for metrics endpoints
- container ports tell Prometheus which containers to scrape
  and how

CSI call counts are the same as in the sidecars. This enables
correlating statistics and ensures that node-local operations are also
captured; kubelet doesn't seem to be instrumented.

Internal communication is instrumented the same way.

PMEM usage statistics are recorded by querying the active device
manager each time the metrics data gets scraped.

Metrics support is enabled unconditionally in the operator and in all
pre-generated deployment files, and uses plain HTTP for the sake of
simplicity. The rationale is that the data itself is not sensitive and
should always be readily available if desired.