-
Notifications
You must be signed in to change notification settings - Fork 1.1k
unable to fetch node CPU metrics #2605
Comments
It doesn't. Possibly this is an error reported by the Kubernetes API client, either from something it is attempting, or something attempted by an intermediary, or the API server itself. |
@squaremo it looks like our use of the discovery API triggers the metrics queries, this is indeed very troubling since a busy cluster with many metrics will add a huge delay to Flux sync. |
How .. even ... I don't ... |
I think we should be using |
@texasbobs can you please test |
What version is that based on, @stefanprodan ? We have not tested beyond 1.13.2 and this cluster is in a prod environment. |
It's based on master. If you have a dev cluster, can you please scale prometheus to zero and test it out? |
I confirmed that it fails in my test cluster on 1.13.2 and 1.15.0. The above image was able to function correctly. |
@texasbobs thanks a lot for testing it, I'm also running my own tests with prometheus-adapter and metrics-server. |
We are using Flux 1.13.2 and in some clusters, it no longer clones repos to be synced. The error seems to indicate it is unable to execute a query on the Prometheus api.
How can we prevent this from stopping the sync?
Why does Flux care about CPU metrics at all?
It looked like there was a PR to ignore these types of errors back in 1.12.2. #2009
The text was updated successfully, but these errors were encountered: