-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Workflow controller metrics server only starts after leadership election is won #10037
Comments
@avestuk This is expected behavior. Can you provide your usecase for why you need all other controllers need to start the metrics server? |
@sarabala1979 |
@sarabala1979 Sorry for the delayed response. The issue with this is that if Prometheus is scraping both pods for metrics an alert is gets raised that it cannot get metrics from one pod. There's no way for Proemtheus to know which pod is the leader. We noticed that the behavior changed between v3.2.6 and v3.4.3. I'd expect that the metrics server gets started even if no metrics are served. That way Prometheus can scrape all pods without errors. |
I can see related issue been raised already, but closed without any fix #8283 |
|
Pre-requisites
:latest
What happened/what you expected to happen?
I'd expect all workflow controllers to begin serving metrics but only get proper metrics from the leader.
Version
v3.4.3
Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter a workflows that uses private images.
Run the workflow controllers with leadership election enabled.
Logs from the workflow controller
Logs from in your workflow's wait container
n/a
It looks like the metrics server is started here: https://github.com/argoproj/argo-workflows/blob/master/workflow/controller/controller.go#L301
but it could (I think) be started earlier when the controller is instantiated here: https://github.com/argoproj/argo-workflows/blob/master/cmd/workflow-controller/main.go#LL112
The text was updated successfully, but these errors were encountered: