-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Publish node counts grouped by service and health-check status #607
Comments
@petemounce We don't have this information "easily" available. It would require doing an internal query and aggregation every interval to generate this. It may be simpler to write a small service that consumes the API to forward that information to statsd. |
So something that is maybe
? |
@petemounce You may be able to do it more easily with a blocking query against |
"with a blocking query" - do you mean "Blocking Queries" on https://consul.io/docs/agent/http.html ? Is there an example you could link me to of how I might do this? |
@petemounce yes, a "blocking query" in Consul refers to making a specific HTTP request that will wait until some change is made. You can read more about how to perform a blocking query here under the "Blocking Queries" section (right up near the top). |
@petemounce Here is an example. First, make a query to the health endpoint and note the
Make the same query, passing in the index and an (optional) time to wait.
The above query will "block" until the status of a health check changes, at which point a response very similar to the one above is received with the new list of services and a new Hope that helps! |
I guess a more simple overview API or a simple command in the command line tool would be preferable ? So people can plug the aggregated status checks into nagios or datadog ? |
Thank you for reporting and participating. Since there are ways to accomplish what you asked and given that it is not simple to build in consul, I am going to close this issue. |
Previously, we were checking that the federation is successful by only looking for the number of WAN consul members. However, sometimes those members could be unhealthy/not alive, which will cause the test to fail. This change improves federation verification and checks that all members are healthy from the perspective of both servers. It also checks that the ACL replication is running in case ACLs are used.
We use statsd, and were overjoyed to see that you support that OOTB.
I had a look, and couldn't find if consul (0.4.1) publishes counts of connected agents? I'd love to be able to see:
(where service has its name taken from the "name" element, and then sanitised to not contain any
.
characters)We terminate and recycle instances in our auto scaling groups often enough that most instances don't last more than a day. Being able to see current instance count would be fantastic for us.
The text was updated successfully, but these errors were encountered: