Skip to content

Core services healthchecks

Nicolas Degory edited this page Jul 16, 2017 · 11 revisions

This page lists the checks that should be run to validate the deployment of the core services

order service depends on context test command
1 elasticsearch single mode cluster health should be yellow curl -sf elasticsearch:9200/_cluster/health?wait_for_status=yellow&timeout=15s
1 cluster mode cluster health should be green curl -sf elasticsearch:9200/_cluster/health?wait_for_status=green&timeout=30s
2 etcd * endpoint health etcdctl --endpoints "http://etcd:2379" endpoint health | grep -qw healthy
3 nats * api availability curl -sf "nats:8222/subsz"
4 ampbeat nats, elasticsearch docker service logs amp_ampbeat 2>&1 grep -q "INFO ampbeat is running"
5 kibana elastisearch, ampbeat UI availability curl -sf "kibana:5601/app/kibana#/discover"
6 prometheus status URL curl -sf "prometheus:9090/status"
7 alertmanager prometheus metrics URL curl -sf "alertmanager:9093/metrics"
8 node_exporter prometheus metrics URL curl -sf "node-exporter:9100/metrics"
9 nats_exporter nats, prometheus metrics URL curl -sf "nats-exporter:7777/metrics"
10 haproxy_exporter haproxy, prometheus metrics URL curl -sf "haproxy-exporter:9101/metrics"
11 grafana org info curl -sf "grafana:3000/api/org"
12 proxy stats curl -sf "http://stats:stats@proxy:1936/haproxy\?stats\;csv"
13 amplifier nats, elasticsearch, etcd metrics URL curl -sf "amplifier:5100/metrics"
14 gateway amplifier docker service logs amp_gateway 2>&1 grep -q "gateway successfuly initialized" (sic)
15 agent nats ?
16 portal gateway home page curl -sf "http://portal/"