Use the Datadog custom check feature to verify automatically that backups complete properly and to raise an alert when a backup fails.
Reference: http://docs.datadoghq.com/guides/agent_checks/
-
Create a "backup_check.yaml" file in the "/etc/dd-agent/conf.d" directory
-
Configure "backup_check.yaml"
init_config:
instances:
  # This check is for monitoring and reporting metrics on cf backup
  #
  # WARNING: Ensure the user account running the Agent (typically dd-agent) has read
  # access to the monitored directory and files.
  #
  # Instances take the following parameters:
  # "directory" - string, the directory to monitor. Required
  # "name" - string, backup service name. Required
  - directory: "/root/backup/mariadb"
    name: "mariadb"
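Both parameters are marked Required, so a check would normally reject an instance that lacks either one. A minimal sketch of that validation follows; `REQUIRED_KEYS` and `validate_instance` are hypothetical names used for illustration and are not taken from the original "backup_check.py".

```python
# Hypothetical helper: validates one entry of the "instances" list after the
# Agent has parsed backup_check.yaml into a plain dict.
REQUIRED_KEYS = ("directory", "name")


def validate_instance(instance):
    """Return (directory, name) or raise ValueError if a required key is missing or empty."""
    missing = [key for key in REQUIRED_KEYS if not instance.get(key)]
    if missing:
        raise ValueError("backup_check instance is missing: %s" % ", ".join(missing))
    return instance["directory"], instance["name"]
```

For the instance above, `validate_instance({"directory": "/root/backup/mariadb", "name": "mariadb"})` returns the two values unchanged.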
-
The dd-agent user must have permission to access the backup folder
-
The backup folder should have 755 permissions
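One way to confirm the permissions is to inspect the folder as the user that runs the Agent. The sketch below is an illustration only: `describe_access` is a hypothetical helper, and it reports access for whichever user runs it, so it would need to be run as dd-agent (for example through sudo) to be meaningful.

```python
import os
import stat


def describe_access(directory):
    """Report the permission bits of a directory and whether the current user can list it."""
    try:
        mode = stat.S_IMODE(os.stat(directory).st_mode)
    except OSError:
        # os.stat fails when the path is missing or a parent directory
        # cannot be traversed by the current user.
        return {"mode": None, "readable": False}
    return {
        "mode": "%o" % mode,  # e.g. "755"
        # Listing a directory needs both read and execute (search) permission.
        "readable": os.access(directory, os.R_OK | os.X_OK),
    }


if __name__ == "__main__":
    print(describe_access("/root/backup/mariadb"))
```

A result with `"mode": None` usually means a parent directory blocks access even when the backup folder itself is set to 755.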
Copy the "backup_check.py" file to the "/etc/dd-agent/checks.d" folder
This script checks the following:
1. Whether backup is successful or not
2. File size
3. File status
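The original "backup_check.py" is not reproduced here, so the following is only a sketch of what such a check could look like under stated assumptions. It assumes the Agent v5 import path (`from checks import AgentCheck`), treats the newest file in the directory as the latest backup, and interprets "file status" as "non-empty and modified within the last 24 hours". The `inspect_backup` helper, the `max_age_seconds` threshold, and the `backup.file_size` metric name are all hypothetical; only `backup.state` comes from this guide.

```python
import os
import time

try:
    from checks import AgentCheck  # import path used by Agent v5 (dd-agent)
except ImportError:
    class AgentCheck(object):
        """Minimal stand-in so the sketch can be run outside the Agent."""
        OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

        def gauge(self, metric, value, tags=None):
            pass

        def service_check(self, check_name, status, tags=None, message=None):
            pass


def inspect_backup(directory, max_age_seconds=24 * 3600, now=None):
    """Return (status, size_in_bytes, message) for the newest file in directory."""
    now = time.time() if now is None else now
    try:
        paths = [os.path.join(directory, name) for name in os.listdir(directory)]
    except OSError as exc:
        return AgentCheck.CRITICAL, 0, "cannot read %s: %s" % (directory, exc)
    files = [path for path in paths if os.path.isfile(path)]
    if not files:
        return AgentCheck.CRITICAL, 0, "no backup files in %s" % directory
    newest = max(files, key=os.path.getmtime)
    size = os.path.getsize(newest)
    if size == 0:
        return AgentCheck.CRITICAL, size, "%s is empty" % newest
    if now - os.path.getmtime(newest) > max_age_seconds:
        return AgentCheck.CRITICAL, size, "%s is older than %d seconds" % (newest, max_age_seconds)
    return AgentCheck.OK, size, "%s looks healthy" % newest


class BackupCheck(AgentCheck):
    def check(self, instance):
        # "directory" and "name" are the two required instance parameters.
        status, size, message = inspect_backup(instance["directory"])
        tags = ["backup:%s" % instance["name"]]
        self.gauge("backup.file_size", size, tags=tags)  # hypothetical metric name
        self.service_check("backup.state", status, tags=tags, message=message)
```

Keeping the file inspection in a plain function separate from the `AgentCheck` subclass makes the logic easy to exercise by hand against a test directory before deploying it to "/etc/dd-agent/checks.d".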
Restart the Agent, then confirm that the check was loaded:
sudo /etc/init.d/datadog-agent restart
sudo /etc/init.d/datadog-agent info
-
Monitors > New Monitor > Custom Check
-
Configure Datadog
Pick a Custom Check: backup.state
Pick monitor scope: host:HOST_NAME
Set alert conditions (consecutive check runs in a status before the monitor switches to it)
Status: Critical => 1
Status: Warning => 1
Status: OK => 1
Say what's happening
{{#is_alert}}
The backup is not successful.
{{/is_alert}}
{{#is_alert_recovery}}
The backup is successful.
{{/is_alert_recovery}}
Notify: @Something
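The same monitor settings can also be captured as data, which helps when reviewing them or keeping them under version control. The sketch below only builds and prints a monitor definition; it does not call Datadog. The field layout follows the service check monitor format of the Datadog monitor API as best understood here, and the `last(2)` window, the monitor name, and the `build_backup_monitor` helper are assumptions made for illustration.

```python
import json


def build_backup_monitor(host, notify_handle):
    """Build a service check monitor definition as a plain dict (nothing is sent)."""
    message = "\n".join([
        "{{#is_alert}}",
        "The backup is not successful.",
        "{{/is_alert}}",
        "{{#is_alert_recovery}}",
        "The backup is successful.",
        "{{/is_alert_recovery}}",
        "Notify: %s" % notify_handle,
    ])
    return {
        "type": "service check",
        "name": "Backup state on %s" % host,  # hypothetical monitor name
        # Assumption: a window of 2 runs is enough to cover thresholds of 1.
        "query": '"backup.state".over("host:%s").last(2).count_by_status()' % host,
        "message": message,
        # Mirrors "Critical => 1", "Warning => 1", "OK => 1" from the steps above.
        "options": {"thresholds": {"critical": 1, "warning": 1, "ok": 1}},
    }


if __name__ == "__main__":
    print(json.dumps(build_backup_monitor("HOST_NAME", "@Something"), indent=2))
```

The printed JSON can be compared against the monitor created in the UI before being used anywhere else.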