This repository has been archived by the owner on Jun 6, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 549
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
7 changed files
with
51 additions
and
28 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -172,16 +172,16 @@ authentication: | |
|
||
# if you want to enable alert manager to send alert email, uncomment following lines and fill | ||
# the right values. | ||
alert-manager: | ||
receiver: [email protected] | ||
smtp_url: smtp.office365.com:587 | ||
smtp_from: [email protected] | ||
smtp_auth_username: [email protected] | ||
smtp_auth_password: password_for_alert_sender | ||
port: 9093 # this is optional, you should not write this if you do not want to change the port alert-manager is listening on | ||
alert-handler: | ||
port: 9095 # this is optional, you should not write this if you do not want to change the port alert-handler is listening on | ||
bearer_token: 'eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJ1c2VybmFtZSI6ImFkbWluIiwiYXBwbGljYXRpb24iOnRydWUsImlhdCI6MTU5NjE3Mzc3N30.bs97pLaVKMEX5u7fQoyWU_J5b-6qDLXp2sj59Y346yg' | ||
# alert-manager: | ||
# receiver: [email protected] | ||
# smtp_url: smtp.office365.com:587 | ||
# smtp_from: [email protected] | ||
# smtp_auth_username: [email protected] | ||
# smtp_auth_password: password_for_alert_sender | ||
# port: 9093 # this is optional, you should not write this if you do not want to change the port alert-manager is listening on | ||
# alert-handler: | ||
# port: 9095 # this is optional, you should not write this if you do not want to change the port alert-handler is listening on | ||
# bearer_token: 'your_application_token_for_rest_server' | ||
|
||
# uncomment following if you want to change customeize grafana | ||
# grafana: | ||
|
@@ -201,14 +201,14 @@ alert-manager: | |
|
||
|
||
# uncomment following if you want to change customeize prometheus | ||
prometheus: | ||
# prometheus: | ||
# port: 9091 | ||
# # How frequently to scrape targets | ||
# scrape_interval: 30 | ||
alert-handler: | ||
virtual_clusters: default|autosys | ||
percent: 0.3 | ||
time: 10s | ||
# low_gpu_utilization_job: | ||
# virtual_clusters: default | ||
# gpu_percent: 0.3 | ||
# last_time: 10m | ||
|
||
|
||
# uncomment following section if you want to customize the threshold of cleaner | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -10,9 +10,9 @@ | |
|
||
#### How to configure cluster section in service-configuration.yaml <a name="HT_Config"></a> | ||
|
||
Port configurations in this section is optional which default to 9093. All other config is mandatory. If not receiver is configured, the alert manager will not start. | ||
Port configuration in this section is optional which default to 9093. All other config is mandatory. If `receiver` is not configured, the alert manager will not start. | ||
|
||
To configure alert-manager to send out alert email, you should configure alert manager with receiver in your service-configuration like following: | ||
To configure alert-manager to send out alert emails and kill low-gpu-utilization jobs, you should configure alert manager with receiver and alert-handler in your service-configuration like following: | ||
```yaml | ||
alert-manager: | ||
receiver: [email protected] | ||
|
@@ -23,7 +23,7 @@ alert-manager: | |
port: 9093 # this is optional, you should not write this if you do not want to change the port alert-manager is listening on | ||
alert-handler: | ||
port: 9095 # this is optional, you should not write this if you do not want to change the port alert-handler is listening on | ||
bearer_token: 'application_token_for_rest_server' | ||
bearer_token: 'your_application_token_for_rest_server' | ||
``` | ||
In addition, if you deployed pai behind firewall, you should configure alert-manager with `use-pylon: True`, to make url from alert email public available. | ||
|
@@ -43,7 +43,7 @@ alert-manager: | |
port: 9093 | ||
alert-handler: | ||
port: 9095 | ||
bearer_token: 'application_token_for_rest_server' | ||
bearer_token: 'your_application_token_for_rest_server' | ||
configured: True | ||
host: master_ip | ||
url: "http://master_ip:9093" | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters