Skip to content
This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

refine kill low-efficiency-job-alert email templates #5384

Merged
merged 1 commit into from
Mar 26, 2021

Conversation

suiguoxin
Copy link
Member

After this change, the template can also be applied for killing long-running jobs

@suiguoxin suiguoxin requested a review from Binyang2014 March 19, 2021 06:35
@coveralls
Copy link

Coverage Status

Coverage remained the same at 34.02% when pulling 4d17f2c on suiguoxin:alert-temp into fe18fd9 on microsoft:master.

@suiguoxin suiguoxin mentioned this pull request Mar 19, 2021
14 tasks
@suiguoxin suiguoxin merged commit 44823d5 into microsoft:master Mar 26, 2021
@suiguoxin suiguoxin deleted the alert-temp branch March 26, 2021 00:51
@suiguoxin
Copy link
Member Author

suiguoxin commented Apr 2, 2021

Test cases:

[
    {
        "labels": {
            "alertname": "PAIJobGpuPercentLowerThan0_3For1h",
            "job_name": "test_user~test_job_0",
            "virtual_cluster": "default",
            "severity": "warn"
        },
        "generatorURL": "alert/script",
        "fingerprint": "6b8102e96c9e6b2a",
        "annotations": {
            "summary": "test_user~test_job_0 has a job gpu percent lower than 30% for 1 hour "
        }
    },
    {
    "labels": {
        "alertname": "PAIDebugJobRunningFor72h",
        "job_name": "test_user~test_job_1",
        "virtual_cluster": "debug",
        "severity": "warn"
    },
    "generatorURL": "alert/script",
    "fingerprint": "test_user~test_job_1 has been running for more than 72 hours in debug vc",
    "annotations": {
            "summary": "test_user~test_job_1 has been running for more than 72 hours in debug vc"
        }
    }
]

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants