-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
emit nomad.client.allocs.oom_killed
for raw_exec jobs
#19767
Comments
Hi @shantanugadgil! Thanks for opening this issue. I'll get it marked for roadmapping.
Yeah, almost certainly. The |
🥳 🎉 |
@tgross @jrasell I am still not getting the metric This is my telemetry setting on the agent:
I verified using a cmdline as mentioned here:
... to verify that |
when I run the same python script inside a docker, the metric is indeed reported correctly: task "oom" {
template {
data = file("oom_test.py")
destination = "local/oom_test.py"
left_delimiter = "{[("
right_delimiter = ")]}"
}
driver = "docker"
config {
image = "python:3-alpine"
auth_soft_fail = true
command = "python3"
args = ["-u", "/local/oom_test.py"]
}
resources {
cpu = 512
memory = 32
}
|
I'm going to lock this issue because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active issues. |
Nomad version
Nomad v1.7.3
BuildDate 2024-01-15T16:55:40Z
Revision 60ee328
Operating system and Environment details
Amazon Linux 2/2023
Issue
Nomad 1.7.x honors the resource parameter and kills a task (as expected) when the
memory
threshold is crossed.The metric
nomad.client.allocs.oom_killed
does NOT seem to be emitted for the task which was killed.Reproduction steps
run the specified job spec as a
raw_exec
task and observe it is killed, but no metrics emitted. (I use statsd)Expected Result
the metric should be emitted for such a killed task
Actual Result
no metric is emitted
Job file (if appropriate)
The relevant
task
section:possibly related to #19204 ?
The text was updated successfully, but these errors were encountered: