-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Thanos Ruler fails to evaluate all recording rules correctly #4924
Comments
Hello, could you please dump the goroutine stacks when this happens and upload them? |
Thanks for reporting! pprof profiles available at |
Isn't this issue similar to #4772 ? |
I think the .pprof file I uploaded there is actually more relevant in this issue as the behaviour I got in my Thanos Ruler is more comparable to this issue as restarting those pods helped only for some dozens of minutes before failing again to process records. Link to the .pprof: #4772 (comment) |
I have the same issue on v0.24.0 |
Hello 👋 Looks like there was no activity on this issue for the last two months. |
Closing for now as promised, let us know if you need this to be reopened! 🤗 |
I am currently running Thanos v0.24.0-rc.0.
Few recording rules are evaluated fine while few recording rules seems to be last evaluated 2 days back. This happens very frequently. Restarting the pod fixes the issue temporarily. This issue is reproducible on v0.23.0 as well.
Here is the args passed to the ruler:
The text was updated successfully, but these errors were encountered: