-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Receiver crashes occasionally with panic #5626
Comments
Maybe somehow related to tenant prunning? 🤔 |
Yes, possibly. I will look into it this week. |
I think you might be right @GiedriusS, we had a tenant on this receiver that was pruned. It then ended up on the receiver again which caused shipper to try registering metrics again for the tenant. We use promauto everywhere, and that relies on CC @bwplotka or @kakkoyun you might have some ideas as client_golang maintainers. |
I might be out of context here, the way to work around this, either introduce a tenant label to wrap around the register. Or unregister the metrics when the tenant goes away. |
Wrapping the register is a cool idea! Maybe this already exists in Thanos? thanos/pkg/receive/multitsdb.go Lines 629 to 646 in e54da92
|
Thanos, Prometheus and Golang version used:
Thanos version main-2022-08-08-66700939
What happened:
A Thanos receiver can crash occasionally with the following error
What you expected to happen:
Registering duplicate metric collectors should either be handled gracefully, or should not happen at all.
How to reproduce it (as minimally and precisely as possible):
Haven't been able to reproduce this locally yet.
The text was updated successfully, but these errors were encountered: