-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Thanos-Compact halting with error 'err="compaction: group 0@17832940732465865817: overlapping sources detected' #6389
Comments
How is compactor configured? It looks like a historical compactor had different configuration than new one because the first block still has the replica label |
I have 5 sharded compactors running with this config:
With Btw this is the exact same config I deploy on my other clusters (different clusters environments for different buckets) and only this one is giving these errors. |
My guess is that it ran before without the "thanos_ruler_replica" dedup label? since the |
It's possible, It's been a while so I don't remember to be honest. Either way, since then what I did is remove the chunks directly from my bucket (this is staging env so I don't care that much about the data itself, I just wanted to understand how I can solve this in case it comes up in prod) However, eventually, thanos-compact halts again on a new set of chunks. Then I delete them, thanos-compact starts running until it halts again, etc It's been like this for the past 2 weeks, and I have deleted a bunch of chunks, I thought there were a bunch of corrupted chunks or something like that but I'm starting to think it will be forever like this and I can't understand why |
Facing the same issue due to which compaction is getting halted Our current thanos compact config
We have 6 shards and 2 replicas for prometheus |
Hi there, faced this issue too
tried to add no compact, also found this issue about compact marks being ignored #5603 |
Thanos, Prometheus and Golang version used:
Thanos: 0.31.0
Prometheus: 2.44.0
Object Storage Provider:
Google (GCS)
What happened:
Thanos-compact pod halted shortly after starting with error:
What you expected to happen:
I believe the thanos compact should be able to deduplicate or merge blocks if that is the case? Not really sure
Full logs to relevant components:
I inspected the bucket and the blocks ID in the error message are the following:
The text was updated successfully, but these errors were encountered: