-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
compactor: adds downsample duration histogram #4552
compactor: adds downsample duration histogram #4552
Conversation
Signed-off-by: Anugrah Vijay <[email protected]>
Based on @yeya24 's feedback on the previous PR I will be:
|
Signed-off-by: Anugrah Vijay <[email protected]>
With respect to the histogram buckets, we could get rid of the initial few buckets and make it |
Signed-off-by: Anugrah Vijay <[email protected]>
Anything you still want to update? @vanugrah If not please mark this pr ready for review. 😄 |
Bucket ranges? Currently it mimics the compactors geometric series: |
Latter sounds better. |
Signed-off-by: Anugrah Vijay <[email protected]>
Signed-off-by: Anugrah Vijay <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks for the contribution.
😄 Thanks for the speedy feedback! |
So for flakey e2e tests do we just rerun? |
I restarted it and it is passing now |
Wooot 🎉 Thanks for your help @yeya24 . More PR's soon to come. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
With --wait
doesn't this mean that we will have infinitely different groups? 🤔 Perhaps a histogram over all durations would be more suitable i.e. drop the group
label?
Hello! 👋 Based on the DefaultGroupKey it looks like the two parts of the label are the resolution and a hash of the meta labels. So if I'm not mistaken the cardinality would increase by num_buckets x 3 (1 for each resolution) for each unique set of meta labels, i.e. each unique source of blocks. So theoretically yes there is no upper bound on groups given that it correlates with things uploading blocks. But in practice, I haven't heard of more than a few thousand block sources for a single bucket, even on the larger side. Though, this is definitely just a heuristic. Another point to consider is that the other downsample metrics already use the group key as a label: https://github.com/thanos-io/thanos/blob/main/cmd/thanos/downsample.go#L46-L53 And we already initialize the metric for all groups: So we're not necessarily introducing anything new to risk unbounded cardinality and we'd be able to align labels with the existing metrics. Still wrapping my head around the compactor so let me know if I've misunderstood anything 😄 |
@GiedriusS Any thoughts? |
@yeya24 Your review was marked stale because I had to resolve a merge conflict on the Changelog. Care to re-review? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Let's merge it.
Signed-off-by: Anugrah Vijay [email protected]
Changes
Verification