
Thanos Store should always prefer higher resolution data when possible #1170

Closed
GiedriusS opened this issue May 22, 2019 · 14 comments

@GiedriusS
Member

Currently, bucket.getFor() first tries to select the most downsampled data that the query allows. Only after that is done does it fall back to higher-resolution data for the remaining ranges. I think this should be changed because it's counter-intuitive.

Rationale
Range vectors would work more predictably. In our case, we retain RAW data for 31 days, data downsampled to 5 minutes is retained for 91 days, and data downsampled to 1 hour is retained for 1.5 years. Note that retention policies are a completely separate thing from when we actually perform downsampling. Those two things are defined here: https://github.com/improbable-eng/thanos/blob/master/cmd/thanos/downsample.go#L172 and https://github.com/improbable-eng/thanos/blob/master/cmd/thanos/downsample.go#L193 i.e. 5m blocks are carved out after the block becomes longer than 40 hours, and 1h blocks are carved out after they are longer than 10 days (240 hours).

Compaction happens at these block sizes: https://github.com/improbable-eng/thanos/blob/master/cmd/thanos/compact.go#L32. So, this practically means that after 2 days from the current moment you will only get 5m downsampled data, and after 14 days - only 1 hour downsampled data.
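To make that arithmetic concrete, here is a minimal sketch (not Thanos code; the thresholds are taken from the links above, and the helper name is hypothetical) of which resolutions still exist for data of a given age under the retention flags described:

```go
package main

import (
	"fmt"
	"time"
)

// Thresholds from cmd/thanos/downsample.go as linked above: 5m blocks are
// carved out once a raw block spans more than 40h, and 1h blocks once a 5m
// block spans more than 10 days. We assume downsampling runs promptly.
const (
	downsampleRange0 = 40 * time.Hour      // raw -> 5m
	downsampleRange1 = 10 * 24 * time.Hour // 5m -> 1h
)

// resolutionsAt is a hypothetical helper: given the age of the data and the
// retentions from this issue (31d raw, 91d 5m, ~1.5y 1h), report which
// resolutions still exist at that age.
func resolutionsAt(age time.Duration) []string {
	day := 24 * time.Hour
	var res []string
	if age <= 31*day { // --retention.resolution-raw=31d
		res = append(res, "raw")
	}
	if age >= downsampleRange0 && age <= 91*day { // --retention.resolution-5m=91d
		res = append(res, "5m")
	}
	if age >= downsampleRange1 && age <= 547*day { // --retention.resolution-1h=1.5y
		res = append(res, "1h")
	}
	return res
}

func main() {
	day := 24 * time.Hour
	for _, age := range []time.Duration{1 * day, 3 * day, 20 * day, 100 * day} {
		fmt.Printf("%4dd old: %v\n", int(age/day), resolutionsAt(age))
	}
}
```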

Now, imagine writing a query like rate(http_request_total[5m]) (as suggested by Grafana's Explore UI) and executing it over a time range from now to now-20d. It might very well happen that you start seeing gaps after 14 days in your dashboard. In my opinion, a user would expect to see a nice, continuous graph with such a query even over a 20-day time range. It is counter-intuitive to see gaps due to "missing data" because of how bucket.getFor() selects data.
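For intuition, the gap comes down to simple arithmetic: rate() needs at least two samples inside its range selector, and a 5m selector over 1h-resolution data can contain at most one. A minimal sketch of that arithmetic (the helper is hypothetical, not Thanos code):

```go
package main

import (
	"fmt"
	"time"
)

// samplesInWindow is a hypothetical helper: with samples spaced `resolution`
// apart, a range selector of length `window` holds at most this many samples.
func samplesInWindow(window, resolution time.Duration) int {
	return int(window/resolution) + 1 // +1: a sample may sit right on the window edge
}

func main() {
	// rate() needs at least two samples in the window to produce a value.
	fmt.Println(samplesInWindow(5*time.Minute, time.Hour)) // 1 -> empty result, i.e. a gap
	fmt.Println(samplesInWindow(2*time.Hour, time.Hour))   // 3 -> rate() works
}
```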

The only caveat I see is higher RAM usage in Thanos Store, but that could be helped by the other, ongoing work on being able to select the minimum resolution.

Thoughts?

@bwplotka
Member

bwplotka commented May 30, 2019

Interesting question, let's discuss it. I think the current implementation is what we need because:

  • Downsampled data exists to make queries faster. If you allow lower resolution via maxResolution, you are saying you're happy with lower resolution, so why make it closer to raw if not needed?

  • So, this practically means that after 2 days from the current moment you will only get 5m downsampled data, and after 14 days - only 1 hour downsampled data.

No, because auto-downsampling decides based on the step. So you can query a 1h range from 14 days ago and still touch raw data (if you have it).
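For context, here is a rough sketch of how auto-downsampling could derive the resolution from the step; the step/5 ratio is my assumption about the querier's heuristic at the time of writing, so treat the numbers as illustrative:

```go
package main

import (
	"fmt"
	"time"
)

// maxSourceResolution sketches auto-downsampling: derive the coarsest
// acceptable source resolution from the query step. The /5 divisor is an
// assumption, not a guaranteed constant of the querier.
func maxSourceResolution(step time.Duration) time.Duration {
	return step / 5
}

func main() {
	// A query with a small step still selects raw data, even 14 days back:
	fmt.Println(maxSourceResolution(15 * time.Second)) // 3s -> only raw blocks qualify
	// A 20d dashboard with a 30m step happily uses 5m-downsampled blocks:
	fmt.Println(maxSourceResolution(30 * time.Minute)) // 6m0s -> 5m resolution qualifies
}
```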

  • Now, imagine writing a query like: rate(http_request_total[5m])

Let me stop you right here: this will never work with downsampled data, because your rate interval is too short. With 5m and 1h resolution you probably don't have enough samples to calculate a rate. You should use $__interval in Grafana instead.

  • It might very well happen that you start seeing gaps after 14 days in your dashboard

Sorry, I don't get why there would be a gap. Due to the problem described above?

@GiedriusS
Member Author

GiedriusS commented Jun 5, 2019

Sorry for deleting the last message; I had to recheck this with the latest version.

Essentially, let's use a picture because, as we know, one picture is worth a thousand words. For example, imagine we have:

--retention.resolution-raw=31d --retention.resolution-5m=31d --retention.resolution-1h=31d 

and --query.auto-downsampling enabled. In my experience, users do intuitively assume (I got caught by this too) that a query like this:

[Screenshot from 2019-06-05: the query as entered in Grafana]

would return a nice, continuous line, since we actually do have high-resolution data in remote object storage. However, it does not, due to the behavior of the function described above.

@bwplotka
Member

bwplotka commented Jun 5, 2019

Sure, but you don't want to fall back to raw data ONLY because someone made a mistake and queried 1 year of data with a 5m interval, right?

That's why downsampling exists: to avoid querying raw data when it's not needed. Maybe we should think about some warnings that would pop up in Grafana? We could deduce this mistake. (:

@bwplotka
Member

bwplotka commented Jun 5, 2019

But I get the problem now @GiedriusS, thanks for the picture. I wonder what others think about this problem.

cc @brancz @devnev @mjd95 ?

@GiedriusS
Member Author

I had an idea: if Grafana gets a Thanos integration and we gain the ability to select min/max resolution, this might get solved automatically without any change, because it would then become evident to the person running the query what's happening "under the hood".

@brancz
Member

brancz commented Jun 6, 2019

I'm trying to think of a way where we don't need an explicit integration, as Grafana wasn't the biggest fan of this and it also makes the "upgrade your Prometheus to Thanos" argument weaker. Maybe we can make use of Prometheus warnings in the query result? I believe Grafana renders those. For example, if resolution * 2 >= smallest range selector (as most of the time we want at least two samples within a selected range), then the warning can say that the graph is most likely distorted and that a range selector of at least X should be used for a continuous graph. That way there wouldn't need to be an explicit integration.
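A minimal sketch of that heuristic (function and parameter names are hypothetical; the condition would be evaluated per range selector in the query):

```go
package main

import (
	"fmt"
	"time"
)

// distortionWarning sketches @brancz's idea: warn when the selected resolution
// cannot place at least two samples inside the smallest range selector used
// by the query.
func distortionWarning(resolution, smallestRangeSelector time.Duration) (string, bool) {
	if 2*resolution >= smallestRangeSelector {
		return fmt.Sprintf(
			"resolution %s likely distorts this graph; use a range selector of at least %s",
			resolution, 2*resolution), true
	}
	return "", false
}

func main() {
	// The rate(...[5m]) example from this thread, against 1h-resolution data:
	if msg, ok := distortionWarning(time.Hour, 5*time.Minute); ok {
		fmt.Println(msg)
	}
}
```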

@bwplotka
Member

A warning would be nice enough, but that also gives us an additional argument to drive warnings support in Grafana closely. (:

@mjd95
Contributor

mjd95 commented Jun 13, 2019

I think maybe there are two issues here:
(1) How to handle rate(http_requests_total[5m]) when we don't have enough samples to make sense of that?
(2) When both low and high resolution data is available, which one should we prefer?

Now (2) feeds into (1), as preferring high-resolution data means we see the issue in (1) a bit less. But (1) is going to be an issue anyway, because a user could reasonably make a request for rate(http_requests_total[1m]) from 2 weeks ago when their company only keeps raw data for 1 week.

IMO:
(1) I like @brancz's suggestion.
(2) I think I'm starting to come around to preferring high-resolution data as well, as it makes things a bit simpler. (Potentially it even makes the maxSourceResolution and autoDownsampling parameters unnecessary?) This preference is mostly based on simplicity rather than use case, though. I am also a bit worried about the resource usage for the store.

@GiedriusS
Member Author

Another point to consider: you can write a "bare" query like foo_bar_seconds and select a time range such as now-35d:now-34d. Grafana would automatically select a pretty small min step, and you would, again, not see any graph, because data at that granularity simply does not exist on the Thanos side. I guess the bigger, more general problem here is that there's no integration between Thanos and Grafana, so Grafana doesn't know that the data could be downsampled.

@raffraffraff

^ That's the reason I can't go to production with Thanos yet.

Not sure it's the job of Grafana to be aware of problems showing now-35d:now-34d because of things that Thanos does with its data, but it would be nice if the Prometheus data source had a 'thanos' checkbox that opened up a few more configuration options (like 'retention'). The data source could then break up long queries into sub-queries based on data availability, while still using the 'preferred' resolution for the data when possible.

Handling this at the data source may get around another issue: if I use Trickster to cache data and query for now-31d:now, Thanos will request downsampled data and Trickster will cache the result. If I then extend the time range to now-32d:now, Trickster will only request the data it hasn't cached, which results in a shorter query: raw resolution, outside the retention period = no data.
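A minimal sketch of the sub-query splitting idea above, assuming the data source is told the per-resolution retentions (all names hypothetical):

```go
package main

import (
	"fmt"
	"time"
)

// subQuery is one slice of the original time range, paired with the finest
// resolution still retained for that slice. Offsets count back from now.
type subQuery struct {
	start, end time.Duration
	resolution string
}

// splitByRetention breaks [now-rangeBack, now] at the retention boundaries so
// each sub-query asks only for data that actually exists at that age.
func splitByRetention(rangeBack, rawRetention, fiveMinRetention time.Duration) []subQuery {
	var qs []subQuery
	if rangeBack > fiveMinRetention {
		qs = append(qs, subQuery{rangeBack, fiveMinRetention, "1h"})
		rangeBack = fiveMinRetention
	}
	if rangeBack > rawRetention {
		qs = append(qs, subQuery{rangeBack, rawRetention, "5m"})
		rangeBack = rawRetention
	}
	qs = append(qs, subQuery{rangeBack, 0, "raw"})
	return qs
}

func main() {
	day := 24 * time.Hour
	// A 100d dashboard with raw data kept for 31d and 5m data kept for 91d:
	for _, q := range splitByRetention(100*day, 31*day, 91*day) {
		fmt.Printf("now-%v .. now-%v -> %s\n", q.start, q.end, q.resolution)
	}
}
```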

@bwplotka
Member

bwplotka commented Oct 31, 2019

@raffraffraff, sorry for the delay (:

Not sure it's the job of Grafana to be aware of problems showing now-35d:now-34d because of things that Thanos does with its data,

Please see this doc to read about the downsampling use cases. The use case is NOT to zoom into old data, but rather to query long time ranges. We were not clear enough about this from the beginning, sorry for that.

Handling this at the data source may get around another issue: if I use Trickster to cache data, and query for now-31d:now,

Yup, that's why we plan to contribute more to Cortex Cache for now to allow better caching. It splits per day, so it won't have that issue. Right now it works fine against Thanos but won't really use downsampled data, as the step is most likely too low (we hit the issue we are discussing in this ticket).

Ideally, related to this issue, I think we should actually consider assuming that you have raw data all the time. We may even consider dropping the separate retentions per resolution, but let's discuss that in another ticket.

@stale

stale bot commented Jan 11, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the stale label Jan 11, 2020
stale bot closed this as completed Jan 18, 2020
@mapshen

mapshen commented Oct 7, 2020

I am wondering what the current plan is. The current behavior, which chooses the lowest resolution and falls back on higher resolutions, basically renders using Grafana to query historical data infeasible. We only use Store to query 5m- and 1h-resolution data, and when both resolutions exist we want to use the 5m one. Raw data is only served by Sidecar, as it would cause too much traffic/latency otherwise. It's hard for me to imagine a sensible user asking Store for 1 year of raw data.

After going over all the downsampling-related issues in #1705, I can't seem to find what we are trying to do to make Thanos prefer higher-resolution data when multiple resolutions exist. Did I miss anything?

@xiaozongyang

Hi @bwplotka

No, because auto-downsampling decides based on the step. So you can query a 1h range from 14 days ago and still touch raw data (if you have it).

We use Grafana to query Thanos, and our users set a fixed step in their dashboard configuration. So when they zoom out, the range increases but the step stays unchanged, and we can't serve the long-range query from downsampled data, while the raw data for that range can't be returned by Thanos Store in a single query.

So, what would be the best practice for both short-range (e.g. now-2d) and long-range (e.g. now-20d) queries? And can the step be adapted to the query range in Grafana?

Thanks for your suggestions. :)
