Is your proposal related to a problem?
#6416 mentioned some problems when fetching postings. One of them is that if the postings contain many series IDs but the intersection matches only a few series, we still need to fetch the postings for all matchers.
Describe the solution you'd like
We can try to cache the postings after expansion. This would save both the CPU time spent merging postings and the memory/bandwidth spent fetching postings from caches.
Say a query is `up{cluster="us", env="prod"}`. The three matchers match 100, 100K, and 500K series respectively, so we need to fetch 600K postings in total.
If we had the expanded postings cached, we would only need to fetch fewer than 100 postings.
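To illustrate why the expanded result is so much cheaper, here is a minimal sketch of the intersection step the store gateway has to perform after fetching the full postings list for every matcher. Caching the expanded (already-intersected) result skips both the large fetches and this merge. The function and toy data below are illustrative, not Thanos' actual implementation.

```go
package main

import "fmt"

// intersect returns the series IDs common to two sorted postings lists.
// This is the per-matcher merge we avoid when the expanded postings
// for the whole matcher set are already cached.
func intersect(a, b []uint64) []uint64 {
	var out []uint64
	i, j := 0, 0
	for i < len(a) && j < len(b) {
		switch {
		case a[i] < b[j]:
			i++
		case a[i] > b[j]:
			j++
		default:
			out = append(out, a[i])
			i++
			j++
		}
	}
	return out
}

func main() {
	// Toy version of the example: one matcher is selective, the others are not.
	up := []uint64{1, 2, 3}               // up: matches few series
	cluster := []uint64{1, 2, 3, 4, 5, 6} // cluster="us": matches many
	env := []uint64{2, 3, 5, 6, 7, 8}     // env="prod": matches many
	fmt.Println(intersect(intersect(up, cluster), env)) // prints [2 3]
}
```

The cached expanded postings would hold just the final `[2 3]`, which is what makes the "fetch < 100 postings" case possible.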
Describe alternatives you've considered
Add new methods to the index cache to store and fetch expanded postings. The data format can reuse what we have (diff+varint with streamed snappy), and the cache key will be `blockID + label matchers`.
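The proposed key and encoding could look roughly like the sketch below. The names (`expandedPostingsKey`, `encodePostings`) and the key layout are assumptions for illustration, not the actual Thanos API; the snappy framing from the existing postings cache is omitted here.

```go
package main

import (
	"encoding/binary"
	"fmt"
	"sort"
	"strings"
)

// expandedPostingsKey builds a cache key from the block ID plus a canonical
// form of the label matchers, so equivalent queries share one cache entry.
func expandedPostingsKey(blockID string, matchers []string) string {
	sorted := append([]string(nil), matchers...)
	sort.Strings(sorted) // canonical matcher order
	return blockID + ":" + strings.Join(sorted, ",")
}

// encodePostings delta-encodes sorted series IDs with varints — the same
// diff+varint scheme the existing postings cache entries use.
func encodePostings(ids []uint64) []byte {
	buf := make([]byte, 0, len(ids))
	var prev uint64
	for _, id := range ids {
		buf = binary.AppendUvarint(buf, id-prev)
		prev = id
	}
	return buf
}

func main() {
	// Hypothetical block ID, matchers from the example query.
	key := expandedPostingsKey("01ABCDEF", []string{`env="prod"`, `cluster="us"`, `__name__="up"`})
	fmt.Println(key)
	fmt.Println(encodePostings([]uint64{10, 12, 100})) // deltas 10, 2, 88 -> [10 2 88]
}
```

Because the expanded list is small and sorted, the deltas stay tiny and the varint encoding compresses well even before snappy is applied.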
@fpetkovski I think it is a good point; maybe we could experiment with that. From some real data I can see that the postings section is relatively small compared to the series section in the TSDB index, so we can probably try that.
But I still think it is valuable to cache expanded postings, since we then don't have to process label matchers and do the intersections.