-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Next steps with groupcache (galaxycache) #5037
Comments
Looking forward to the documentation. Can't wait to try it 👍 |
#5068 fixes [4] and [5]. |
Hello 👋 Looks like there was no activity on this issue for the last two months. |
Closing for now as promised, let us know if you need this to be reopened! 🤗 |
From our experience, groupcache is still slow due to the last point - we need to fetch multiple keys in one request. |
Hello 👋 Looks like there was no activity on this issue for the last two months. |
Hey @GiedriusS, I would like to pick a sub-issue of this open issue which is Make pull request that pushes changes (Thanos update) back into Cortex As I am a newbie to open source contribution & Thanos codebase, can someone share some docs to help me understand the requirement better? Any acceptance criteria for this feature? Or which flow/component in the Thanos codebase should I focus on? :) |
Amazing, thanks! So cc @yeya24 @GiedriusS does it sounds right? |
Thanks for picking this up! I am personally doubtful whether it is needed to pursue this further, I might even be leaning towards deleting groupcache before 1.0 😢 as far as I understand, the problem is that we are serializing access to a single |
Got it. I think it's nice piece to work, but best effort. Still something to consider if blockers are improved. What would be a benchmark that could reproduce the problem? What requirements we are looking for in terms of efficiency here? |
Hey @GiedriusS and @bwplotka ! These past days I was trying to tune our query path on our dev system to see what it can do, and backtracked our current bottleneck to our stores taking very long to return data to the queriers. In short order the problem was that we had no bucket_cache enabled at all, so everything was redownloaded from S3 every time it was needed. I did some research about the options available and came away with the following conclusions:
I have read your discussions about abandoning groupcache above in favor of Redis and I was a little confused considering the praise in the docs). The above discussion being over 2 years old now I wanted to ask if that still reflects your opinion or not. Looking at the options, all of these implementations seem to be broken to some degree, and I wonder which direction would be the best to go towards. So far I am sold on the GROUPCACHE approach and I wonder if it is a good idea to try to dust off the implementation, put in the enhancements that it were missing and come out with the best theoretical solution at the other end.
|
I don't think any decision has been made, but nobody has put in any effort into debugging why the groupcache approach is not as performing well. I don't think anyone would object to you putting in time to make improvements. |
After #4818 has been merged, I am writing down a list of TODO:
For implementing the last two, perhaps we could even switch to gRPC?
Maybe I have missed something?
The text was updated successfully, but these errors were encountered: