Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Thanos receive component umbrella issue #1093

Closed
9 tasks done
brancz opened this issue Apr 29, 2019 · 21 comments
Closed
9 tasks done

Thanos receive component umbrella issue #1093

brancz opened this issue Apr 29, 2019 · 21 comments

Comments

@brancz
Copy link
Member

brancz commented Apr 29, 2019

This is an umbrella issue to track the development of the accepted proposal for the Thanos receive component.

Let me know if any of this is unclear or needs more detail, should the design doc not have enough.

@bwplotka @domgreen @povilasv @metalmatze @squat @mxinden @paulfantom @s-urbaniak

@bwplotka
Copy link
Member

bwplotka commented Oct 8, 2019

I think it would be nice to also document the receive like we do for other commands here: https://thanos.io/getting-started.md/

@dawidmalina
Copy link

@bwplotka what is actual state of the receive implementation? I mean how far we are from having this ready for wider audience (at least for test purpose)?

@brancz
Copy link
Member Author

brancz commented Oct 8, 2019

@dawidmalina the issue is up to date. When everything is implemented I think it's ready for a wider audience to test. There will still be a number of features we are going to want to implement, but I think then all the things featured in the original design doc will be done.

@dawidmalina
Copy link

Good to hear that 👍 but I'm still suspicious about definition of the word "everything". According to the number of todo tasks attached to this ticket we have only one topic related to management of multiple tsdb instances. Is this the only one to consider receive implementation "ready for a wider audience" or we still rather stay away and wait until other features will be implemented?

@brancz
Copy link
Member Author

brancz commented Oct 9, 2019

Sorry I meant everything on this list. “Other features” refers to things that we may only discover as we use this more intensely :)

@bwplotka
Copy link
Member

bwplotka commented Dec 4, 2019

I think we should slowly consider making the receiver a non-experimental component.

I know multi-tsdb support is still in progress, plus docs (edited the description to add that point), but we are having feedback that receiver is quite stable for users already: https://cloud-native.slack.com/archives/CK5RSSC10/p1575471712121900 so worth to either reconsider making in stable now with some docs or to helpe with multi-tsdb work (:

@brancz
Copy link
Member Author

brancz commented Dec 4, 2019

I've been working on the multi-tsdb work for some time, and I don't think there is a way to introduce it completely disruptionless, so I would prefer to not remove the experimental naming/labeling until we have the multi-tsdb work and have it sink in for a little bit. That said, we do already use this in prod, so while there may be manual migrations necessary, it's unlikely to break very badly at this point.

@stale
Copy link

stale bot commented Jan 11, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Jan 11, 2020
@kakkoyun
Copy link
Member

Still in-progress.

@stale
Copy link

stale bot commented Feb 27, 2020

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions.

@stale stale bot added the stale label Feb 27, 2020
@bwplotka
Copy link
Member

bwplotka commented Feb 27, 2020 via email

@stale stale bot removed the stale label Feb 27, 2020
@stale
Copy link

stale bot commented Mar 28, 2020

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions.

@stale stale bot added the stale label Mar 28, 2020
@squat squat removed the stale label Mar 28, 2020
@brancz
Copy link
Member Author

brancz commented Mar 30, 2020

Actively working on finishing this :)

@dinesh4747
Copy link

@brancz - When can we expect Thanos Receiver docs and General availability just curious

@brancz
Copy link
Member Author

brancz commented Apr 15, 2020

The last feature feels fairly close to completion: #2012

After that we need to write comprehensive docs and I am expecting one configuration change in the hashring configuration, but I think it can be done in a backward compatible way.

No promises, but I don't think it will be too long.

@stale
Copy link

stale bot commented May 15, 2020

Hello 👋 Looks like there was no activity on this issue for last 30 days.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity for next week, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

@stale stale bot added the stale label May 15, 2020
@kakkoyun kakkoyun removed the stale label May 15, 2020
@bwplotka
Copy link
Member

I guess last part is missing only: Docs (:

@brancz
Copy link
Member Author

brancz commented May 18, 2020

We also know of some limitations on the way that series are routed through the hashring, causing every request to translate into N internal requests, which makes a hashring only scale to the number of requests a single node can handle (in terms of parsing, so CPU bound). We need to come up with a mechanism to split up requests across multiple hashrings other than just by tenant ID. I think we might be able to do that in a backward compatible way but we should think about it.

@kakkoyun
Copy link
Member

kakkoyun commented Jun 5, 2020

For the docs #1093

@kakkoyun
Copy link
Member

kakkoyun commented Jun 9, 2020

Initial component documentation is done. Also it's no longer marked as experimental. Shall we close this one?

@brancz
Copy link
Member Author

brancz commented Jun 9, 2020

Initial implementation is done. This component is being used a lot in various production environments already, so it's been battle tested in various ways already. That said there are two main things that I would like to get agreement on before taking it our of experimental state:

That said, I think this umbrella issue can be closed as all the initial features have been implemented. Thanks to everyone involved in this, it's been ~1.5years worth of work.

@brancz brancz closed this as completed Jun 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants