Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs: clarity on LIST operations #64

Open
udf2457 opened this issue Sep 25, 2023 · 3 comments
Open

docs: clarity on LIST operations #64

udf2457 opened this issue Sep 25, 2023 · 3 comments

Comments

@udf2457
Copy link

udf2457 commented Sep 25, 2023

The docs talk of use_update_marker to reduce S3 bills, but the obvious question here is what is the frequency of the LIST calls in the first place. The docs are silent on that ?

The different commercial S3 providers provide different levels of "free" LIST calls and so it would be nice to have an accurate guess at at which point people would exceed "free" and have a good idea of likely S3 bills.

@wojas
Copy link
Member

wojas commented Sep 27, 2023

This is the LS config item that controls how often we do a LIST if use_update_marker is not in use:

# Check the storage for new snapshots at this interval
#storage_poll_interval: 1s

The relationship between this config item and S3 LIST requests, and how this affects costs should indeed be documented.

If I remember correctly, with AWS and use_update_marker enabled, the estimated costs for a small deployment would be around 1 USD per month per node. Please correct me if you find this is incorrect.

For AWS a LIST command is 10x more expensive than a GET command, so use_update_marker reduces the poll costs tenfold, minus the occasional LIST forced by update_marker_force_list_interval, which defaults to 5m.

@udf2457
Copy link
Author

udf2457 commented Sep 27, 2023

Thanks @wojas . But I thought the docs said use_update_marker was only for use in test projects and not really for production use ? So I guess as you say, bumping up the storage_poll_interval would be the only way to go if that is the case.

@wojas
Copy link
Member

wojas commented Sep 27, 2023

It's fine for production use if you are not doing cross region or datacenter S3 bucket syncing.
And even if you use such syncing, the update_marker_force_list_interval will fixup any issues with updates missed due to the update marker and files not being synced in the right order. Worst case a change will be delayed by that interval.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants