-
Notifications
You must be signed in to change notification settings - Fork 563
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Storage] Replacing aws s3 sync with s5cmd with much faster speed #2291
[Storage] Replacing aws s3 sync with s5cmd with much faster speed #2291
Conversation
Thanks @landscapepainter. Just noting our offline discussion here - instead of replacing Additionally, looks like the |
Hey @landscapepainter - could we quickly check if the |
Just ran a benchmark with Sync on 1000 of 1MB files: Sync on 6 of 10GB files: Some ways to configure for the |
@romilbhardwaj |
Need to add a setup for |
This PR is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days. |
@romilbhardwaj @landscapepainter is this PR ready? |
This PR is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days. |
This PR was closed because it has been stalled for 10 days with no activity. |
This PR closes #2176
User requested to use
s5cmd sync
instead ofaws s3 sync
for our storage upload and downloads due to it's speed. Confirmed it has most of necessary functionalities with a much faster speed. Ass5cmd
does not have--include
option implemented yet, I will leavestorage.py/S3Store.get_file_sync_command
as it is.Tested (run the relevant ones):
bash format.sh
sky launch s3_storage.yaml
s3_storage.yaml
:pytest tests/test_smoke.py::TestStorageWithCredentials