Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add "sync" command #3

Closed
taraspos opened this issue Apr 24, 2018 · 10 comments · Fixed by #409
Closed

Add "sync" command #3

taraspos opened this issue Apr 24, 2018 · 10 comments · Fixed by #409

Comments

@taraspos
Copy link

Would be nice to be able to have a smart sync command like s3cmd sync or aws s3 sync, which will upload the files that changed and remove the deleted ones.

@igungor
Copy link
Member

igungor commented Apr 26, 2018

Thanks for the feature request. It'll be a great addition to the current feature set.

Even though it's not the same as "sync", you can synchronize two directories (update if a file differs) with s5cmd cp -u -s --parents s3://<bucket>/<folder> .. Please keep in mind that it's not going to remove an existing object/file.

@kjaercasper
Copy link

A aws cli sync feature would be rally useful, so object are remove at bucket. Any news on a sync feature?

@missedone
Copy link

looking forward for a faster version of s3 sync as well

@missedone
Copy link

just got this project https://github.com/sethkor/s3kor supports fast sync

@favoyang
Copy link

just got this project https://github.com/sethkor/s3kor supports fast sync

Here I quote from the s3kor repository page

Sync only compares mod timestamps and sizes as the only true way to get a MD5 or some other checksum of an S3 object is to download it comletley first. An objects ETag is not the MD5 of the object.

Then it's very similar to s5cmd cp -u -s, except it did one more scan to remove non-existed keys. Sounds like an acceptable workaround, better than nothing.

@timureh
Copy link

timureh commented Apr 30, 2021

Dear developers, do you plan to implement synchronization? For example like https://github.com/sethkor/s3kor (without checking MD5). I cannot use s3kor because I have a non-AWS endpoint. s5cmd there is just an opportunity to use the --endpoint-url flag. that's what i need. orient us please

@ilkinulas
Copy link
Member

Hi, sync is on our roadmap, please follow this issue to be notified about any updates regarding s5cmd sync implementation.

@vladimirsvicevicsrb
Copy link

Any update our new release version that includes sync operator? Thanks!

@sonmezonur
Copy link
Member

s5cmd v2.0.0-beta is released. It includes sync command. Happy to receive your feedback and contributions!

@eminegit
Copy link

Hi, sync command works great with few files. I am trying to sync 2 million files into s3 bucket which has more than 15 million files but I stops after 5 minutes with "killed" error message

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

10 participants