-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
aws s3 sync does not ignore storage-class of glacier by default #748
Comments
Yeah it seems like we should be handling the glacier storage class better. There's a few interesting edge cases we'll need to consider:
@kyleknap What are your thoughts? |
I tested some of these edge cases by creating some glacier objects of storage in a s3 bucket. Here is what I have found out and my thoughts:
|
Thanks for testing this and putting the results here. With regards to case #3, local->S3, I believe glacier files also are impacted by whether versioning is enabled. |
This is probably not the best place to ask this (sorry), but I'm not sure where else to ask… I'm trying to do a sync from 1 bucket to another (moving things to a different region). The bucket has both regular S3 stuff and glacier stuff. The S3 stuff seems to have synced as expected, but for all the Glacier stuff I'm getting the same error mentioned by @e0d at the start of this thread ("A client error (InvalidObjectState) occurred when calling the GetObject operation…"). Is the solution as simple as re-running the sync command with the --storage-class REDUCED_REDUNDANCY flag? Or would that try to move all my regular S3 stuff to Glacier? Or does the sync command not really work with the Glacier class storage at the moment? Apologies for what are probably ignorant questions – I'm a CLI newbie and this is the only page that turned up in a search that seemed remotely relevant. |
No worries. Currently, the sync command does not currently work with Glacier class storage objects. You are getting the error because Glacier objects stored in s3 cannot be downloaded/copied without restoring them to standard objects in s3. So no matter what to transfer the glacier objects you will need to restore the glacier objects to standard objects. Currently, the s3 commands do not have a feature for restoring glacier objects. One of the future goals is to better handle Glacier objects by being able to throw a flag to ignore them, throw warning about ignoring them, and/or being able to restore them to standard objects. |
+1 -- same issue with |
Guys, this is a pretty big issue when using S3/Glacier as a backup/restore solution. Any info on a patch? |
This (InvalidObjectState) error also causes the sync process to be extremely slow. We have millions of files in an S3 bucket and millions more in glacier. So although the sync process successfully completes in the end, it takes many hours more than it should because of millions of exceptions. Any ETA on the fix? |
+1 with this issue, I can't perform a |
+1 on this issue as well. |
+1, also being able to exclude or include Glacier object types for "ls" would be useful. |
+1, has anyone figured out a work around for this? |
+1 |
3 similar comments
+1 |
+1 |
+1 |
it would be so nice if the API provided an MD5 or any equivalent for files in the glacier. Is there any sort of unique ID like an object ID or date + object ID that could be used as a unique ID in order to allow s3 sync to work? |
With:
awscli==1.3.6
botocore==0.40.0
While the default param for aws sync for storage-class is documented as STANDARD, files with storage-class of GLACIER are matched when the file list is generated, but throw errors when download is attempted.
The text was updated successfully, but these errors were encountered: