-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
does az storage blob download take advantage of sparse files? It should.. #5872
Comments
@marvinthepa thanks for pointing this out, we are aware of the current limitation. The cli leverages the storage sdk: https://github.com/Azure/azure-storage-python |
I believe it does. @zezha-msft to confirm |
@seguler We have the sparse file optimization for upload but not for download. For download, we are currently treating all blob types as equal and simply downloading everything. Perhaps we can add this item to our backlog. |
@williexu I have added this item in our backlog. |
+1 on sparse enabled downloads using az cli - the only packaged alternative appears to be to use AzCopy, which is not portable to Linux/Mac agents. Want to use this in VSTS. Edit: AzCopy is available on Linux - it was in Azure Automation that it wasn't readily suitable to consume. |
This issue will be solved in the new AzCopy V10. The related issue is here. |
@zezha-msft - Please note AzCopy is NOT available under Azure Automation: Even if I manually downloaded AzCopy within the Azure Automation, Azure Automation didn't/doesn't allow running of arbitrary binaries. It would be preferable that |
Hi @iyerusad, thanks for the clarification! I see that it's still necessary to provide this functionality in the Python SDK. |
I've logged this item to be included in the next generation of the Storage SDK: https://github.com/Azure/azure-sdk-for-python/tree/master/sdk/storage/azure-storage-blob. It will be part of the GA criteria. |
Piping to gzip example. Normal piping works now. Remove "-f myfile.vhd" if there so you are doing something like below after grant disk access. az storage blob download --blob-url "mysasvhdfile-URI" | gzip > myhd.vhd.gz |
Either I cannot find out how to do it, or
az storage blob download
does not seem to take advantage of sparse files.Downloading a vm os disk snapshot of 30GB, containing only about 1.7GB of data takes more than 60 minutes while azcopy downloads the same file in 7 minutes.
The only reference to something like this in the documentation is
--max-connections
:However, setting max-connections to 1 does not seem to make a difference to using the default of 2.
Am I doing something wrong?
Environment summary
Tried in two different environments, mac os:
Also using docker image (on linux)
The text was updated successfully, but these errors were encountered: