-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Azure
Azure Blob Interface
#99
Conversation
Note: Do not use |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
||
self.container_client = None | ||
|
||
# TODO:: Figure this out, since azure by default has 15 workers |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Where is parallelism in this setup? Does the Azure CLI internally dispatch parallel requests? Or are you explicitly managing this? Or is this nested parallelism?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, azure internally dispatches parallel requests. It is specified by the self.max_concurrency parameter.
src_object_name, dst_file_path = str(src_object_name), str(dst_file_path) | ||
src_object_name = src_object_name if src_object_name[0] != "/" else src_object_name | ||
|
||
def _download_object_helper(offset, **kwargs): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How is offset used? I thought that the chunk interface you tested was slower than atomic objects?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Correct. It's not used.
Azure Blob storage interface.
Notes:
read_all()
faster thanstream.chunks()
AZURE_STORAGE_CONNECTION_STRING
read from OS environment.Test with command:
python skylark/test/test_azure_interface.py
TODO:
Azure
Azure interface #102)