-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Globus transfer optimization #214
Globus transfer optimization #214
Conversation
@lukaszlacinski @golaz I reviewed the code (including the first commit, also found in #185) as well I was able to. Overall, the code appears ok to me. I do have a few questions though:
From #171:
I see the
My test appears to be in the first case (more time to create a tarball than transfer it). I had 48 tars and 48 Globus transfers. I also want to confirm that we should not expect simultaneous Globus transfers. I didn't check the time stamps of all 48 Globus, but it appears there was a maximum of one transfer occurring at one time. (I suppose simultaneous transfers would only be useful in the second case anyway, when we're creating tars faster than we can transfer them). |
Commands that get tarballs from the storage check if Globus endpoints are activated kind of
My mistake (fixed). I removed Globus transfer optimization in this PR relies on a fact that two processes: 1) creating files (tarballs, database) and 2) transferring the files can be run in parallel as Producer-Consumer. Transferring files and extracting them in
My mistake (fixed). I added it to
That's correct with one exception. The last two batches of files (tarballs and the database) in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates, @lukaszlacinski. The unit tests pass with these changes. So, I'll merge this. Also, I will just close #185 since that commit is included in this one.
This PR addresses #171.