
storage: resumable uploads #298

Closed · stephenplusplus opened this issue Nov 11, 2014 · 11 comments · Fixed by #299
Labels: api: storage (Issues related to the Cloud Storage API.), 🚨 This issue needs some love., triage me (I really want to be triaged.)

Comments

@stephenplusplus (Contributor)

Big files take big time. Big time means big chances for failure. Currently, if something goes wrong during an upload at 95%, the user is forced to re-upload starting from 0%. Using the resumable upload capabilities of the storage API, we should be able to handle this for our users.

It's not exactly trivial, however. My current idea for a solution involves storing a configuration file on the user's/server's drive, where we can keep the tokens/state of in-progress uploads. Yeoman happens to have a tool that enables this for us: https://github.com/yeoman/configstore

Unless there are any objections, or more ideas for solutions, I will try to have a PR this week with this functionality.
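
A minimal sketch of that idea, assuming configstore's documented get/set/delete API; the key scheme and function names below are illustrative, not the eventual gcloud-node implementation:

var Configstore = require("configstore");

// One store for all of gcloud-node's pending-upload state.
var configStore = new Configstore("gcloud-node");

function stateKey(bucketName, filePath) {
  // Illustrative key scheme; "." is stripped because recent configstore
  // versions treat dotted keys as nested properties.
  return (bucketName + "/" + filePath).replace(/\./g, "-");
}

function saveUploadState(bucketName, filePath, resumableUri) {
  // Persist the resumable session URI so a later attempt can resume.
  configStore.set(stateKey(bucketName, filePath), { uri: resumableUri });
}

function getUploadState(bucketName, filePath) {
  // Returns undefined when there is nothing to resume.
  return configStore.get(stateKey(bucketName, filePath));
}

function clearUploadState(bucketName, filePath) {
  // Drop the entry once the upload completes or fails permanently.
  // (Very old configstore releases name this method .del.)
  configStore.delete(stateKey(bucketName, filePath));
}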

@ryanseys (Contributor)

What strategy will be taken if writing to disk is not possible due to permission issues or some other arbitrary reason?

@stephenplusplus (Contributor, Author)

I guess fall back to non-resumable behavior. What would make the most sense?


@ryanseys (Contributor)

Exposing the ID and other necessary information to me as a developer may be nice, so I can store it wherever I want (maybe in Datastore? ;) )


@silvolu (Contributor) commented Nov 11, 2014

The boto approach might be interesting: the library will automatically retry x times on failures; a tracker file is optional and can be used to save/read the state.

@stephenplusplus (Contributor, Author)

That boto approach makes sense. We should be able to do the same thing. If a tracker file is provided, we first read it to see if there's already data to use to resume the upload. If not, we store the state of the upload in it and are able to re-use it the next time through.

  • does the tracker file get written to if the upload is successful?
  • is there a standard format the tracker file is written in?

@silvolu (Contributor) commented Nov 11, 2014

> does the tracker file get written to if the upload is successful?

If the option is passed, the file is written; it's then removed if the upload succeeds or if any non-retryable exception is raised.

> is there a standard format the tracker file is written in?

Looks like it's just a single line in the tracker file containing the URI.
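
For illustration, a hedged sketch of that boto-style tracker file: a single line holding the resumable session URI. The read/write/cleanup details below are assumptions, not boto's exact behavior:

var fs = require("fs");

// Read a previously saved session URI, if a tracker file exists.
function readTrackerFile(trackerPath) {
  try {
    return fs.readFileSync(trackerPath, "utf8").trim() || null;
  } catch (e) {
    return null; // no tracker file yet: start a fresh upload
  }
}

// Save the session URI so an interrupted upload can be resumed later.
function writeTrackerFile(trackerPath, sessionUri) {
  fs.writeFileSync(trackerPath, sessionUri + "\n");
}

// Remove the tracker once the upload succeeds or fails permanently.
function removeTrackerFile(trackerPath) {
  try {
    fs.unlinkSync(trackerPath);
  } catch (e) {
    // already gone; nothing to do
  }
}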

@stephenplusplus (Contributor, Author)

SGTM 👍

@stephenplusplus (Contributor, Author)

After thinking more on this, I think it might be beneficial to still attempt to handle resumable uploads automatically, without a tracker file having to be provided:

// Streaming upload ("file" is a gcloud-node File object):
var fs = require("fs");

fs.createReadStream("./photos.zip")
  .pipe(file.createWriteStream())
  .on("error", callback);

// Convenience upload of a local file:
bucket.upload("./photos.zip", callback);
  • Start resumable upload
  • Store resumable ID in a config file (using yeoman/configstore)
  • If it succeeds, remove id from file
  • If it fails, emit error/callback with resumable ID / message that says "try again to resume"

Handling the streaming upload should be possible by counting bytes: https://cloud.google.com/storage/docs/concepts-techniques#unknownresumables

Edit: can't read. JSON API docs: https://cloud.google.com/storage/docs/json_api/v1/how-tos/upload#resumable
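
A hedged sketch of the status check those docs describe: an empty PUT to the resumable session URI with "Content-Range: bytes */TOTAL" returns a 308 plus a Range header saying how many bytes the server already has. Auth handling, retries, and the actual re-upload are omitted, and checkUploadStatus is an illustrative name:

var https = require("https");
var url = require("url");

// Ask the resumable session how much it has received so far.
// "sessionUri" is the URI returned when the resumable upload was initiated.
function checkUploadStatus(sessionUri, totalBytes, callback) {
  var parsed = url.parse(sessionUri);

  var req = https.request({
    method: "PUT",
    hostname: parsed.hostname,
    path: parsed.path,
    headers: {
      "Content-Length": 0,
      "Content-Range": "bytes */" + totalBytes
    }
  }, function (res) {
    res.resume(); // drain the response body

    if (res.statusCode === 308) {
      // e.g. "Range: bytes=0-42" means bytes 0..42 are stored,
      // so the upload should resume at byte 43.
      var range = res.headers.range;
      var lastByte = range ? parseInt(range.split("-")[1], 10) : -1;
      return callback(null, lastByte + 1);
    }

    if (res.statusCode === 200 || res.statusCode === 201) {
      return callback(null, totalBytes); // upload already completed
    }

    callback(new Error("Unexpected status: " + res.statusCode));
  });

  req.on("error", callback);
  req.end();
}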

@silvolu (Contributor) commented Nov 11, 2014

> After thinking more on this, I think it might be beneficial to still attempt to handle resumable uploads automatically, without a tracker file having to be provided:

Yeah, as mentioned the tracker file is optional.

> Store resumable ID in a config file (using yeoman/configstore)

How is this different from using a tracker file? Can't we just store the ID in memory and on failure emit it?

@stephenplusplus (Contributor, Author)

> Yeah, as mentioned the tracker file is optional ... How is this different from using a tracker file? Can't we just store the ID in memory and on failure emit it?

Defaulting to "we'll store it in a file for you" while also allowing "give us a file to store it in" would create a confusing situation where it's not clear when it makes sense to provide your own file. The file is useless outside of gcloud-node, as far as I know. After playing with this today, I can't even think of a good reason to return the resumable ID to the user; on its own, it's not very handy. If they don't use it with gcloud-node, they need to somehow assemble their own request (authenticate the headers, read the last byte sent, discard already-sent bytes from the retried upload stream, etc.). I'd rather just handle everything and, if the upload fails for any reason, emit an error as we currently do. The user will try again, and we'll automatically pick up where we left off, without them having to think about it.

The configstore is preferred because the user won't even need to know there is a config file.
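
A hedged sketch of the user experience being argued for here (the callback shape is assumed; the point is that retrying the identical call resumes automatically, with no resumable ID in user code):

function uploadWithRetry(bucket, filePath, done) {
  bucket.upload(filePath, function (err, file) {
    if (!err) {
      return done(null, file);
    }

    // The failed attempt left its resumable session URI in gcloud-node's
    // config store, so retrying the identical call picks up where the
    // previous attempt stopped.
    bucket.upload(filePath, done);
  });
}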

@silvolu (Contributor) commented Nov 11, 2014

SGTM.

@stephenplusplus stephenplusplus added this to the 1.0.0 milestone Nov 14, 2014
@jgeewax jgeewax added the api: storage Issues related to the Cloud Storage API. label Feb 2, 2015
@jgeewax jgeewax modified the milestones: 1.0.0, Storage Stable Feb 2, 2015
@yoshi-automation yoshi-automation added 🚨 This issue needs some love. triage me I really want to be triaged. labels Apr 6, 2020