Consider relying on eTags (or other headers) for service worker dependencies to check for updates #839

delapuente · 2016-02-23T11:04:30Z

For the sake of modularization and isolation. Could it be possible to improve the update algorithm to rely on the eTag / content-size / other headers sent by the server to decide when a service worker changed? Now we need to include a mark of change in the sw and this forces a lot of developments to postprocess service worker files.

annevk · 2016-02-23T11:50:36Z

I think @ehsan and I had a proposal for this at some point.

jakearchibald · 2016-02-23T12:09:19Z

If the request is made with If-Modified-Since or If-None-Match, and the response is 200, we could assume this is a new SW even if it's byte identical. Although this would cause unnecessary service worker updates on servers that send out Etag or Last-Modified but don't correctly return 304.

Else we just add Service-Worker-Force-Update: 1 or something.

delapuente · 2016-02-23T13:58:57Z

More than checking if the main sw file changed, I'm referring to its dependencies imported via importScripts().

jakearchibald · 2016-02-23T14:46:59Z

Yep, understand that, you'd be sending an etag (or last-modified) that represented the SW and its dependencies.

Maybe that's too hacky?

delapuente · 2016-02-23T19:58:49Z

I think that undermines the purpose of the ETag about digesting the content being served. It would do the trick but I think is better to come up with something more pluggable in the actual ecosystem not requiring to hack with semantics. ;)

jakearchibald · 2016-02-25T09:11:56Z

Yeah, I think you're right. Service-Worker-Force-Update: 1 would work.

jakearchibald · 2016-04-10T18:10:33Z

I think we should have a JS API for this, reg.update({force: true}) perhaps.

jakearchibald · 2016-04-12T22:40:30Z

F2F: agreement on serviceworker.skipWaiting() and reg.update({force: true}) - but may need to look at naming of force.

Lots of problems using etags for this, eg if CDN stop serving etags for some reasons.

Rough idea around some-header-name: value where value is a digest like etags.

jakearchibald · 2016-04-12T22:41:04Z

reg.update({force: true}) will leave you with the existing worker if the update fails, as usual.

annevk · 2016-04-14T10:09:38Z

Was anyone able to dig up the previous proposal from @ehsan and I? I can't find it.

collimarco · 2016-07-09T11:27:55Z

+1 for this

There should be a way to update the imported scripts even when the main code of the service worker is unchanged.

This is also a big issue for BaaS whose scripts are imported in the customers' service workers: I have more extensively described our situation in this blog post.

wanderview · 2016-07-09T13:23:33Z

@jakearchibald Can you remind me why we don't just check importScripts() for byte changes as well? I seem to recall it was since importScripts could be called async way back when, but now we've made that throw. Since we only allow sync importScripts maybe we can just include it in the byte check.

mfalken · 2016-07-11T00:36:44Z

See issue #639 for the importScripts discussion, which roughly concluded with "let's revisit this later".

jakearchibald · 2016-07-19T14:47:58Z

@wanderview

Can you remind me why we don't just check importScripts() for byte changes as well?

It could mean a lot of network requests just for an update check. I was worried about that. But also it means a change to a third party script would result in a whole new SW install, which sounds a bit… invasive.

I like reg.update({force: true}) as it give us a script-land way to do Chrome's "update on reload", but maybe for third party scripts we need to revisit the idea of providing access to the cache SW uses to store its scripts.

importScripts('//example.com/whatever.js');

// then later…
self.registration.getCache().then(cache => {
  cache.add('//example.com/whatever.js');
});

My big worry here is security. We'd only want the SW to be able to access this, as giving access from pages would turn a minor XSS into a huge problem.

wanderview · 2016-07-19T15:00:17Z

It could mean a lot of network requests just for an update check.

Sure, but this is a trade off sites can make for themselves. Do I want the convenience of structuring my code in separate files? Or do I want to minimize the number of SW script 304s my server has to send?

I could see smaller shops opting for developer convenience while huge sites compacting into a single file to minimize server load.

But also it means a change to a third party script would result in a whole new SW install, which sounds a bit… invasive.

What I'm hearing is that this is what developers expect to happen and they are surprised when it doesn't.

I like reg.update({force: true}) as it give us a script-land way to do Chrome's "update on reload"

I like this too, but I think its a different use case. From what I can tell developers want to compose their service worker scripts from decoupled sources and have things just update to the latest. Every step we add to get the updates to trigger creates friction and requires more tight coupling between modules.

At the very least it seems we could do this as an opt-in to register(). Something like update-checks-imports:true or something.

jakearchibald · 2016-07-19T15:09:54Z

update-checks-imports:true is interesting. Or something called during the install event to set which scripts should be checked.

If I show a "please refresh for latest version" message when there's an update waiting, I'm not sure I'd want that just because Google Analytics or whoever had updated something.

Then again, third party services like Analytics will live in Foreign Fetch instead.

collimarco · 2016-07-19T18:47:58Z

this is what developers expect to happen and they are surprised when it doesn't.

Exactly! IMHO The web is so great (and Javascript is becoming pervasive) for its simplicity. Please don't create a giant and complex monster: leave that to native apps. And please don't fall into premature optimization.

update-checks-imports:true is interesting

I agree. But I don't think that that choice should be left to the user. What if he denies? Then he would never get updates and the scripts will finally break.

I think that if you don't want all scripts to be refreshed by default you should leave to the developer the choice. For example: importScripts('//example.com/whatever.js', check-for-updates: true);. So the developer can prevent the refresh for large files (like Analytics) and allow small and more useful scripts to be refreshed automatically.

delapuente · 2016-07-26T10:50:57Z

update-checks-imports:true is interesting. Or something called during the install event to set which scripts should be checked.

I was thinking about this:

importScripts('//3rd-party.com/whatever.js', { forceUpdate: true }); // makes update algorithm to byte-to-byte compare the dependency.

This way, the developer can mark the dependencies causing updates, this trade off @wanderview was talking about is made explicit at the same time you can preserve file sanity via modularization.

We could make forceUpdate to be true by default (so we should change current spec but it will end with a more predictable API) or false (preserving current spec).

Furtheremore, if, at some time, the developer want a dependency to be part of the checking no longer, she simply flips the flag.

jakearchibald · 2016-07-27T18:10:30Z

Having the option in importScripts makes real sense. Nice. Unfortunately it doesn't cover JavaScript modules so well.

wanderview · 2016-07-27T18:15:06Z

But do we want to make a service worker specific importScripts() interface? This would not work if someone uses a library that then uses the existing importScripts() API internally. The top level import would get updated but not any of its dependencies.

I think it would be better to put this on the install or activate event personally. It can then automatically apply for all importScripts, modules, or other future added methods of bringing in script.

jakearchibald · 2016-07-27T18:52:05Z

Sure. The thing I liked about the importScripts solution is it was at a resource level, but I'm sure we can achieve that via another API.

pondering

If the API was something like alsoCheckTheByteEqualityOfThese(requests), could you include things that you weren't using in modules and importScripts? That would enable you to have a single resource that echoed the version number. Dunno if that's useful.

delapuente · 2016-07-27T19:41:23Z

Sure. The thing I liked about the importScripts solution is it was at a resource level, but I'm sure we can achieve that via another API.

That was the idea.

But do we want to make a service worker specific importScripts() interface? This would not work if someone uses a library that then uses the existing importScripts() API internally.

Well, what I would find extremely uncomfortable is to re-declare my imports for marking purposes only. Perhaps introducing a new import function (importScriptForcingUpdate(...))?

Dealing with ES6 modules is complicated but what about a pragma:

import "my-library";

"force update";
import "my-other-library";

I don't really like it and I don't really know if there is a standard mechanism to introduce "use strict"-like pragmas in ES6 but declarative APIs are this kind of inconvenient.

jakearchibald · 2016-07-29T18:22:42Z

F2F:

Should we check all imported scripts by default? Yes
Check the flattened imported scripts & the main script, if any of them are byte-by-byte different, including !ok responses, trigger an update (where a !ok response will fail the update)
The browser may optimise for this, eg if the main script has changed it doesn't need to check its imported scripts
No opt-out of this

jakearchibald · 2016-12-12T11:03:32Z

@ithinkihaveacat

Are the byte-for-byte checks shared across different service workers? (If two service workers import the same script, are the etags/hashes "shared"?)

When a service worker fetches a script (either the main script or imported), it will go to the network (optionally) via the HTTP cache. It won't go to the cache API or the script cache of any other service workers.

ithinkihaveacat · 2016-12-12T11:14:38Z

@jakearchibald Can that lead to a situation where the resources A and B are both updated at the origin, but the browser only notices that B has changed (because of timing-related artefacts of the HTTP cache), and so updates the SW using the "old" version of A and the "new" version of B?

jakearchibald · 2016-12-12T11:45:13Z

@ithinkihaveacat yep. Same is true for HTML documents. We have made the HTTP cache opt-in because of developer confusion around this (#893). Developers who opt into the HTTP cache should understand how it works.

This changes the behavior of the service worker script resource comparison. Before this, only the main service worker script was compared to a new version. With this change, all the imported scripts stored in the imported scripts map as well as the main script are inspected against the corresponding network resources (based on the urls.) Note: - Service worker's script resource map has been renamed and moved to service worker's script resource's imported scritps map. - registration's last update check time's always updated whenever the response is fetched from the network (regardless it's a main script or an imported script.) Fixes #839.

wanderview · 2016-12-12T17:11:12Z

Yea, I agree they are orthogonal issues.

jakearchibald · 2016-12-13T12:51:43Z

An interesting point has been raised internally - is it possible we could damage sites relying on caching by making this change. I'll reach out to our biggest users and see how they feel. Worst comes to worst, we could make no-cache opt-in.

wanderview · 2016-12-13T14:41:03Z

@jakearchibald When you talk to these sites, can you also mention there is a work around if they are serving unique hashed resources? They can set cache-control:immutable with a very large max-age to avoid these network requests at all in firefox/chrome.

ithinkihaveacat · 2016-12-13T14:59:54Z

@wanderview Sites that are able to generate unique hashed resources wouldn't really need this feature though, right?

I thought the point was to make it possible for service workers to do e.g. importScripts('https://www.gstatic.com/firebasejs/firebase-app.js'); and be able to quickly and reliably pick up changes to https://www.gstatic.com/firebasejs/firebase-app.js even if the service worker itself remained byte-for-byte identical.

If the default for all network activity related to service worker update checks becomes no-cache (as per #839 (comment)) then that's going to result in a lot of 304s for any widely deployed resource. (However, not doing this would lead to browsers potentially getting themselves into an inconsistent state #839 (comment).)

wanderview · 2016-12-13T15:19:47Z

I thought the point was to make it possible for service workers to do e.g. importScripts('https://www.gstatic.com/firebasejs/firebase-app.js'); and be able to quickly and reliably pick up changes to https://www.gstatic.com/firebasejs/firebase-app.js even if the service worker itself remained byte-for-byte identical.

I guess I thought people typically versioned 3rd party dependencies. Allowing external dependencies to float at-will in production seems kind of crazy to me.

ithinkihaveacat · 2016-12-13T15:32:12Z

I guess I thought people typically versioned 3rd party dependencies. Allowing external dependencies to float at-will in production seems kind of crazy to me.

I suppose it depends on the use case. Something like https://www.google-analytics.com/ga.js isn't versioned, and that works out fine.

https://www.hodinkee.com/OneSignalSDKWorker.js consists of one line:

importScripts('https://cdn.onesignal.com/sdks/OneSignalSDK.js');

Obviously whatever's in OneSignalSDK.js could be inlined into OneSignalSDKWorker.js (would even save a network request), but then OneSignal need to get Hodinkee to deploy a new version every time they update their SDK.

This changes the behavior of the service worker script resource comparison. Before this, only the main service worker script was compared to a new version. With this change, all the imported scripts stored in the imported scripts map as well as the main script are inspected against the corresponding network resources (based on the urls.) Note: - Service worker's script resource map has been renamed and moved to service worker's script resource's imported scritps map. - registration's last update check time's always updated whenever the response is fetched from the network (regardless it's a main script or an imported script.) Fixes #839.

delapuente · 2017-02-14T09:22:37Z

Is this already implemented in Chrome or Firefox?

wanderview · 2017-02-14T14:14:08Z

Is this already implemented in Chrome or Firefox?

Updating based on importScripts() in FF has been started, but not completed:

https://bugzilla.mozilla.org/show_bug.cgi?id=1290951

Related to this, defaulting updates to no-cache is already implemented in FF53:

https://bugzilla.mozilla.org/show_bug.cgi?id=1290944

jakearchibald · 2017-04-04T07:02:12Z

@KenjiBaheux & I should email SW users to make sure big users of SW are aware of this.

mfalken · 2017-04-04T08:09:09Z

The F2F resolution was to check importScripts in the byte-for-byte comparison; however issue #893 changed so that useCache would specify caching the importScripts by default.

NB. Currently, updating the data resource files will not make the service worker re-cache them. The service worker file itself will need to be updated. However, browser are working on this. See: w3c/ServiceWorker#839

jungkees · 2018-03-07T05:59:11Z

While working on this issue with @mattto, I found out we need to discuss about when to fetch and compare the imported classic scripts for Update. (See #1283 (comment).) Now we have two options:

Fetch imported scripts during the first evaluation of the main script in Update.
Fetch imported scripts (of newestWorker) before evaluating the main script.

(2) allows us to return early even before starting a worker. @jakearchibald seemed to be concerned about double-download (https://github.com/w3c/ServiceWorker/pull/1023/files#r92201798) here, but we can avoid importScripts() in the main script downloading the scripts from the network because we fill in the cache before that time.

But if the imported scripts in (2) have errors, we can't avoid running the main script and the cached scripts before catching those errors anyway.

Thoughts?

/cc @jakearchibald @wanderview @aliams @cdumez

EDIT: I tried it with (1) in #1283.

mfalken · 2018-03-07T07:04:05Z

I responded in #1283 (comment), but to reiterate here, I'm much more concerned about needlessly starting a service worker in the common case than in the error case. I think we should avoid starting a service worker until the byte-to-byte update check (including importScripts) shows that an update is possible. Otherwise almost every navigation will start a new service worker to do an update check which will usually just be wasted.

jungkees · 2018-03-07T07:16:28Z

That's a fair point. I agree to (2). We can do this without doing a double-download.

jakearchibald added the enhancement label Feb 23, 2016

jakearchibald added this to the Version 2 milestone Feb 23, 2016

delapuente mentioned this issue Feb 28, 2016

consider exposing install time on ServiceWorker DOM object #842

Closed

jakearchibald mentioned this issue Apr 10, 2016

Take into accounts etags on initialization importScripts responses to update the Service Worker #830

Closed

This was referenced Jun 7, 2016

Introduce Service-Worker-Max-Age header #721

Closed

Allow preventing the update process to finish #761

Open

jakearchibald mentioned this issue Jul 19, 2016

Create F2F agenda - 28-29 July 2016 #932

Closed

jungkees mentioned this issue Dec 12, 2016

Include imported scripts to byte-check #1023

Closed

mfalken mentioned this issue Dec 13, 2016

consider fetching service worker scripts with no-cache by default #893

Closed

delapuente mentioned this issue Jan 28, 2017

Updated framework so it doesn't have to be modified to be used delapuente/karma-sw-mocha#2

Closed

webmaxru mentioned this issue Mar 5, 2017

Better explanation of how importScripts can extend default behavior GoogleChromeLabs/sw-precache#49

Open

jakearchibald added the apr-2017-f2f-v1 label Mar 31, 2017

jakearchibald mentioned this issue Mar 31, 2017

Create F2F agenda - 3-4 April 2017 #1053

Closed

jakearchibald self-assigned this Apr 4, 2017

asutherland mentioned this issue Jun 5, 2017

Proposal: pass custom params in ServiceWorkerRegistration for future use #1157

Closed

ithinkihaveacat mentioned this issue Sep 18, 2017

SDK auto-update mechanism OneSignal/OneSignal-Website-SDK#266

Closed

mfalken mentioned this issue Mar 7, 2018

Improve service worker script caching and update #1283

Merged

jungkees closed this as completed in 8e25c26 Mar 13, 2018

Consider relying on eTags (or other headers) for service worker dependencies to check for updates #839

Consider relying on eTags (or other headers) for service worker dependencies to check for updates #839

Comments

delapuente commented Feb 23, 2016

annevk commented Feb 23, 2016

jakearchibald commented Feb 23, 2016

delapuente commented Feb 23, 2016

jakearchibald commented Feb 23, 2016

delapuente commented Feb 23, 2016

jakearchibald commented Feb 25, 2016

jakearchibald commented Apr 10, 2016

jakearchibald commented Apr 12, 2016

jakearchibald commented Apr 12, 2016

annevk commented Apr 14, 2016

collimarco commented Jul 9, 2016

wanderview commented Jul 9, 2016

mfalken commented Jul 11, 2016

jakearchibald commented Jul 19, 2016

wanderview commented Jul 19, 2016

jakearchibald commented Jul 19, 2016

collimarco commented Jul 19, 2016

delapuente commented Jul 26, 2016

jakearchibald commented Jul 27, 2016 • edited Loading

wanderview commented Jul 27, 2016

jakearchibald commented Jul 27, 2016

delapuente commented Jul 27, 2016

jakearchibald commented Jul 29, 2016

jakearchibald commented Dec 12, 2016

ithinkihaveacat commented Dec 12, 2016

jakearchibald commented Dec 12, 2016

wanderview commented Dec 12, 2016

jakearchibald commented Dec 13, 2016

wanderview commented Dec 13, 2016

ithinkihaveacat commented Dec 13, 2016 • edited Loading

wanderview commented Dec 13, 2016

ithinkihaveacat commented Dec 13, 2016

delapuente commented Feb 14, 2017

wanderview commented Feb 14, 2017

jakearchibald commented Apr 4, 2017

mfalken commented Apr 4, 2017

jungkees commented Mar 7, 2018 • edited Loading

mfalken commented Mar 7, 2018

jungkees commented Mar 7, 2018

jakearchibald commented Jul 27, 2016 •

edited

Loading

ithinkihaveacat commented Dec 13, 2016 •

edited

Loading

jungkees commented Mar 7, 2018 •

edited

Loading