Watch cache regression: changes behavior of "resource version too old" error #25151

Closed
smarterclayton opened this issue May 4, 2016 · 13 comments · Fixed by #25369
Labels
area/api: Indicates an issue on api area.
priority/important-soon: Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
sig/api-machinery: Categorizes an issue or PR as relevant to SIG API Machinery.

Comments

@smarterclayton
Contributor

When attempting to watch from a resource version that is too old, the behavior of watch (for both HTTP and web sockets) changed when the watch cache was enabled. The watch cache returns a 410 directly; without the watch cache, we return 200 and then write an "error" entry to the watch stream. This broke a client that depended on the previous watch behavior, and it is also inconsistent with other errors we set in the registry. In addition, for watch over web sockets, most JavaScript clients will be unable to get at a 410 error from the connection, because browsers don't give client code access to that response.

We should restore the watch cache to the previous behavior, since this is a regression in API semantics.

Resource without watch cache, watching too old:

```
$ curl ... "https:///oapi/v1/namespaces/clayton-dev/builds?watch=1&resourceVersion=1"
> GET /oapi/v1/namespaces/clayton-dev/builds?watch=1&resourceVersion=1 HTTP/1.1
> Accept: application/json, */*
> User-Agent: oc/v1.3.0 (darwin/amd64) kubernetes/2787678
> 
< HTTP/1.1 200 OK
< Cache-Control: no-store
< Transfer-Encoding: chunked
< Date: Wed, 04 May 2016 18:10:26 GMT
< Content-Type: text/plain; charset=utf-8
< Transfer-Encoding: chunked
< 
{"type":"ERROR","object":{"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"401: The event in requested index is outdated and cleared (the requested history has been cleared [4046756/2]) [4047755]","reason":"Expired","code":410}}
```

Resource with watch cache, watching too old:

```
$ curl ... "https:///apis/extensions/v1beta1/namespaces/clayton-dev/horizontalpodautoscalers?watch=1&resourceVersion=1"
> GET /apis/extensions/v1beta1/namespaces/clayton-dev/horizontalpodautoscalers?watch=1&resourceVersion=1 HTTP/1.1
> Accept: application/json, */*
> User-Agent: oc/v1.3.0 (darwin/amd64) kubernetes/2787678
> 
< HTTP/1.1 410 Gone
< Cache-Control: no-store
< Content-Type: application/json
< Date: Wed, 04 May 2016 18:08:25 GMT
< Content-Length: 146
< 
{"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"too old resource version: 1 (4034675)","reason":"Gone","code":410}
```
smarterclayton added the area/api label on May 4, 2016
@smarterclayton
Contributor Author

@kubernetes/sig-api-machinery I think this is a candidate for backport to 1.2 to restore the previous behavior.

smarterclayton added the priority/important-soon label on May 4, 2016
smarterclayton changed the title from "Watch cache changes behavior of 'expired' error" to "Watch cache regression: changes behavior of 'resource version too old' error" on May 4, 2016
@smarterclayton
Contributor Author

@jwforres since this broke your JavaScript client; @liggitt FYI because of the watch cache.

@ncdc
Member

ncdc commented May 4, 2016

+1 for backport

@wojtek-t
Member

wojtek-t commented May 4, 2016

Hmm - we enabled the watch cache in 1.1, so this should also be broken in 1.1.

@lavalamp
Member

lavalamp commented May 4, 2016

It's possible that it was always nondeterministic and just depends on how fast the call to the storage layer executes. I think all of our in-tree clients will handle it either way.

@smarterclayton
Contributor Author

I'm fairly certain this never used to behave this way, because OpenShift hadn't enabled the watch cache and we never had a report of this until we enabled the watch cache for Kube resources just recently.


@smarterclayton
Contributor Author

I see what you mean about ordering - the etcd remote call would almost never be fast enough to beat the watch event. In practice, I suspect no one ever saw this without the watch error.

Either way, for web sockets, if we don't fix this, clients can't get this error (the browser eats the 410 during the websocket negotiation and client code can't get access to it). So it's at least broken for web sockets.


@lavalamp
Member

lavalamp commented May 5, 2016

Well, it should be easy to just not wait at all, maybe? I'm OK with changing this behavior.

@lavalamp
Member

lavalamp commented May 5, 2016

Although, are web clients XSS hardened? I'm not sure that we're optimizing for them currently.

@liggitt
Member

liggitt commented May 5, 2016

If you mean CSRF, it isn't an issue with bearer token auth. Basic auth should already have a big warning around it saying "not safe with browsers"

roberthbailey added the sig/api-machinery and team/control-plane labels on May 8, 2016
@liggitt
Member

liggitt commented May 9, 2016

> It's possible that it was always nondeterministic and just depends on how fast the call to the storage layer executes.

Maybe I'm missing something, but it seems deterministic to me. This is the uncached watch impl:

```
// Implements storage.Interface.
func (h *etcdHelper) WatchList(ctx context.Context, key string, resourceVersion string, filter storage.FilterFunc) (watch.Interface, error) {
    if ctx == nil {
        glog.Errorf("Context is nil")
    }
    watchRV, err := storage.ParseWatchResourceVersion(resourceVersion)
    if err != nil {
        return nil, err
    }
    key = h.prefixEtcdKey(key)
    w := newEtcdWatcher(true, h.quorum, exceptKey(key), filter, h.codec, h.versioner, nil, h)
    go w.etcdWatch(ctx, h.etcdKeysAPI, key, watchRV)
    return w, nil
}
```

Once the resourceVersion parses, there is no path that returns a direct error, so all errors would have to be returned as an event via the ResultChan().

I don't see us reading from that until after we've already written a 200 and flushed: https://github.com/kubernetes/kubernetes/blob/master/pkg/apiserver/watch.go#L177
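To make that ordering concrete, here is a heavily simplified sketch of the flow described above; it is not the actual apiserver handler (the package and types are stand-ins), just the "write 200, flush, then read from the result channel" sequence:

```
package watchdemo

import (
	"encoding/json"
	"net/http"
)

// watchEvent is a minimal stand-in for the watch event envelope written to
// the stream; it is not the real apiserver type.
type watchEvent struct {
	Type   string      `json:"type"`
	Object interface{} `json:"object"`
}

// serveWatch illustrates the ordering described above: the handler commits to
// a 200 and flushes before it ever reads from the watcher's result channel,
// so any error (including "resource version too old") can only reach the
// client as an event written into the stream.
func serveWatch(w http.ResponseWriter, results <-chan watchEvent) {
	flusher, ok := w.(http.Flusher)
	if !ok {
		http.Error(w, "streaming unsupported", http.StatusInternalServerError)
		return
	}

	w.WriteHeader(http.StatusOK) // the status line is already on the wire here
	flusher.Flush()

	enc := json.NewEncoder(w)
	for ev := range results { // only now does the handler read watch results
		if err := enc.Encode(ev); err != nil {
			return
		}
		flusher.Flush()
	}
}
```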

k8s-github-robot pushed a commit that referenced this issue May 11, 2016
Automatic merge from submit-queue

Return 'too old' errors from watch cache via watch stream

Fixes #25151

This PR updates the API server to produce the same results when a watch is attempted with a resourceVersion that is too old, regardless of whether the etcd watch cache is enabled. The expected result is a `200` http status, with a single watch event of type `ERROR`. Previously, the watch cache would deliver a `410` http response.

This is the uncached watch impl:
```
// Implements storage.Interface.
func (h *etcdHelper) WatchList(ctx context.Context, key string, resourceVersion string, filter storage.FilterFunc) (watch.Interface, error) {
	if ctx == nil {
		glog.Errorf("Context is nil")
	}
	watchRV, err := storage.ParseWatchResourceVersion(resourceVersion)
	if err != nil {
		return nil, err
	}
	key = h.prefixEtcdKey(key)
	w := newEtcdWatcher(true, h.quorum, exceptKey(key), filter, h.codec, h.versioner, nil, h)
	go w.etcdWatch(ctx, h.etcdKeysAPI, key, watchRV)
	return w, nil
}
```

Once the resourceVersion parses, there is no path that returns a direct error, so all errors would have to be returned as an `ERROR` event via the ResultChan().
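One way to get that behavior, sketched below with simplified stand-in types (illustrative only, not the code merged in #25369): have the watch cache hand the handler a watcher whose result channel already carries the expiration as a single ERROR event, so the request is served like any other watch and the error goes out in-band:

```
package watchdemo

// EventType, Event, and Status are simplified stand-ins for the API
// machinery's watch.EventType, watch.Event, and Status types; this is not
// the code merged in #25369, only an illustration of the approach.
type EventType string

const Error EventType = "ERROR"

type Event struct {
	Type   EventType
	Object interface{} // the real field type is runtime.Object
}

type Status struct {
	Status  string
	Message string
	Reason  string
	Code    int
}

// errWatcher behaves like a normal watcher whose stream contains exactly one
// ERROR event: the handler still responds 200, and the "too old" condition is
// delivered in-band, matching the non-cached behavior.
type errWatcher struct {
	result chan Event
}

func newErrWatcher(message string) *errWatcher {
	ch := make(chan Event, 1)
	ch <- Event{
		Type: Error,
		Object: &Status{
			Status:  "Failure",
			Message: message,
			Reason:  "Expired",
			Code:    410,
		},
	}
	close(ch)
	return &errWatcher{result: ch}
}

// ResultChan and Stop give errWatcher the same shape as watch.Interface.
func (w *errWatcher) ResultChan() <-chan Event { return w.result }
func (w *errWatcher) Stop()                    {}
```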
@cben

cben commented Mar 11, 2020

Having read the docs, I was very surprised that "type": "ERROR" notices are sent with a 200 status (even over plain HTTP), with the semantic code (e.g. 410) only represented inside the event, under object.code.
I see there was good motivation here to stick to 200, but it should be called out in the documentation.

https://kubernetes.io/docs/reference/using-api/api-concepts/#efficient-detection-of-changes doesn't show at all how watch returns errors, and it even says "clients must handle the case by recognizing the status code 410 Gone" which really makes one expect an actual HTTP 410.

@lavalamp
Member

@cben I agree this should be called out in the docs. Can you file a bug there and/or send a PR? I don't think this ~4 year old issue is the best place to track that. (Good detective work finding this, though!)

openshift-publish-robot pushed a commit to openshift/kubernetes that referenced this issue Jul 16, 2020
Bug 1753649: UPSTREAM: 89937: portAllocator sync local data before allocate

Origin-commit: d548b192d8b9e38402772509c8f0abc60ac7069c