Disable retry for deletion operations #676

sssoleileraaa · 2019-12-19T18:59:59Z

Description

Clicking on "Retry" in the error bar resumes the queue and retries the last failed job. If the job is to delete a Source then we need to either:

disable "Retry" for deletion operations
add logic that reopens the deletion dialog again so that the user is aware of what's happening

This is to avoid the scenario where a user clicks on "Retry" without knowing exactly which task they are retrying and unknowingly deleting a Source.

sssoleileraaa · 2019-12-19T20:57:30Z

Just chatted with @redshiftzero about this issue who pointed out that the deletion job will still be in the queue when the queue is paused and can potentially be processed at a much later time, depending on when network issues are resolved and the MetadataSyncJob succeeds and resumes the queues. Disabling "Retry" for deletion operations doesn't actually remove the failed job from the queue. We also don't remove jobs from the queue based on how long they've been sitting there.

In order to avoid the scenario where a deletion happens without the user realizing it, we should:

update the DeleteSourceJob so that it is removed from the queue after 5 attempts rather than leaving it there
add a way to clear the queue after jobs have been sitting for a long time (replies will be saved currently, but this means we will drop staring and downloads [until we add support for range requests])

sssoleileraaa · 2019-12-19T20:58:22Z

just want to get @rmol's attention on this as well

rmol · 2019-12-19T23:15:36Z

I'm missing how the deletion would happen without the user realizing it. The job was created because they chose to delete the source. If the job failed, I think it should be retried until their intent is carried out. But if we require or permit manual retry, that should just work. We don't need to disable retry for deletions, as deletion should be idempotent.

This is discussed at the end of section 3.1 in RFC 7232. It's legitimate to respond with a successful status code if the requested state of the resource matches its current state. We could send the If-Match header and if we get a 412 Precondition Failed response, show a message explaining that the source was gone before this request, but I'd vote for keeping it simple, sending the DELETE as we do now, and just returning a 200 or 204.

Right now the SecureDrop API returns a 404 if the source to be deleted is not found, so that would need to change.

sssoleileraaa · 2019-12-19T23:39:22Z

I see your point about if user says to do it, we should try to do it for as long as it takes, and they shouldn't be surprised by that. It's what we're currently doing. Certain operations like sending a reply are time-sensitive because you might want to change your message if it's been a long time since you attempted to send it, but maybe deleting a source isn't time sensitive and we should retry for as long as possible. My concern is that it might be surprising to retry a job for X minutes or hours or days. That's where I'm coming from.

We don't need to disable retry for deletions, as deletion should be idempotent.

My point isn't that I'm concerned about idempotent deletions so much as I'm concerned that retrying until the client is closed might be the wrong approach. Sometimes it makes sense for users to manually try again when there have been a certain number of attempts/ enough time has gone by. Sometimes we should accept that a job failed and should be removed from the queue. Again I think this is more important for replies, but it is worth thinking about for deletions and other operations.

As far as idempotency goes, I like your suggestion about sending a successful status code if the Source has already been deleted. +1 to that! That's solving a different issue, but I agree.

sssoleileraaa · 2019-12-20T18:52:25Z

Right now the SecureDrop API returns a 404 if the source to be deleted is not found, so that would need to change.

@rmol - right now a 404 doesn't cause the queue to pause; we drop jobs that fail for anything other than the api being inaccessible or the request timing out.

So if we make a bunch of deletion requests on the same object we'll just get back 404 and move on to the next job in the queue. I'm realizing that if we do decide to switch to a returning 200 or 204, then we will continuously tell the user that the Source has been deleted. It might be best to continue to use 404 and silently move on the way we're currently doing.

rmol · 2019-12-20T19:01:59Z

I think it's valid to interpret a 404 from a DELETE request as an indication that the resource is in the desired state.

But we just drop jobs if we get 4xx or 5xx from the server? Posting a reply to a deleted source wouldn't result in any kind of error message?

eloquence · 2020-02-19T22:41:28Z

The plan of record is to remove the explicit Retry (#811) but an implicit retry would still happen after connectivity is restored.

I think the way to avoid confusion about the state of the client is to implement a minimal iteration of a pending deletion state (#534) and to prompt the user about unfinished operations (#420).

We may want to separately offer a "Cancel" option for pending deletions, very similar to a "cancel reply" option (#810).

However, all of these are different ideas than the scope of this issue, which I would recommend closing.

zenmonkeykstop · 2024-02-08T18:57:03Z

Closing as per above discussion

sssoleileraaa mentioned this issue Feb 25, 2020

Standardize connection errors to "Trying to reconnect" language without Retry #811

Closed

zenmonkeykstop closed this as completed Feb 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disable retry for deletion operations #676

Disable retry for deletion operations #676

sssoleileraaa commented Dec 19, 2019

sssoleileraaa commented Dec 19, 2019

sssoleileraaa commented Dec 19, 2019

rmol commented Dec 19, 2019

sssoleileraaa commented Dec 19, 2019 •

edited

Loading

sssoleileraaa commented Dec 20, 2019

rmol commented Dec 20, 2019

eloquence commented Feb 19, 2020

zenmonkeykstop commented Feb 8, 2024

Disable retry for deletion operations #676

Disable retry for deletion operations #676

Comments

sssoleileraaa commented Dec 19, 2019

Description

sssoleileraaa commented Dec 19, 2019

sssoleileraaa commented Dec 19, 2019

rmol commented Dec 19, 2019

sssoleileraaa commented Dec 19, 2019 • edited Loading

sssoleileraaa commented Dec 20, 2019

rmol commented Dec 20, 2019

eloquence commented Feb 19, 2020

zenmonkeykstop commented Feb 8, 2024

sssoleileraaa commented Dec 19, 2019 •

edited

Loading