use priority queue for job processing #486

redshiftzero · 2019-07-22T23:09:46Z

Description

Fixes #423.

After investigation, #423 is a worthwhile architectural change as we get some major benefits:

We can continue to use a single general (priority) queue instead of many separate queues (what we were previously considering) for different actions (i.e. a priority queue is the right data structure for the situation where e.g. we want a metadata sync to occur with high priority every 5 minutes and then other tasks at lower priorities when we get time e.g. starring or flagging). Still worth having the file download queue to be separate so that file downloads (which can take a long time) occur concurrently with other shorter-lived actions (all in the general priority queue).
This PR adds an execution priority for each server-related (job) action - unimplemented options are commented out intentionally- and replaces the use of Python's Queue with Python's PriorityQueue.
Removes RunnableQueue.last_job while continuing to preserve job ordering in cases where jobs timeout/queue execution pauses.

Other notes to be aware of:

A quirk of Python PriorityQueue is that it does not preserve FIFO ordering of objects with equal priorities. A counter was added to our job objects to ensure that the sort order of objects with equal priorities is stable (added notes inline and in commit messages to make this clear).
These job counters will need to persist if/when we persist the queues.

Test Plan

I would follow the standard test plan for this one

Checklist

If these changes modify code paths involving cryptography, the opening of files in VMs, network (via the RPC service) traffic, or fine tuning of the graphical user interface, Qubes testing is required. Please check as applicable:

I have tested these changes in Qubes
I do not have a Qubes OS workstation (the reviewer will need to test these changes in Qubes)
This really only modifies how the queue processes jobs, so testing on Qubes is not strictly necessary.

TODO: properly handle last_job

the next job in a priority queue is set by sorted(list(entries))[0] this means that the job objects themselves must be sortable, i.e. they must all implement __lt__. in this commit I'm just implementing __lt__ for two job types where we don't care about the order as long as all jobs of that type have the same priority. see related discussion in https://bugs.python.org/issue31145 and for a solution using dataclasses (not in python 3.5)

sssoleileraaa

will run through some tests in the morning but looks good for the initial pass-through

securedrop_client/api_jobs/base.py

securedrop_client/queue.py

sssoleileraaa

This bug on master now does not occur. Please confirm that this PR fixes it.

However a similar but different bug is happening, STR:

log into the client in Qubes
cut connection to server
send a few replies and wait until they timeout (you'll see the red bars)
send a message as the source with the failed replies
press refresh
reconnect server
wait until refresh is over
send another message as the source and refresh

Expected:

The replies will be sent and no longer red. The messages from the source will show up in the conversation view.

Actual:

The replies are still red (failed). The messages from the source do not show up in the conversation view. When you close the client and reopen the red replies are blue and the messages appear, so this seems to only be a UI refresh issue.

sssoleileraaa · 2019-07-23T23:55:32Z

send a few replies and wait until they timeout (you'll see the red bars)

Wait for at least one to time out. The actual behavior happens both when a reply is in the middle of being processed and when all replies are finished being processed and failed.

sssoleileraaa · 2019-07-24T19:41:59Z

I added a commit to test and fix the small UI bug I mentioned in the comment: bdd6a93

Now when replies begin succeeding again, the previous replies that failed show up as successful and new source messages appear.

sssoleileraaa

At this point, all comments have been addressed so I think it's time to

redshiftzero added 4 commits July 18, 2019 16:23

queue: submit jobs with priorities to queues

4b2cbab

TODO: properly handle last_job

queue: workaround lack of sort stability in heapq

9ad3313

queue: get rid of last_job

fb41b99

redshiftzero requested review from sssoleileraaa and heartsucker as code owners July 22, 2019 23:09

sssoleileraaa reviewed Jul 23, 2019

View reviewed changes

securedrop_client/api_jobs/base.py Outdated Show resolved Hide resolved

securedrop_client/queue.py Outdated Show resolved Hide resolved

redshiftzero added 5 commits July 23, 2019 16:11

queue: update type sigs for PriorityQueue and related ApiJob changes

0a11a36

queue: make job_priorities dict a class variable for testability

deba67d

test: update existing tests to add job_priorities, remove last_job

87bd329

test: new tests for ApiJob.__lt__ (ApiJobs are sortable now)

94f92d7

test: queue jobs executed in proper order

09851e6

redshiftzero force-pushed the spike-priority-queue branch from aaebfd2 to 09851e6 Compare July 23, 2019 23:12

sssoleileraaa self-requested a review July 23, 2019 23:27

sssoleileraaa suggested changes Jul 23, 2019

View reviewed changes

redshiftzero and others added 2 commits July 24, 2019 12:05

queue: address review comments from priorityqueue implementation

c15426b

refresh session whenever a reply succeeds

bdd6a93

sssoleileraaa force-pushed the spike-priority-queue branch from 8952d13 to bdd6a93 Compare July 24, 2019 19:36

sssoleileraaa self-requested a review July 24, 2019 19:43

sssoleileraaa approved these changes Jul 24, 2019

View reviewed changes

sssoleileraaa merged commit 4633946 into master Jul 24, 2019

sssoleileraaa deleted the spike-priority-queue branch July 24, 2019 19:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use priority queue for job processing #486

use priority queue for job processing #486

redshiftzero commented Jul 22, 2019

sssoleileraaa left a comment

sssoleileraaa left a comment •

edited

Loading

sssoleileraaa commented Jul 23, 2019

sssoleileraaa commented Jul 24, 2019

sssoleileraaa left a comment

use priority queue for job processing #486

use priority queue for job processing #486

Conversation

redshiftzero commented Jul 22, 2019

Description

Test Plan

Checklist

sssoleileraaa left a comment

Choose a reason for hiding this comment

sssoleileraaa left a comment • edited Loading

Choose a reason for hiding this comment

sssoleileraaa commented Jul 23, 2019

sssoleileraaa commented Jul 24, 2019

sssoleileraaa left a comment

Choose a reason for hiding this comment

sssoleileraaa left a comment •

edited

Loading