
Parallelize back-end requests (e.g. for /jobs) #28

Closed
soxofaan opened this issue Jan 11, 2022 · 2 comments

Comments

@soxofaan
Member

soxofaan commented Jan 11, 2022

The aggregator has to combine results from multiple back-ends for some API endpoints, e.g. /collections, /processes, /file_formats, /udf_runtimes, ... At the moment these are not user-specific and don't change often, so it's not hard to avoid performance bottlenecks with a bit of caching (e.g. see #2).
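The kind of caching that makes these non-user-specific endpoints cheap to serve can be sketched as a small time-based cache (a hypothetical illustration, not the aggregator's actual cache implementation; `TtlCache` and the `/collections` producer are made-up names):

```python
# Hypothetical sketch of TTL-based caching for endpoints like
# /collections or /processes: the upstream call is made once and
# its result is reused until the TTL expires.
import time


class TtlCache:
    def __init__(self, ttl: float):
        self.ttl = ttl
        self._store = {}  # key -> (expiry timestamp, cached value)

    def get_or_call(self, key, producer):
        now = time.monotonic()
        hit = self._store.get(key)
        if hit and hit[0] > now:
            return hit[1]  # still fresh: serve from cache
        value = producer()  # expired or missing: call upstream
        self._store[key] = (now + self.ttl, value)
        return value


calls = []
cache = TtlCache(ttl=60)

def fetch_collections():
    # Stand-in for a real back-end request; also counts invocations.
    calls.append(1)
    return ["collection-a", "collection-b"]

cache.get_or_call("/collections", fetch_collections)
cache.get_or_call("/collections", fetch_collections)
assert len(calls) == 1  # second lookup was served from cache
```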

The /jobs endpoint, however, is user-specific and dynamic, so it offers very little opportunity for caching. At the moment the requests to the underlying back-ends are done one after the other, sometimes resulting in long response times (on the order of tens of seconds; related: #27, openEOPlatform/architecture-docs#179). By doing the back-end requests in parallel in some way (async, threads, ...), response times can be improved considerably.
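One of the thread-based options could look roughly like this: fan out the per-back-end /jobs requests through a thread pool and merge the results. This is a minimal sketch, not the aggregator's real code; `fetch_jobs`, `list_jobs_parallel`, and the back-end ids are hypothetical stand-ins for the actual connection logic:

```python
# Sketch: issue the per-back-end /jobs requests concurrently with a
# thread pool instead of sequentially, then merge the job listings.
from concurrent.futures import ThreadPoolExecutor


def fetch_jobs(backend_id: str) -> list[dict]:
    # Placeholder for a real HTTP GET on `{backend_url}/jobs`;
    # here we just return a canned response per back-end.
    canned = {
        "vito": [{"id": "vito-job-1", "status": "finished"}],
        "eodc": [{"id": "eodc-job-1", "status": "running"}],
    }
    return canned[backend_id]


def list_jobs_parallel(backend_ids: list[str]) -> list[dict]:
    # One worker per back-end: the calls overlap, so total latency is
    # roughly max(per-back-end latency) instead of the sum.
    with ThreadPoolExecutor(max_workers=len(backend_ids)) as pool:
        results = pool.map(fetch_jobs, backend_ids)
    # Flatten the per-back-end listings into one aggregated response.
    return [job for jobs in results for job in jobs]


print(list_jobs_parallel(["vito", "eodc"]))
```

`pool.map` preserves input order, so the aggregated listing stays deterministic across back-ends, which keeps the merged response stable between polls.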

Note that the web editor polls /jobs regularly (and maybe a future notebook component will too), so it's worthwhile to optimize this endpoint (at least for perceived performance).

(internal ref EP-4122)

@soxofaan
Member Author

soxofaan commented Aug 3, 2022

#33 was a duplicate:

In various places (and especially the large area processing feature), the aggregator makes synchronous back-end requests in series. While this was the easiest way to set up a proof-of-concept implementation in the existing openeo_driver framework, it is obviously bad for performance because some requests can take quite a long time (e.g. starting a batch job).
A lot can be gained by waiting for back-end responses in parallel.
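The gain from overlapping the waits can be demonstrated with simulated latencies (an assumed toy benchmark, not the aggregator's code; `slow_backend_call` stands in for a slow HTTP request such as a batch job start):

```python
# Toy demonstration: three slow calls in series take ~sum of the delays,
# while the same calls through a thread pool take ~the slowest one.
import time
from concurrent.futures import ThreadPoolExecutor


def slow_backend_call(delay: float) -> float:
    time.sleep(delay)  # stand-in for waiting on a slow back-end response
    return delay


delays = [0.2, 0.2, 0.2]

t0 = time.monotonic()
serial = [slow_backend_call(d) for d in delays]  # one after the other
serial_time = time.monotonic() - t0

t0 = time.monotonic()
with ThreadPoolExecutor(max_workers=len(delays)) as pool:
    parallel = list(pool.map(slow_backend_call, delays))  # overlapping waits
parallel_time = time.monotonic() - t0

assert serial == parallel  # same results, different wall-clock time
print(f"serial: {serial_time:.2f}s, parallel: {parallel_time:.2f}s")
```

Since these calls spend their time blocked on I/O (not holding the GIL), plain threads are enough; an asyncio-based approach would achieve the same overlap.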

@soxofaan
Member Author

soxofaan commented Oct 5, 2023

Implemented parallelized handling of /jobs requests.

I don't think there are other opportunities for this kind of perf optimization by parallelization (collection/process metadata requests are now optimized through caching).

going to close (for now)
