services/horizon: /order_book slower when reading from offers graph #1963
Comments
It seems that since the 0.23.1 release the performance of the /order_book endpoint with the in-memory order book graph has improved significantly. In the 1.4.0 release we started to query the Horizon DB to determine the order book spread instead of using the in-memory order book graph. It turns out that querying the DB is slower than using the in-memory graph, which is what we would expect. However, the response times are still very low using the Horizon DB. Before deploying Horizon 1.4.0 the average response time for /order_book requests was 5ms. After 1.4.0 was deployed the average response time increased to 9ms. Similarly, before deploying Horizon 1.4.0, the P99 response time was 10ms. After 1.4.0 was deployed the P99 response time increased to 20ms.
Thanks for measuring this! For now this seems not urgent to improve since it is fast enough. If we did want to improve performance in the future, are there any obvious steps to take?
Thanks for checking this @tamirms! Super interesting. Too bad it's hard to load the old data and check the metrics once again (maybe I did something wrong with my query). I zoomed out to check the 7d charts and found that in-memory

Anyway, I think the response times are great even with the DB version, so maybe we shouldn't revert it. It could be interesting to compare CPU profiles of 0.23.1 and 1.3.0. Maybe that will give us some useful hints connected to graph optimization.
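On comparing CPU profiles: here's a minimal, hedged sketch of how that could be done with Go's standard tooling. It assumes the process exposes the stock net/http/pprof handlers on a side port; the port and the way Horizon actually wires up its admin/profiling endpoints may differ.

```go
// Package profiling shows a generic way to expose Go's pprof endpoints.
// This is not Horizon's actual admin setup; it is an illustrative sketch.
package profiling

import (
	"net/http"
	_ "net/http/pprof" // registers /debug/pprof/* on the default mux
)

// StartPprof serves the pprof handlers on a hypothetical side port.
// A 30-second CPU profile can then be captured with:
//
//	go tool pprof http://localhost:6060/debug/pprof/profile?seconds=30
//
// Two captures taken under similar load (e.g. one from 0.23.1 and one from
// 1.3.0) can be compared with pprof's -diff_base flag.
func StartPprof() {
	go func() {
		_ = http.ListenAndServe("localhost:6060", nil)
	}()
}
```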
I'm against reverting because this is big for us: i) it means we (or anyone) can scale request-serving Horizons horizontally through read-only replicas, and ii) it removes the need for request-serving Horizons to ingest, breaking the N-N mapping between Horizon and core. This is a big deal for fast txmeta scalability.
@ire-and-curses sorry, I wasn't clear. I meant reverting to the code that uses the order-book graph as a data source instead of the DB (the reading part), not reverting ingestion on front-end nodes (the writing part). Once the graph is updated by
Sorry, I should have clarified that, since Horizon 1.4.0, the order book graph is populated by polling the Horizon DB instead of via distributed ingestion (#2630). This means the order book graph is available on all the request-serving Horizon nodes, so we could go back to querying the order book graph instead of the DB to serve /order_book responses. Doing so would not require the request-serving Horizon nodes to participate in ingestion again.

The reason I implemented order book queries against the Horizon DB in #2617 is that I thought reducing our dependency on the in-memory order book graph would make it easier to remove the frontend Horizon nodes from distributed ingestion. While implementing #2630 I realized that we could still have an in-memory order book graph without forcing the frontend nodes to participate in ingestion.

If we went back to querying the in-memory order book graph, I would want to check that the extra queries on the graph would not have a negative impact in terms of lock contention. A read-write lock ensures that the order book graph can be updated in a thread-safe manner while other goroutines are reading from it (see the sketch below).

If we ever want to join information from the order book DB query with data found in other tables, that will be easier than combining in-memory order book graph queries with DB queries.
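For illustration, a minimal sketch of that pattern. The types and names (OrderBookGraph, Offer, loadOffers) are hypothetical stand-ins rather than Horizon's actual code; it just shows a polling goroutine refreshing an in-memory graph under the write lock while request handlers read under the shared lock.

```go
// Package orderbook sketches an in-memory offers graph refreshed by polling a
// DB, guarded by a sync.RWMutex so reads and the periodic update are safe.
package orderbook

import (
	"sync"
	"time"
)

// Offer is an illustrative stand-in for a row from the offers table.
type Offer struct {
	Selling string
	Buying  string
	Price   float64 // units of Buying per unit of Selling (simplified)
	Amount  int64
}

// OrderBookGraph holds offers in memory and is safe for concurrent use:
// many readers can query it while a single writer refreshes it.
type OrderBookGraph struct {
	mu     sync.RWMutex
	offers []Offer
}

// Update replaces the in-memory snapshot under the write lock.
func (g *OrderBookGraph) Update(offers []Offer) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.offers = offers
}

// BestBidAsk computes an illustrative spread for a pair under the read lock,
// so concurrent /order_book requests never block each other.
func (g *OrderBookGraph) BestBidAsk(selling, buying string) (bid, ask float64) {
	g.mu.RLock()
	defer g.mu.RUnlock()
	for _, o := range g.offers {
		// Ask: cheapest offer selling `selling` for `buying`.
		if o.Selling == selling && o.Buying == buying && (ask == 0 || o.Price < ask) {
			ask = o.Price
		}
		// Bid: best offer on the reverse side, converted to the same quote.
		if o.Selling == buying && o.Buying == selling && (bid == 0 || 1/o.Price > bid) {
			bid = 1 / o.Price
		}
	}
	return bid, ask
}

// pollDB stands in for the "populate the graph by polling the Horizon DB"
// approach from #2630; loadOffers is a hypothetical query function.
func pollDB(g *OrderBookGraph, loadOffers func() []Offer, interval time.Duration) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for range ticker.C {
		g.Update(loadOffers())
	}
}
```

On the contention question: RLock holders don't block each other, so the main cost of going back to the graph would be brief stalls while Update holds the write lock during each poll.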
Closing now as |
What version are you using?

Horizon 0.23.1

What did you do?

I checked response times pre/post upgrade from 0.22.2 to 0.23.1. I was surprised that response times for `/order_book` are actually higher even though 0.23.0 started using the in-memory offers graph. Count of responses with `duration > 0.1`:

Not super urgent because p99 of `duration` for this route is actually below 0.10-0.13, but it's worth checking why it's slow (deploy around 22:10):

What did you expect to see?

Smaller `duration` of `/order_book` responses.

What did you see instead?

Higher `duration` of `/order_book` responses.