feat(page_service): timeout-based batching of requests #9321
Conversation
5490 tests run: 5264 passed, 0 failed, 226 skipped (full report)

Code coverage* (full report)

* collected from Rust tests only

The comment gets automatically updated with the latest test results.

2bea398 at 2024-11-18T20:25:20.479Z :recycle:
The terminology is 50% debounce, 50% batching. I think batching is better; let's adjust the names. As a person who doesn't mind wordy names, `page_service_server_side_batching` is how I'd call the functionality / code module. Maybe `server_side_batch_timeout` for the config flag? And `BatchedFeMessage` instead of `DebouncedFeMessage` for the type?
During the hackathon, and now, I wonder(ed) whether we could refactor the batching loop into a Stream / itertools adapter. Like, at the start of `handle_pagerequests`, split the `pgb` into reading and writing ends (maybe fake that using `Arc<Mutex<Pgb>>`). Then wrap the reading end into the thing that implements Stream, spitting out `DebouncedFeMessage` as the stream item.

Orthogonal to that, can we please move the batching into a helper function so the high-level control flow of `handle_pagerequests` remains clear-ish?
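For illustration, a minimal sketch of what such a helper could look like, assuming a tokio runtime and using an `mpsc` channel plus a placeholder `FeMessage` type in place of the real connection and message types (this is not the PR's actual code):

```rust
use std::time::Duration;

use tokio::sync::mpsc;
use tokio::time::{sleep_until, Instant};

// Placeholder for the real frontend message type.
struct FeMessage(u64);

/// Await the first message without a deadline, then keep absorbing messages
/// until `batch_timeout` has elapsed. Returns `None` once the channel closes
/// with nothing buffered.
async fn read_batch(
    rx: &mut mpsc::Receiver<FeMessage>,
    batch_timeout: Duration,
) -> Option<Vec<FeMessage>> {
    let first = rx.recv().await?;
    let mut batch = vec![first];
    // The batch window only starts once there is something to batch.
    let deadline = Instant::now() + batch_timeout;
    loop {
        tokio::select! {
            maybe_msg = rx.recv() => match maybe_msg {
                Some(msg) => batch.push(msg),
                None => break, // peer gone; flush what we have
            },
            _ = sleep_until(deadline) => break, // window closed
        }
    }
    Some(batch)
}
```

The Stream variant would amount to wrapping this loop in a `poll_next` implementation (or an `async_stream::stream!` block) so that the rest of `handle_pagerequests` just consumes batches.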
I flagged a couple of places that `warn!` aggressively. I actually don't know if we log getpage errors. If we do, then I think it's fine to just leave these `warn!` messages & dismiss the conversation.
You're right. I'll touch up the names.

I also wondered about this while going through the code, but resisted the temptation to rewrite.

Sure, I'll give it a go.
The refactor is straightforward, but we also fix a bug: previously, a batch could have been empty if handling the first message resulted in an error (we would have panicked). This has been fixed by including errors in the current batch.
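Conceptually (names assumed, not the PR's actual types), that means an error becomes a batch element in its own right, so the batch handed to the processing step is never empty:

```rust
// Hypothetical shape: carrying the error as a batch element guarantees the
// batch is non-empty even when reading/decoding the first message fails.
enum BatchItem<Req, Err> {
    Request(Req),
    Error(Err),
}
```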
Conflicts: Pageserver config was also updated in main
We already log getpage errors at the pagestream level and on the ingest path.
Switch over to using Vec for the `Vec::spare_capacity_mut` interface. Fill the `MaybeUninit` slots carefully and `Vec::set_len` at the end.
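A generic sketch of that pattern (not the PR's code) looks like this:

```rust
use std::mem::MaybeUninit;

/// Append `n` values produced by `make` without default-initializing them first.
fn fill_with<T>(v: &mut Vec<T>, n: usize, mut make: impl FnMut(usize) -> T) {
    v.reserve(n);
    let start = v.len();
    let spare: &mut [MaybeUninit<T>] = v.spare_capacity_mut();
    for (i, slot) in spare.iter_mut().take(n).enumerate() {
        // MaybeUninit::write initializes the slot without reading/dropping
        // the (uninitialized) old contents.
        slot.write(make(i));
    }
    // SAFETY: the first `n` spare slots were initialized in the loop above.
    unsafe { v.set_len(start + n) };
}
```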
I see a lot of plumbing-through of the `server_side_batch_timeout` variable. Personally I wouldn't mind just plumbing through the `&'static PageServerConf`. Or building a `PageServiceConf` object out of `PageServerConf`.

Easily punt-able though.
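The second option might look roughly like this (field names assumed, not the PR's code):

```rust
use std::time::Duration;

// Stand-in for the real pageserver config; only the relevant field is shown.
struct PageServerConf {
    server_side_batch_timeout: Option<Duration>,
    // ... many fields unrelated to page_service
}

/// A narrow view holding only what page_service needs, so its handlers
/// don't have to depend on the whole `PageServerConf`.
#[derive(Clone, Copy)]
struct PageServiceConf {
    server_side_batch_timeout: Option<Duration>,
}

impl PageServiceConf {
    fn new(conf: &PageServerConf) -> Self {
        Self {
            server_side_batch_timeout: conf.server_side_batch_timeout,
        }
    }
}
```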
I think the only two things I'm hard-requesting changes for are the use of `smallvec` with inline capacity 1 for the return value of `read_batch_from_connection`, and the elimination of the `panic!(unsupported)`.
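For reference, with the `smallvec` crate a return type like that keeps the common single-message case off the heap (signature illustrative, not the PR's):

```rust
use smallvec::{smallvec, SmallVec};

// Illustrative only: one inline slot covers the common "no batching happened"
// case; larger batches spill to the heap transparently.
fn single_message_batch(msg: u64) -> SmallVec<[u64; 1]> {
    smallvec![msg]
}
```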
Smallvec 1.13.2 contains [an UB fix](servo/rust-smallvec#345). Upstream opened [a request](rustsec/advisory-db#1960) for this in the advisory-db but it never got acted upon. Found while working on #9321.
…-get-page-requests

Conflicts (all syntactic, not semantic):
- libs/pageserver_api/src/config.rs
- pageserver/Cargo.toml
- pageserver/src/config.rs
- pageserver/src/page_service.rs
- pageserver/src/pgdatadir_mapping.rs
…n of PageReconstructError=>PageStreamError for case of Timeline cancellation
This PR adds two benchmarks to demonstrate the effect of the server-side getpage request batching added in #9321.

For the CPU usage, I found that the `prometheus` crate's built-in CPU usage metric accounts seconds at integer granularity. That's not enough once you reduce the target benchmark runtime for local iteration. So, add a new `libmetrics` metric and report that instead.

The benchmarks are disabled because [on our benchmark nodes, timer resolution isn't high enough](https://neondb.slack.com/archives/C059ZC138NR/p1732264223207449). They work (no statement about quality) on my bare-metal devbox. They will be refined and enabled once we find a fix. Candidates at time of writing are:
- #9822
- #9851

Refs:
- Epic: #9376
- Extracted from #9792
Problem

We don't take advantage of the queue depth generated by the compute on the pageserver. We can process getpage requests more efficiently by batching them.

Summary of changes

Batch up incoming getpage requests that arrive within a configurable time window (`server_side_batch_timeout`). Then process the entire batch via one `get_vectored` timeline operation. By default, no merging takes place.
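As a conceptual sketch of that flow (stand-in types; the real code lives in `page_service.rs` and the timeline's `get_vectored`), the handling step collapses a batch into one vectored read:

```rust
struct GetPageRequest {
    key: u64, // stand-in for the real page key type
}

struct PageImage(Vec<u8>);

// Stand-in for the timeline's vectored read: resolve many keys in one pass.
fn get_vectored(keys: &[u64]) -> Vec<PageImage> {
    keys.iter().map(|k| PageImage(k.to_le_bytes().to_vec())).collect()
}

// One `get_vectored` call serves the whole batch instead of one read per request.
fn handle_batch(batch: &[GetPageRequest]) -> Vec<PageImage> {
    let keys: Vec<u64> = batch.iter().map(|r| r.key).collect();
    get_vectored(&keys)
}
```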
Testing