Optimize send path error handling #12472

rockwotj · 2023-07-26T16:52:51Z

Travis mentioned in slack that ssx::handle_shutdown_errors was showing up in profiles.

This attempts to address that problem by creating less continuations when dispatching sends in the rpc transport layer. Additionally, switch ssx::handle_shutdown_errors to be a corountine.

Backports Required

Release Notes

none

Create less of a continuation chain by handling all the exceptions in a single coroutine. Signed-off-by: Tyler Rockwood <[email protected]>

rockwotj · 2023-08-08T14:05:16Z

CI Failure was #12291 which is now fixed.

rockwotj · 2023-08-08T14:05:25Z

/ci-repeat 1

rockwotj · 2023-08-08T16:55:31Z

CI Failures:

CI Failure (Timeout on manifest_has_one_segment) in AdjacentSegmentMergingTest.test_reupload_of_local_segments #8457

travisdowns · 2023-08-08T20:04:41Z

src/v/ssx/future-util.h

-      .handle_exception_type([](const seastar::broken_promise&) {})
-      .handle_exception_type([](const seastar::broken_condition_variable&) {});
+    try {
+        co_await std::move(fut);


This seems good since handle_exception_type is relatively expensive whether an exception occurs or not, but we this transformation means we only take the exception matching cost when an exception is actually thrown.

I guess that was the intent of this transformation?

Based on my reading of https://github.com/redpanda-data/seastar/blob/245e0ccfa6d58d7e0dca2b4034ce1bc43e39bdc5/include/seastar/core/future.hh#L1858-L1862

It seems like before we had 5 continuations and 5 try catch blocks, and now we only have 1 of each (I think, I haven't fully wrapped my brain around how a compiler transforms coroutines). That was the intent here to cut down on both of those in a hope to lessen the cost of this function you noticed in the profiles.

travisdowns

LGTM

dotnwat · 2023-08-10T19:42:51Z

src/v/rpc/transport.cc

+    if (!_dispatch_gate.try_enter()) {
+        return;
+    }
+    ssx::background = ssx::ignore_shutdown_exceptions(do_dispatch_send())
+                        .then_wrapped([this](ss::future<> fut) {
+                            if (fut.failed()) {
+                                vlog(
+                                  rpclog.info,
+                                  "Error dispatching socket write:{}",
+                                  fut.get_exception());
+                                _probe->request_error();
+                                fail_outstanding_futures();
+                            }
+                            // Now that we've handled errors it's safe to
+                            // release the gate.
+                            _dispatch_gate.leave();
+                        });


there is a slight preference to use raii gate holder rather than manual enter/leave. you can move that into the then_wrapped continuation as as capture, or dangle it off the end in a finally block capture.

Good suggestion. dispatch_send shouldn't throw, which is why I manually used try_enter and leave, but it seems we still can check if the gate is closed before holding the gate open.

Remove the double dose of ssx::handle_shutdown_exceptions by inlining the gate handling into a single place. Remove the extra overhead of some lambdas and a couple of extra continuations by handling unexpected errors and closing the gate in a single continuation. Signed-off-by: Tyler Rockwood <[email protected]>

vbotbuildovich · 2023-08-15T04:33:52Z

/backport v23.2.x

vbotbuildovich · 2023-08-15T04:33:53Z

/backport v23.1.x

vbotbuildovich · 2023-08-15T04:33:54Z

/backport v22.3.x

vbotbuildovich · 2023-08-15T04:34:46Z

Failed to run cherry-pick command. I executed the commands below:

git checkout -b backport-pr-12472-v22.3.x-824 remotes/upstream/v22.3.x
git cherry-pick -x 9670361db3d5c8a514a1984512300d17fa1122ec 0b3fa8c05e5d4a88b3ba05a47f92ce679c556159

Workflow run logs.

vbotbuildovich · 2023-08-15T04:34:48Z

Failed to run cherry-pick command. I executed the commands below:

git checkout -b backport-pr-12472-v23.1.x-559 remotes/upstream/v23.1.x
git cherry-pick -x 9670361db3d5c8a514a1984512300d17fa1122ec 0b3fa8c05e5d4a88b3ba05a47f92ce679c556159

Workflow run logs.

ssx: Optimize the implementation of handle_shutdown_exceptions

9670361

Create less of a continuation chain by handling all the exceptions in a single coroutine. Signed-off-by: Tyler Rockwood <[email protected]>

github-actions bot added the area/redpanda label Jul 26, 2023

rockwotj requested review from travisdowns, ballard26 and StephanDollberg July 26, 2023 16:53

rockwotj force-pushed the rockwood/optimize-exception-handling branch from 56c9add to 8b5e350 Compare July 26, 2023 16:54

travisdowns reviewed Aug 8, 2023

View reviewed changes

travisdowns previously approved these changes Aug 8, 2023

View reviewed changes

dotnwat reviewed Aug 10, 2023

View reviewed changes

rockwotj dismissed travisdowns’s stale review via 0b3fa8c August 10, 2023 19:57

rockwotj force-pushed the rockwood/optimize-exception-handling branch from 8b5e350 to 0b3fa8c Compare August 10, 2023 19:57

rockwotj requested review from dotnwat and travisdowns August 10, 2023 19:59

dotnwat approved these changes Aug 15, 2023

View reviewed changes

dotnwat merged commit 7ff6bad into redpanda-data:dev Aug 15, 2023
25 checks passed

This was referenced Aug 15, 2023

[v22.3.x] Optimize send path error handling #12788

Closed

[v23.1.x] Optimize send path error handling #12789

Closed

[v23.2.x] Optimize send path error handling #12790

Merged

This was referenced Aug 16, 2023

[v23.1.x] Optimize send path error handling #12859

Closed

[v22.3.x] Optimize send path error handling #12861

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize send path error handling #12472

Optimize send path error handling #12472

rockwotj commented Jul 26, 2023

rockwotj commented Aug 8, 2023

rockwotj commented Aug 8, 2023

rockwotj commented Aug 8, 2023

travisdowns Aug 8, 2023

travisdowns Aug 8, 2023

rockwotj Aug 8, 2023

travisdowns left a comment

dotnwat Aug 10, 2023

rockwotj Aug 10, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

Optimize send path error handling #12472

Optimize send path error handling #12472

Conversation

rockwotj commented Jul 26, 2023

Backports Required

Release Notes

rockwotj commented Aug 8, 2023

rockwotj commented Aug 8, 2023

rockwotj commented Aug 8, 2023

travisdowns Aug 8, 2023

Choose a reason for hiding this comment

travisdowns Aug 8, 2023

Choose a reason for hiding this comment

rockwotj Aug 8, 2023

Choose a reason for hiding this comment

travisdowns left a comment

Choose a reason for hiding this comment

dotnwat Aug 10, 2023

Choose a reason for hiding this comment

rockwotj Aug 10, 2023

Choose a reason for hiding this comment

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023

vbotbuildovich commented Aug 15, 2023