reduce unnecessary tikvServerBusy backoff when able to try next replica #1184
Conversation
Signed-off-by: crazycs520 <[email protected]>
LGTM
internal/locate/region_request.go (outdated diff):
- // the state is changed in accessFollower.next when leader is unavailable.
- func (s *replicaSelector) canFallback2Follower() bool {
+ // canSkipServerIsBusyBackoff returns true if the request can be sent to next replica and can skip ServerIsBusy backoff.
+ func (s *replicaSelector) canSkipServerIsBusyBackoff() bool {
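To illustrate the check being renamed above, here is a minimal, self-contained sketch of the idea. The types (`selectorState`, `replicaSelector`, the field names, and the replica-count bookkeeping) are hypothetical simplifications, not client-go's actual definitions: the ServerIsBusy backoff can be skipped only when the request is not pinned to the known leader and another replica is left to try.

```go
package main

import "fmt"

// Hypothetical, simplified stand-ins for client-go's selector state types.
type selectorState interface{ isState() }

type accessKnownLeader struct{}
type accessFollower struct{}

func (accessKnownLeader) isState() {}
func (accessFollower) isState()    {}

type replicaSelector struct {
	state        selectorState
	targetIdx    int // index of the replica tried last
	replicaCount int
}

// canSkipServerIsBusyBackoff sketches the idea from the diff: skip the
// ServerIsBusy backoff when the request is not pinned to the known leader,
// i.e. there is another replica that can be tried immediately.
func (s *replicaSelector) canSkipServerIsBusyBackoff() bool {
	_, accessingLeader := s.state.(*accessKnownLeader)
	// Leader-only requests must wait out the backoff; follower/stale reads
	// with remaining candidates can retry the next replica right away.
	return !accessingLeader && s.targetIdx < s.replicaCount-1
}

func main() {
	s := &replicaSelector{state: &accessFollower{}, targetIdx: 0, replicaCount: 3}
	fmt.Println(s.canSkipServerIsBusyBackoff()) // a follower replica remains

	s.state = &accessKnownLeader{}
	fmt.Println(s.canSkipServerIsBusyBackoff()) // pinned to leader: must back off
}
```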
Maybe we can reuse this function to handle region-unavailable cases as well and rename it in the future. Not this PR's work.
LGTM, but I'm thinking perhaps a better structure is to decide whether to back off just before the next attempt, instead of right after encountering the error, so that what @zyguan tried to do in #990 becomes a unified mechanism.
Btw I just noticed that this PR and #990 seem to be solving the same problem 🤔 cc @zyguan
/hold
Nice catch. #990 introduces a pending backoff for fast retry, which is a great idea. After discussing with @zyguan, I introduced the pending-backoff idea into this PR and unified the fast-retry logic. Later, we can add more backoffs to the pending set to do fast retry.
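The "pending backoff" idea mentioned above can be sketched roughly as follows. This is a hypothetical simplification (the `retrier`, `pendingBackoff`, and method names are invented for illustration, not client-go's API): instead of sleeping immediately on ServerIsBusy, the backoff is recorded and only applied later, if no replica is left for a fast retry.

```go
package main

import "fmt"

// Hypothetical sketch of the pending-backoff idea from #990: record the
// backoff instead of executing it, so the next replica can be tried first.
type pendingBackoff struct {
	kind string
	ms   int
}

type retrier struct {
	pending *pendingBackoff
}

// onServerIsBusy defers the backoff rather than sleeping right away,
// enabling a fast retry on another replica.
func (r *retrier) onServerIsBusy(ms int) {
	r.pending = &pendingBackoff{kind: "tikvServerBusy", ms: ms}
}

// beforeNextAttempt applies any pending backoff only when no replica is left
// for fast retry; it returns the milliseconds to wait (0 means fast retry).
func (r *retrier) beforeNextAttempt(canFastRetry bool) int {
	if r.pending == nil || canFastRetry {
		return 0
	}
	ms := r.pending.ms
	r.pending = nil
	return ms // a real implementation would sleep here
}

func main() {
	r := &retrier{}
	r.onServerIsBusy(200)
	fmt.Println(r.beforeNextAttempt(true))  // fast retry: no wait
	fmt.Println(r.beforeNextAttempt(false)) // replicas exhausted: apply backoff
}
```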
Rest LGTM
/unhold
// canFastRetry returns true if the request can be sent to next replica.
func (s *replicaSelector) canFastRetry() bool {
	accessLeader, ok := s.state.(*accessKnownLeader)
Should we just do fast retry when tikv_client_read_timeout is configured? It looks a little bit dangerous to use

    if not fast retry
        return false
    return true

as most of the requests would be fast-retried.
Besides, maybe we could consult store slowness information to decide whether fast retry should apply. /cc @zyguan @MyonKeminta
I see that the original logic of canFallback2Follower is to do a fast backoff when retrying the next replica, so the fast backoff does not only apply when tikv_client_read_timeout is configured.
cc @you06
@crazycs520 Is the current logic the same? If so, we could continue first.
close #1166
PR #923 introduced the skip-tikvServerBusy-backoff logic, but it only works for stale reads, and when serverIsBusy.EstimatedWaitMs > 0 it cannot skip. This PR expands the scope of that logic: all non-leader read requests can now skip the tikvServerBusy backoff, even when serverIsBusy.EstimatedWaitMs > 0.
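The before/after behavior described in the PR summary can be sketched as a single predicate. This is a hypothetical simplification for illustration (the function, the `readKind` enum, and the `oldBehavior` flag are invented, not client-go code): the old rule skipped the backoff only for stale reads with no estimated wait, while the expanded rule lets any non-leader read skip it.

```go
package main

import "fmt"

// Hypothetical read kinds, standing in for how the request targets a replica.
type readKind int

const (
	leaderRead readKind = iota
	followerRead
	staleRead
)

// canSkipServerBusyBackoff contrasts the old (#923) and expanded skip rules:
// old: only stale reads, and only when the server reported no estimated wait;
// new: any non-leader read, regardless of estimatedWaitMs.
func canSkipServerBusyBackoff(kind readKind, estimatedWaitMs uint32, oldBehavior bool) bool {
	if oldBehavior {
		return kind == staleRead && estimatedWaitMs == 0
	}
	return kind != leaderRead
}

func main() {
	// A follower read with a reported wait: skipped only under the new rule.
	fmt.Println(canSkipServerBusyBackoff(followerRead, 50, true))
	fmt.Println(canSkipServerBusyBackoff(followerRead, 50, false))
}
```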