-
Notifications
You must be signed in to change notification settings - Fork 727
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
qps drop to zero during pd leader io hang with error code 9001 #6889
Comments
/type bug |
pd client always try to connect the pd server which has been io hang, I will investigate it.
|
pd-1: leader->follower
pd-2: follower->leader
|
Another strange phenomenon, tikv also always chooses pd-1 instead of pd-2, which has become the leader, until the end of the injection delay
|
but pd-0 didn't exit watch loop in time, although etcd client has updated. pd-0: follower->follower
pd client will get old leader and directly return when connected pd-0, so pd client cannot update member normally.
|
close #6889 Signed-off-by: lhy1024 <[email protected]>
Bug Report
What did you do?
1、run tpcc with 1000 warehouse,10 thread
2、inject pd leader io hang last for 10mins
inject fault:2023-08-02 20:13:14
recover fault:2023-08-02 20:18:14
What did you expect to see?
qps can recover whitin 2mins
What did you see instead?
qps drop to zero during pd leader io hang with error code 9001
What version of PD are you using (
pd-server -V
)?./pd-server -V
Release Version: v7.3.0-alpha
Edition: Community
Git Commit Hash: 0c537bb
Git Branch: heads/refs/tags/v7.3.0-alpha
UTC Build Time: 2023-08-01 11:37:27
2023-08-02T18:02:53.544+0800
The text was updated successfully, but these errors were encountered: