[DocDB] New DB API to query master leader for the last RAFT heartbeat time of the followers #18788
Comments
Hi, hope you are doing well! I am Garvit Mohata, an MS CS student at UT Austin. My teammate (Kulin Shah) and I are taking the graduate Distributed Systems course, and as part of the course project we are exploring the opportunity of contributing to the yugabyte open source repository. This issue seems closely related to the content of the course, so we were wondering if we can contribute to solving it. Can you provide more specific expectations and details for this issue, and pointers if possible? Regards,
Hi @mohatagarvit, I'm excited to hear you're exploring contributing to Yugabyte. I think this and the other issue you commented on (#16954) are a bit hard to pick up for new contributors since they require a lot of system context. I recommend browsing issues tagged "Good first issue". Those are ones we explicitly think are self-contained. If you have any more questions, feel free to DM me on the community Slack channel.
…owers

Summary: This diff adds a new RPC to the `MasterAdmin` to get the number of milliseconds since the master leader has successfully processed a consensus update from each of the master followers. The implementation just plumbs through the `last_successful_communication_time` field of the consensus queue up. This is the same field used by leaders to decide the health of a peer and whether to evict a peer (although masters do not evict peer masters). I intend to do a little more cleanup work on the unit tests, but I wanted to get out a diff for review sooner.

Jira: DB-7670

Test Plan:
```
ybd --cxx-test tablet_health_manager-itest --gtest_filter '*GetFollowerUpdateDelay*'
```

Reviewers: asrivastava, rahuldesirazu
Reviewed By: asrivastava
Subscribers: ybase, bogdan, slingam
Differential Revision: https://phorge.dev.yugabyte.com/D30479
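As a rough illustration of the idea in the summary above, the per-follower delay can be computed by subtracting each peer's `last_successful_communication_time` from the current time. The sketch below is not the YugabyteDB implementation: `TrackedPeer`, `FollowerUpdateDelaysMs`, and the use of `std::chrono::steady_clock` are hypothetical stand-ins chosen for the example, and only the field name comes from the commit message.

```cpp
#include <cassert>
#include <chrono>
#include <cstdint>
#include <map>
#include <string>

using Clock = std::chrono::steady_clock;

// Hypothetical stand-in for the per-peer state a leader keeps in its
// consensus queue. Only the field name is taken from the commit message.
struct TrackedPeer {
  Clock::time_point last_successful_communication_time;
};

// Hypothetical helper: for each follower UUID, returns the number of
// milliseconds between its last successful communication and `now`.
std::map<std::string, int64_t> FollowerUpdateDelaysMs(
    const std::map<std::string, TrackedPeer>& peers, Clock::time_point now) {
  std::map<std::string, int64_t> delays;
  for (const auto& entry : peers) {
    const auto elapsed = std::chrono::duration_cast<std::chrono::milliseconds>(
        now - entry.second.last_successful_communication_time);
    delays[entry.first] = static_cast<int64_t>(elapsed.count());
  }
  return delays;
}
```

Passing `now` as a parameter rather than reading the clock inside the helper keeps the sketch deterministic under test. A caller could compare each returned delay against a threshold of its own choosing to decide whether a follower looks unhealthy.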
… of master followers

Summary: Original commit: 2dfc818 / D30479

This diff adds a new RPC to the `MasterAdmin` to get the number of milliseconds since the master leader has successfully processed a consensus update from each of the master followers. The implementation just plumbs through the `last_successful_communication_time` field of the consensus queue up. This is the same field used by leaders to decide the health of a peer and whether to evict a peer (although masters do not evict peer masters). I intend to do a little more cleanup work on the unit tests, but I wanted to get out a diff for review sooner.

Jira: DB-7670

Test Plan:
```
ybd --cxx-test tablet_health_manager-itest --gtest_filter '*GetFollowerUpdateDelay*'
```

Reviewers: asrivastava, rahuldesirazu
Reviewed By: asrivastava
Subscribers: slingam, bogdan, ybase
Tags: #jenkins-ready
Differential Revision: https://phorge.dev.yugabyte.com/D32603
Jira Link: DB-7670
Description
This is needed as part of the following flow related to master replacement