-
Notifications
You must be signed in to change notification settings - Fork 589
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
node wise recovery improvements #15394
Conversation
/dt |
new failures in https://buildkite.com/redpanda/redpanda/builds/42572#018c5c79-699d-4e7f-95f6-317755a710a1:
new failures in https://buildkite.com/redpanda/redpanda/builds/42572#018c5c79-69a4-4294-9b51-0d811376a80f:
|
e8d907b
to
293887a
Compare
293887a
to
4ebcf8a
Compare
/dt |
auto validation_err | ||
= _topics.local().validate_force_reconfigurable_partitions(result); | ||
if (validation_err) { | ||
co_return errc::concurrent_modification_error; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe we can return a validation_err
here ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this was intentional because if a user makes a get_majority_lost_partitions() they don't expect an error unless something concurrently modified the state while processing the request (hence concurrent_modification_error).. this validation_error is only possible in that situation.
just two comments, otherwise looks good |
missed this in the original PR.
This command will replace defunct_nodes_cmd in the subsequent commits.
Adds logic for processing force reconfiguration partitions. Additionally undos most of the defunct_node_cmd state processing.
Additionally renames defunct_node -> dead_node in most APIs to clearup terminology.
.. for node-wise-recovery APIs
4ebcf8a
to
f43c399
Compare
/dt |
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/42670#018c6338-c294-4847-8836-7c0c7258f6b6 |
Some changes from previous
defunct nodes
implementation.defunct nodes
is no longer a thing, will be reimplemented asoffline
nodes in a future PR.API examples:
fetch a list of partitions losing majority - GET http://:9644/v1/partitions/majority_lost?dead_nodes=5,6,7
node wise recovery them - POST http://:9644/v1/partitions/force_recover_from_nodes
example payload.
Backports Required
Release Notes
Features