
Investigate Bookkeeper disruption cases #39

Open

pbelgundi (Contributor) opened this issue May 6, 2020 · 0 comments

Description

We need to investigate and implement an action plan to cover all BookKeeper disruption cases. Some disruption cases that come to mind are:

- Graceful termination. The BK pod receives a TERM signal to shut down gracefully. Situations that cause this include:
  - kubectl drain to remove a node from the K8s cluster.
  - kubectl delete pod to delete a particular BookKeeper pod, possibly accidentally.
  - A scale-down event.
- Unexpected termination. This kind of disruption gives pods no chance to terminate gracefully. Examples are hardware errors, VM deletion, hard evictions, etc.
For graceful terminations, we may want to run a pre-delete hook (a preStop handler in K8s terminology) to make sure that the ledgers stored on that bookie are re-replicated before it shuts down, probably by running BookKeeper's manual recovery process. A sketch of such a hook is shown below.
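A minimal sketch of that hook, assuming the operator builds the bookie container spec with k8s.io/api/core/v1 (the PreStop handler type is corev1.LifecycleHandler in recent versions of that module; older versions call the same type corev1.Handler). The binary path, port, and the use of `bookkeeper shell recover` are illustrative, not a confirmed invocation:

```go
package bookie

import corev1 "k8s.io/api/core/v1"

// withPreStopRecovery attaches a preStop lifecycle hook to the bookie
// container so that, on graceful termination (kubectl drain, kubectl
// delete pod, scale-down), the ledgers held by this bookie are
// re-replicated before the process exits. The exact CLI invocation
// would need to be validated against the BookKeeper version in use.
func withPreStopRecovery(c *corev1.Container) {
	c.Lifecycle = &corev1.Lifecycle{
		PreStop: &corev1.LifecycleHandler{
			Exec: &corev1.ExecAction{
				Command: []string{
					"/bin/sh", "-c",
					// Re-replicate this bookie's ledgers to other bookies.
					"/opt/bookkeeper/bin/bookkeeper shell recover $(hostname -f):3181",
				},
			},
		},
	}
}
```

Since a preStop hook is bounded by the pod's terminationGracePeriodSeconds, that value would have to be raised to accommodate a potentially long re-replication.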

For unexpected terminations, we may want to rely on BookKeeper's autorecovery feature, plus a pod disruption budget that prevents a second graceful pod termination until the terminated pod has been rescheduled and recovered. A sketch of such a budget follows.
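A minimal sketch of that budget, assuming the policy/v1 API (Kubernetes 1.21+; at the time this issue was filed the same object lived in policy/v1beta1). The object name and label selector are illustrative and would need to match whatever labels the operator puts on bookie pods:

```go
package bookie

import (
	policyv1 "k8s.io/api/policy/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/util/intstr"
)

// newBookiePDB returns a PodDisruptionBudget that allows at most one
// bookie pod to be voluntarily evicted at a time, so a node drain cannot
// take down a second bookie while the first is still being rescheduled
// and recovered.
func newBookiePDB(namespace string) *policyv1.PodDisruptionBudget {
	maxUnavailable := intstr.FromInt(1)
	return &policyv1.PodDisruptionBudget{
		ObjectMeta: metav1.ObjectMeta{
			Name:      "bookie-pdb",
			Namespace: namespace,
		},
		Spec: policyv1.PodDisruptionBudgetSpec{
			MaxUnavailable: &maxUnavailable,
			Selector: &metav1.LabelSelector{
				MatchLabels: map[string]string{"app": "bookkeeper"},
			},
		},
	}
}
```

Note that a PDB only guards against voluntary disruptions such as drains and API-initiated evictions; it does not stop a direct kubectl delete pod or an involuntary hardware failure.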

Importance

must-have

Suggestions for an improvement

Data issues caused by a pod being unavailable for more than a certain time period should be handled by AutoRecovery.
Currently BookKeeper autorecovery is enabled by the operator, but its execution frequency, trigger, and efficiency are governed by some configurable properties, which need to be tuned to decide how long we want to tolerate potential data loss before kick-starting the recovery process.
Note that the recovery process has its own execution overhead, so we may want to strike a balance between the risk of data loss and the performance overhead the recovery process would add.
This could also vary from use case to use case and hence should be configurable (see the sketch after the configuration-property list below).

For details of the AutoRecovery process, see: https://bookkeeper.apache.org/docs/4.7.3/admin/autorecovery/#autorecovery

Some configuration properties that impact autorecovery:

- auditorPeriodicBookieCheckInterval - The time interval between auditor bookie checks, in seconds. The auditor bookie check inspects ledger metadata to see which bookies should contain entries for each ledger; if a bookie that should contain entries is unavailable, the affected ledgers are marked for recovery. Defaults to once per day.
- auditorPeriodicCheckInterval - The time interval, in seconds, at which the auditor checks all ledgers in the cluster. Defaults to once per week.
- lostBookieRecoveryDelay - How long to wait, in seconds, before starting autorecovery of a lost bookie. Defaults to 0.
- underreplicatedLedgerRecoveryGracePeriod - The grace period, in seconds, for recovery of underreplicated ledgers. If a ledger stays marked underreplicated for longer than this period, it is reported by the placementPolicyCheck in the Auditor. Setting this to 0 disables the check. Defaults to 0.
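As a sketch of how this per-use-case tuning could be surfaced, assuming bookie server properties can be passed through the operator as a generic string map (the keys are the real BookKeeper properties listed above; the helper, its parameters, and the example values are illustrative, not recommendations):

```go
package bookie

import "strconv"

// autoRecoveryOptions returns bookie server properties that tune how
// aggressively autorecovery reacts to a lost bookie. All values are in
// seconds, matching the BookKeeper properties described above.
func autoRecoveryOptions(bookieCheck, fullLedgerCheck, lostBookieDelay int) map[string]string {
	return map[string]string{
		// How often the auditor checks ledger metadata for missing bookies.
		"auditorPeriodicBookieCheckInterval": strconv.Itoa(bookieCheck),
		// How often the auditor scans all ledgers in the cluster.
		"auditorPeriodicCheckInterval": strconv.Itoa(fullLedgerCheck),
		// Grace period before a lost bookie's ledgers are re-replicated;
		// this is the main "how long do we tolerate under-replication" knob.
		"lostBookieRecoveryDelay": strconv.Itoa(lostBookieDelay),
	}
}

// Example: check for lost bookies hourly, scan all ledgers weekly, and
// wait five minutes before re-replicating, to avoid churn from pods that
// are merely being rescheduled.
var exampleOptions = autoRecoveryOptions(3600, 7*24*3600, 300)
```

The trade-off is the one described above: a small lostBookieRecoveryDelay narrows the under-replication window but triggers possibly unnecessary re-replication whenever a pod is briefly rescheduled.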
