-
Notifications
You must be signed in to change notification settings - Fork 312
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Start replica server failed due to incomplete created RocksDB directory #1450
Comments
This issuse is reported when I try to fix #1383, I injected a write error in the write path of a replica server, the replica server will try to recover (i.e. open new rocksdb instance) replicas after the "injected corrupted" replicas automatically closed. If the server crashed when some rocksdb instances are during creating, the instances maybe incomplete, then reproduce this issue. |
Got another incomplete RocksDB instance and crashed too, logs:
|
…#1451) #1450 If replica server attempt to open an incomplete RocksDB instance (maybe caused by a previous crash), it will crash before moving the incomplete path to ".err" trash path, and it will crash again if restart the server. This patch avoid to crash before moving the incomplete RocksDB path to ".err" path, thus the replica has an opportunity to recovery automatically without move the incomplete RocksDB path manually.
…B instance (apache#1451) 对应社区commit: https://github.com/apache/incubator-pegasus/pull/1451/files 注: 由于单测部分变更较大,本次未合入 apache#1450 If replica server attempt to open an incomplete RocksDB instance (maybe caused by a previous crash), it will crash before moving the incomplete path to ".err" trash path, and it will crash again if restart the server. This patch avoid to crash before moving the incomplete RocksDB path to ".err" path, thus the replica has an opportunity to recovery automatically without move the incomplete RocksDB path manually.
Bug Report
Please answer these questions before submitting your issue. Thanks!
Construct an incomplete RocksDB directory, the column families are not completed before a crash.
Replica server could start normally even if the RocksDB directory is incomplete.
Replica server start failed, the error logs like:
The master branch.
The text was updated successfully, but these errors were encountered: