-
Notifications
You must be signed in to change notification settings - Fork 3.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
coordinator throwing exception trying to load segments #8137
Comments
seems this could be related to #3362 |
is there any workaround for this? |
Hi @pdeva, it looks like a bug, but I have no clue how it could happen. Would you please give some more context? Are you using the The error means the executorService is terminated when |
the reason for #3362 reference is because of the msg "Replicant create queue stuck after 15+ runs". so literally nothing in my cluster has changed. zookeeper is perfectly fine. |
and no i am not using the supervisor script. just starting via |
here is another more detailed exception:
|
I noticed that it could happen once the coordinator loses leadership and gets it back. Do you see this kind of behavior from the coordinator logs? |
we only have 1 coordinator node |
Oh, now I notice it can also happen if one of historical has gone.. |
i restarted the coordinator (previously just the historicals were restarted) and it seems to be loading again... for now.. |
I am able to replicate this issue by running a coordinator and killing historical servers. This looks related to #7088, where some executors are getting shutdown by the load queue peons, but the peons don't own the executors. Investigating a fix. |
i am also begining to see this msg. not sure if related. should i open a different bug for this:
|
Not sure how come it could be related to this issue yet, so yes. Please open another issue for it. |
Affected Version
0.15
Description
started happening all of a sudden. no segments can be loaded in the historical!
also seeing this:
The text was updated successfully, but these errors were encountered: