-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Delay allocation #39
Delay allocation #39
Conversation
00950fa
to
4e9b318
Compare
Hi @jmcarp, |
Good point--how about leaving the routing allocation change in post-start for now, with a |
@axelaris @Infra-Red Any more feedback on this (and this #38) PR? |
@@ -5,6 +5,11 @@ set -e | |||
out=$(mktemp health-XXXXXX) | |||
remaining=<%= p("elasticsearch.health.timeout") %> | |||
|
|||
# Ensure shard allocation is enabled for updates from previous release | |||
# TODO: Deprecate on next release | |||
curl -X PUT -s <%= p('elasticsearch.master_hosts').first %>:9200/_cluster/settings |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems that you missed \
at the line end :-(
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed!
Hi @cnelson, |
b13c921
to
92cb9a6
Compare
Now all is seems to be working ok. |
If you want this to be squashed into a single commit, the github "squash and merge" button should do it. |
Ok, it seems thats working. |
Can we get a new final release built with these improvements? |
Hi @geofffranks, |
As @cnelson described in cloud-gov/product#673 (comment), restarting the cluster can lead to outages for a few reasons:
This patch addresses both issues. We added a post-start script that blocks until the node is listening on 9200--and, for data nodes, until the cluster is healthy. We also optionally increase the node timeout on drain, then restore it on post-deploy, to avoid "shard shuffle". This also means we don't need to rely on shard routing settings to keep the cluster healthy during restarts: elastic/elasticsearch#19739.