-
Notifications
You must be signed in to change notification settings - Fork 24.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Elastic Coordination nodes dead due to heap memory #55618
Comments
Thanks for raising this issue. Do you have any specific steps to reproduce this? Note: This one might be a duplicate of #46116 despite the versions being different. |
Pinging @elastic/es-analytics-geo (:Analytics/Aggregations) |
I'm working on a thread that started with #54758 that should give us better control over memory usage on the coordinating node. I don't expect it to bear fruit super soon though. In the mean time I suggest lowering the size of the composite aggregation and/or lowering the precision on the cardinality agg. |
We had an issue on Master nodes, on 4-29-2020 due to many garbage collection cycles. Our cluster we Red and went into cycles from Red to Yellow to Red for hours. Until it recovered by itself. Seems like we had the same garbage collection issue on co-ordination nodes too. |
I can't really comment on what is up with the master nodes. The stack trace that you linked when creating the issue is a coordinating node issue. Are you sending queries to your master nodes? In general that is ok, but if you are having issues like this with agg reduction running you out of memory then I'd avoid it. |
We have a cluster of 18, with 3 master nodes, 12 data nodes, 1 ingest node and 2 coordination nodes. And your master nodes, don't take any queries. Seems like there is a memory management issue with elastic 7.5.2 |
Your best bet for the master nodes is to get a heap dump and have a look at what is in there and open a new issue when you have an idea what is up there. The issue you've opened here is a coordinating node one and I'm working on it. Just, slowly. |
There has been several improvements in memory management in the latests release. In particular for coordinating nodes, for example #46751 or #54758. I am closing for the time being this issue as there is no action to be done at this moment. Please feel free to report the issue again if it keeps happening. |
Elasticsearch version (
bin/elasticsearch --version
): 7.5.2Plugins installed: []
JVM version (
java -version
):OS version (
uname -a
if on a Unix-like system): CentOSDescription of the problem including expected versus actual behavior: 2 coordination nodes part of elastic search cluster dead with 3 mins apart due to Heap memory
Heap Memory: 16GB
Server Ram: 32GB
We upgraded to 7.5.2, 2 weeks ago. We started with 7.3.2 last year, but we haven't faced any issue like this. We would like to get more information on why this happened.
Steps to reproduce:
Please include a minimal but complete recreation of the problem, including
(e.g.) index creation, mappings, settings, query etc. The easier you make for
us to reproduce it, the more likely that somebody will take the time to look at it.
Provide logs (if relevant):
The text was updated successfully, but these errors were encountered: