
Entries repeatedly logged with sub-second frequency filling up disk space #19164

Closed
krystalcode opened this issue Jun 29, 2016 · 6 comments
Labels: >bug, :Core/Infra/Core

Comments

@krystalcode

krystalcode commented Jun 29, 2016

Elasticsearch version:

5.0.0-alpha3

JVM version:

The docker image inherits from the official java:8-jre image.

OS version:

Official docker image (https://hub.docker.com/_/elasticsearch/, Debian Jessie) running on a Fedora 23 host.

Description of the problem including expected versus actual behavior:

Certain log entries are repeatedly written to the log file at sub-second frequency, filling up the disk, which can trigger shard relocation and possibly node failure. In a matter of hours the log file can grow to several gigabytes. Ironically, one of the entries exhibiting this behaviour is the warning about low disk space. The log records I have seen written at such frequency are given below.

The expected behaviour is for such entries to be logged less frequently. I certainly don't need to be notified twice per second that my disk usage is over 90%; once every 15 or 30 minutes would suffice.

Would it be an option to allow users to configure how often certain entries are logged? That would require the program to know when an entry of the same type was last logged. I am not sure how this could be accomplished; one idea would be to keep the timestamp of the last entry of each type in memory and consult it before a new entry of the same type is logged, as sketched below.
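
A minimal sketch of the idea (a hypothetical helper in Java, not existing Elasticsearch code):

import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Remembers when each message type was last logged and suppresses repeats
// that arrive before the configured interval has passed.
public class ThrottledLogger {
    private final long intervalMillis;
    private final Map<String, Long> lastLogged = new ConcurrentHashMap<>();

    public ThrottledLogger(long intervalMillis) {
        this.intervalMillis = intervalMillis;
    }

    // Logs the message only if this key has not been logged within the interval.
    public void log(String key, String message) {
        long now = System.currentTimeMillis();
        Long previous = lastLogged.get(key);
        if (previous == null || now - previous >= intervalMillis) {
            lastLogged.put(key, now);
            System.out.println(message); // stand-in for the real logger call
        }
    }

    public static void main(String[] args) {
        ThrottledLogger logger = new ThrottledLogger(15 * 60 * 1000); // 15 minutes
        logger.log("high-disk-watermark", "high disk watermark [90%] exceeded ...");
        logger.log("high-disk-watermark", "high disk watermark [90%] exceeded ..."); // suppressed
    }
}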

Steps to reproduce:

  1. Run Elasticsearch on a node with more than 90% disk usage.
  2. Observe the log file; the related entry is written twice per second.

I do not know how to reproduce the second log entry referenced below; I will file a separate issue if it turns out to be a bug.

Provide logs (if relevant):

The following log entries are written up to 7 times per second.

{"log":"[2016-06-27 16:49:55,313][WARN ][cluster.routing.allocation.decider] [Sabretooth] high disk watermark [90%] exceeded on [ExDIO2orQJm6a5XHEvxxgg][Sabretooth][/usr/share/elasticsearch/data/elasticsearch/nodes/0] free: 4.6gb[7.7%], shards will be relocated away from this node\n","stream":"stdout","time":"2016-06-27T16:49:55.314948647Z"}
{"log":"[2016-06-27 16:49:55,314][INFO ][cluster.routing.allocation.decider] [Sabretooth] rerouting shards: [high disk watermark exceeded on one or more nodes]\n","stream":"stdout","time":"2016-06-27T16:49:55.315047615Z"}

The following log entry is written 1 or 2 times per second.

{"log":"[2016-06-29 14:55:30,597][WARN ][cluster.action.shard     ] [Thor] [.monitoring-data-2][0] received shard failed for target shard [[.monitoring-data-2][0], node[zUY_PHA0SPet5YLXwPZKSA], [P], s[INITIALIZING], a[id=0xG_VO4iQYC-9EArmgdU1w], unassigned_info[[reason=ALLOCATION_FAILED], at[2016-06-29T14:55:30.398Z], failed_attempts[14], details[failed recovery, failure RecoveryFailedException[[.monitoring-data-2][0]: Recovery failed from null into {Thor}{zUY_PHA0SPet5YLXwPZKSA}{172.17.0.3}{172.17.0.3:9300}]; nested: IndexShardRecoveryException[failed to recovery from gateway]; nested: EngineCreationFailureException[failed to create engine]; nested: EOFException; ]]], source shard [[.monitoring-data-2][0], node[zUY_PHA0SPet5YLXwPZKSA], [P], s[INITIALIZING], a[id=0xG_VO4iQYC-9EArmgdU1w], unassigned_info[[reason=ALLOCATION_FAILED], at[2016-06-29T14:55:30.398Z], failed_attempts[14], details[failed recovery, failure RecoveryFailedException[[.monitoring-data-2][0]: Recovery failed from null into {Thor}{zUY_PHA0SPet5YLXwPZKSA}{172.17.0.3}{172.17.0.3:9300}]; nested: IndexShardRecoveryException[failed to recovery from gateway]; nested: EngineCreationFailureException[failed to create engine]; nested: EOFException; ]]], message [failed recovery], failure [RecoveryFailedException[[.monitoring-data-2][0]: Recovery failed from null into {Thor}{zUY_PHA0SPet5YLXwPZKSA}{172.17.0.3}{172.17.0.3:9300}]; nested: IndexShardRecoveryException[failed to recovery from gateway]; nested: EngineCreationFailureException[failed to create engine]; nested: EOFException; ]\n","stream":"stdout","time":"2016-06-29T14:55:30.597899825Z"}
@krystalcode changed the title from "Logs infinitely logged filling up disk space" to "Entries repeatedly logged with sub-second frequency filling up disk space" on Jun 29, 2016
@krystalcode
Author

The situation corresponding to the second log entry can be reproduced as described here: #19275

@clintongormley added the >bug, :Exceptions, :Distributed Indexing/Recovery, and Pioneer Program labels and removed the :Distributed Indexing/Recovery label on Jul 6, 2016
@clintongormley
Contributor

I think the disk watermark message was logged every time it tried to assign a shard. Given that the shard is corrupt, this happens very frequently. @dakrone is this the case?

Note: #18467 should have stopped trying to reallocate the shard after five attempts, which would have prevented this excessive logging, I think?

@ywelsch
Contributor

ywelsch commented Jul 6, 2016

The reason that #18467 does not stop reallocating this shard (failed_attempts[14]) is that it's a primary shard. For primary shards we currently force allocation if the only shard copies are on nodes where the deciders say no. The reason for this is to ensure, for example, that a primary is still allocated even if the node is above the low watermark. This does not play nicely with the allocation decider that checks the number of allocation attempts, however. We should rethink how allocation deciders work in the case of primary shard allocation.
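
To illustrate the interaction, here is a simplified sketch (the names are made up, not the actual Elasticsearch allocation classes): a max-retries decider says NO after too many failed attempts, but forced primary allocation bypasses that verdict, so the shard keeps being retried.

import java.util.List;

enum Decision { YES, NO }

interface AllocationDecider {
    Decision canAllocate(int failedAttempts);
}

// Mirrors the retry limit introduced by #18467: say NO once a shard has
// failed to allocate too many times.
class MaxRetryDecider implements AllocationDecider {
    private static final int MAX_RETRIES = 5;

    @Override
    public Decision canAllocate(int failedAttempts) {
        return failedAttempts >= MAX_RETRIES ? Decision.NO : Decision.YES;
    }
}

public class ForcedPrimaryAllocation {
    static boolean shouldAllocate(boolean isPrimary, int failedAttempts,
                                  List<AllocationDecider> deciders) {
        boolean allSayYes = deciders.stream()
                .allMatch(d -> d.canAllocate(failedAttempts) == Decision.YES);
        // Forcing primary allocation ignores the deciders' NO, which is why
        // failed_attempts can climb to 14 despite the retry limit.
        return allSayYes || isPrimary;
    }

    public static void main(String[] args) {
        List<AllocationDecider> deciders = List.of(new MaxRetryDecider());
        System.out.println(shouldAllocate(true, 14, deciders));  // true: forced primary
        System.out.println(shouldAllocate(false, 14, deciders)); // false: retries exhausted
    }
}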

@dakrone
Member

dakrone commented Jul 6, 2016

> I think the disk watermark message was logged every time it tried to assign a shard. @dakrone is this the case?

This message (the exceeded message) should only occur when a new cluster info is retrieved and the node is still over the limit. By default this is every 30 seconds, or whenever a new data node has joined. I just tested this locally and I get the message every 30 seconds, as expected.
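
For reference, the refresh interval is controlled by the cluster.info.update.interval setting; as an illustration (value chosen arbitrarily), lengthening it in elasticsearch.yml would also make the check, and with it the warning, less frequent:

cluster.info.update.interval: 15m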

@krystalcode
Author

@clintongormley Unfortunately, I have not kept these logs. However, from what I remember, the disk watermark message was present without the failed shard recovery message and could be unrelated. The failed shard recovery message appeared later in the log, due to #19275, after a restart. By that time, I had probably cleaned up some disk space and I believe I was not getting the high watermark message any more.

@lcawl added the :Core/Infra/Core label and removed the :Exceptions label on Feb 13, 2018
@jasontedor
Member

We handle disk-full events more gracefully since #25541. I am closing this issue; please feel free to open a new one if an actionable problem persists.
