Unable to remove items from persistent queue storage if the storage is full #7198

swiatekm · 2023-02-14T11:48:40Z

Describe the bug
The persistent queue removes items from storage after they're successfully exported. This removal happens in a transaction which also updates the list of currently dispatched items. Depending on the implementation details of the underlying storage, this transaction may fail if the storage device is full.

As a result, we can take items out of the queue, but they're not actually removed from the storage, and no new items can be put in.
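To illustrate the failure mode, here is a minimal sketch in Python. It is a toy model, not the collector's code: `ToyTransactionalStorage`, `ToyPersistentQueue`, the fixed `journal_overhead`, and the key names are all hypothetical. The only assumption carried over from the description above is that a transaction must persist some data before it can commit, even when its net effect is to free space.

```python
class StorageFullError(Exception):
    """Raised when a write would exceed the hypothetical device capacity."""


class ToyTransactionalStorage:
    """Hypothetical key-value store that journals every batch before applying it."""

    def __init__(self, capacity_bytes, journal_overhead=16):
        self.capacity = capacity_bytes
        self.journal_overhead = journal_overhead  # assumed fixed cost per transaction
        self.data = {}

    def used(self):
        return sum(len(k) + len(v) for k, v in self.data.items())

    def batch(self, sets=None, deletes=()):
        sets = sets or {}
        # The journal must fit into free space before anything is applied,
        # even if the batch would shrink the store once committed.
        journal = self.journal_overhead + sum(len(k) + len(v) for k, v in sets.items())
        if self.used() + journal > self.capacity:
            raise StorageFullError("no space left on device")
        for key in deletes:
            self.data.pop(key, None)
        self.data.update(sets)


class ToyPersistentQueue:
    """Simplified queue: only put and post-export cleanup write to storage."""

    def __init__(self, storage):
        self.storage = storage
        self.write_index = 0
        self.read_index = 0
        self.dispatched = []

    def put(self, payload):
        key = f"item-{self.write_index}"
        self.storage.batch(sets={key: payload})
        self.write_index += 1

    def take(self):
        key = f"item-{self.read_index}"
        self.read_index += 1
        self.dispatched.append(key)
        return key, self.storage.data[key]

    def on_export_success(self, key):
        # One transaction: delete the item and rewrite the dispatched list.
        remaining = [k for k in self.dispatched if k != key]
        self.storage.batch(
            sets={"dispatched": ",".join(remaining).encode()},
            deletes=[key],
        )
        self.dispatched = remaining


def demo(capacity_bytes=64):
    storage = ToyTransactionalStorage(capacity_bytes)
    queue = ToyPersistentQueue(storage)
    queue.put(b"x" * 40)  # nearly fills a 64-byte store
    key, _ = queue.take()

    try:
        queue.on_export_success(key)
        removal_failed = False
    except StorageFullError:
        removal_failed = True

    try:
        queue.put(b"y")
        put_failed = False
    except StorageFullError:
        put_failed = True

    # (removal failed?, new put failed?, exported item still stored?)
    return removal_failed, put_failed, key in storage.data
```

With the default toy capacity, `demo()` reports that the cleanup transaction fails, the exported item stays in storage, and a later `put` fails as well, which is the stuck state described above. With a larger capacity, such as `demo(128)`, the same sequence succeeds.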

Steps to reproduce
See the unit test in the linked PR.

Additional context
I've confirmed that filestorage can behave this way via the following test: open-telemetry/opentelemetry-collector-contrib@dbe3105. I suspect that this will be true of any transactional storage engine, as some amount of transaction data needs to be persisted to disk before it can be committed.

How often this can happen in practice is difficult to estimate. It depends heavily on how the size of queue items aligns with available disk space. Anecdotally, I've seen it happen during an incident, on a volume with multiple queues sharing space.

swiatekm · 2023-02-14T11:50:21Z

CC @djaglowski for the storage part. If anyone with the necessary permissions sees this, please assign the issue to me.

swiatekm · 2023-06-22T10:01:50Z

Fixed in #7396

swiatekm added the bug label Feb 14, 2023

swiatekm mentioned this issue Feb 14, 2023

[exporterhelper] Fix persistent storage behaviour with no available space on device #7199

Closed

djaglowski assigned swiatekm Feb 14, 2023

swiatekm mentioned this issue Mar 20, 2023

[exporterhelper] Fix persistent storage behaviour with no available space on device #7396

Merged

swiatekm mentioned this issue Apr 3, 2023

[exporterhelper] log storage errors with higher severity #7477

Closed

swiatekm closed this as completed Jun 22, 2023
