
sidecar: Handle or do not allow delete_series for Thanos with object store backup #415

Closed
bwplotka opened this issue Jul 5, 2018 · 15 comments


@bwplotka
Member

bwplotka commented Jul 5, 2018

Basically https://github.com/prometheus/prometheus/blob/master/docs/querying/api.md#delete-series is unsafe to run with a Thanos sidecar that uploads blocks to a bucket, because:

  • Prometheus rewrites ALL the blocks (because blocks are immutable). This causes the Thanos sidecar to upload new ones (without removing the old ones), which causes the compactor to halt because of the overlap.
  • Race conditions (a rewrite during a Thanos shipper upload).

It is worth noting that it does not make much sense to run delete_series on a local Prometheus with a Thanos backup anyway, because most likely all blocks are already uploaded by the time Prometheus replaces them with rewritten ones. So the deleted series will only disappear locally, and every older block in the bucket will still contain that series.
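For reference, the call being discussed is the Prometheus TSDB admin endpoint, which is only available when Prometheus is started with the admin API enabled; the host, port, and matcher below are placeholders:

```
# Requires Prometheus to run with --web.enable-admin-api.
# <prometheus-host> and the match[] selector are placeholders.
curl -X POST -g \
  'http://<prometheus-host>:9090/api/v1/admin/tsdb/delete_series?match[]=test_metric{instance="broken-host"}'
```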

Solution:

  • We don't want to replace blocks in the sidecar. We want to keep the whole process coordination-free, so only the compactor is allowed to remove blocks, on a regular basis (as part of compaction).
  • Instead of recording only which ULIDs were uploaded in the Thanos meta, we could also record their time ranges, and ignore or error out on rewritten blocks that have a new ULID but the same time range (see the sketch below).
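A minimal sketch of the check implied by the second bullet, assuming we track uploaded time ranges ourselves; the types and names here are hypothetical, not the actual Thanos shipper code:

```go
package main

import "fmt"

// uploadedRange records the time span of a block that was already shipped.
// Hypothetical type, for illustration only.
type uploadedRange struct {
	MinTime, MaxTime int64 // milliseconds, as in TSDB block meta
}

// overlaps reports whether a freshly seen block covers a time range we have
// already uploaded, which would indicate a delete_series-style rewrite.
func overlaps(uploaded []uploadedRange, minT, maxT int64) bool {
	for _, u := range uploaded {
		if minT < u.MaxTime && maxT > u.MinTime {
			return true
		}
	}
	return false
}

func main() {
	shipped := []uploadedRange{{MinTime: 0, MaxTime: 7200000}}
	// A rewritten block with a new ULID but the same time range is caught here.
	fmt.Println(overlaps(shipped, 0, 7200000)) // true -> skip the upload or error out
}
```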

Thanks [slack]@jean-louis for spotting this!

@dupondje

dupondje commented Jul 6, 2018

Great that we found the root cause :)

Even better (in a dream scenario) would be the ability to do delete_series in Thanos itself, with Thanos also deleting the series in the bucket.

But it might be a good start to let the compactor just delete the new/adjusted blocks on each run.

@bwplotka
Member Author

bwplotka commented Jul 9, 2018

But it might be a good start to let the compactor just delete the new/adjusted blocks on each run.

What do you mean? I think it is much clearer to block this behaviour and be explicit about it (not sure yet how). Basically, if we do this ("let the compactor just delete the new/adjusted blocks on each run"), we have a few problems:

  • Maybe these new blocks are not the result of delete_series but of some other configuration error (e.g. the same external labels)?
  • The user is confident that they deleted these series everywhere, when in fact only the fresh series are deleted.
  • What if the user deleted series before Thanos was able to upload some of the blocks? A really big source of inconsistencies.

All of this makes me think we should do one of:
a. Block or explicitly disallow this logic.
b. Block or explicitly disallow this logic, but implement some batch-job code that will delete series from the whole history (from the bucket) - actually, users can implement that code on their own even now.
c. Integrate delete series with the whole stack (sidecar + compactor) - but I think that is way too complex for the actual value.

Can we first enumerate the use cases for delete series?
I thought the main one is to reduce the amount of unnecessary metrics after they are stored in Prometheus, but with Thanos, space is not that much of a problem.
Second, maybe some "right to be forgotten" rule, but I have never seen that needed for metrics.

Any ideas?

@dupondje

dupondje commented Jul 9, 2018

The reason I deleted a series was that the client was initially configured incorrectly and sent metrics with the wrong labels to Prometheus.
This caused graphs to be broken for that host.

After reconfiguration it was fine, but somehow I need to get rid of the broken samples in the TSDB.

@xjewer
Contributor

xjewer commented Jul 18, 2018

I guess it's a rare case. TSDB blocks are supposed to be immutable, so deleting series from the storage literally means recomputing whole blocks, which could be unwarrantedly expensive.

Bearing in mind this is a seldom case, to solve the right-to-be-forgotten problem it could be a query/store-side decision: somehow preserve/apply rules (possibly limited to the time ranges in which the series existed?) for the series you want deleted, and just skip them at query time.
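A conceptual sketch of that query-time skipping idea, with plain label maps standing in for real label sets; every name here is hypothetical, not an existing Thanos API:

```go
package main

import "fmt"

// denyRule marks a label selector plus the time range in which the unwanted
// series existed. Hypothetical type, only to illustrate query-time skipping.
type denyRule struct {
	Matchers   map[string]string
	MinT, MaxT int64
}

// skip reports whether a series (given as a label map) should be hidden
// for a query covering [qMinT, qMaxT].
func skip(series map[string]string, qMinT, qMaxT int64, rules []denyRule) bool {
	for _, r := range rules {
		if qMaxT < r.MinT || qMinT > r.MaxT {
			continue // the rule's time range does not touch this query window
		}
		matches := true
		for k, v := range r.Matchers {
			if series[k] != v {
				matches = false
				break
			}
		}
		if matches {
			return true
		}
	}
	return false
}

func main() {
	rules := []denyRule{{Matchers: map[string]string{"instance": "broken-host"}, MinT: 0, MaxT: 7200000}}
	series := map[string]string{"__name__": "up", "instance": "broken-host"}
	fmt.Println(skip(series, 0, 3600000, rules)) // true -> drop from the query result
}
```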

@dupondje

@xjewer: it's rare indeed, and it doesn't happen by default.
But at least it should be documented/handled in Thanos when it does happen :) Or there should be a warning in the docs.

@Allex1
Contributor

Allex1 commented Aug 22, 2018

I have a case where deleting series from S3/GCS is really needed.
When running the compactor I get:
level=error ts=2018-08-20T21:55:21.849911888Z caller=main.go:160 msg="running command failed" err="compaction: gather index issues for block /var/lib/thanos-compact/compact/0@{monitor=\"master\",replica=\"1\"}/01CN46CYFR2CQCH567H5Q33007: out-of-order label set {ArchPath=\"MC.BBM.whatever1.cluster.whatever2\",__name__=\"test_metric\",az=\"us-east-1b\",cluster=\"whatever\",env=\"stage\",instance=\"xxx\",job=\"node\",key1=\"0\",key1=\"1\",service=\"xxxx\"} for series 346654"

as someone created some funky metrics which break the compactor process. The only reasonable way I can continue using Thanos now is by deleting these metrics from S3. I cannot afford to delete all the blocks.

@Allex1
Contributor

Allex1 commented Oct 1, 2018

Small update: the "out-of-order label set" error in the compactor was caused by the honor_labels: true option for that specific job. I guess Prometheus label overwrite messes up the way Thanos handles the metrics when compacting.
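For context, this is the kind of scrape configuration involved; the job name and target below are placeholders, not taken from the report above:

```yaml
scrape_configs:
  - job_name: node              # placeholder job name
    honor_labels: true          # keep labels exposed by the target; these can
                                # collide with labels Prometheus would attach
    static_configs:
      - targets: ['exporter.example.com:9100']   # placeholder target
```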

@bwplotka
Member Author

bwplotka commented Oct 1, 2018

Cool, have you opened an issue on the Prometheus GitHub?

Let's figure out how to unblock you. I think all of those issues, like the out-of-order label set, are nice to know about and super important to solve, but fixing them is generally not necessary for the overall Thanos compaction. It seems like the solution here would be to just alert on this (produce a metric, log it) but continue the compaction, as it's critical for system health. I will try to find some time to produce a fix for this.

Not sure if delete series helps here, as you would have something continuously producing broken series (: so you would end up deleting these series every 3h (:

@antonio
Contributor

antonio commented Dec 18, 2018

Can we first enumerate the use cases for delete series?
I thought the main one is to reduce the amount of unnecessary metrics after they are stored in Prometheus, but with Thanos, space is not that much of a problem.

This is generally true, but there is another dimension to consider: memory. We have recently been experiencing problems with the thanos store processes being OOMkilled. The underlying cause is the number of series we have in the object bucket, which is significant as a result of a long period of time in which we (inadvertently) had been uploading metrics with high-cardinality dimensions. Now that we have identified the issue, we'd like to be able to remove those metrics from the storage in order to reduce the memory footprint of the store processes, but alas, I haven't found a way to do so yet that does not involve removing most of the blocks.

@bwplotka
Member Author

Ok, so:
Use cases

- Reduce cardinality once it has happened and the root cause is fixed.
- Potentially, a "right to be forgotten" rule?

We have a couple of options:

  • Tombstones. This is what Prometheus uses for blocks. Tombstones are consulted when querying to exclude certain series, but also during compaction: the compactor understands them and removes the series for the given time ranges. This is faster than rewriting the whole block on request, but what if there are more compactions for the block? (it's the biggest one). It also does not help immediately with block size; you need to wait for compaction to sort it out... (see the conceptual sketch after this list)
  • Rewrite of the whole block. This is a time- and resource-consuming process, but doable. The question is where: the compactor? A CLI tool? If a tool, then how do we sync it with the compactor?
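A conceptual sketch of the tombstone idea, i.e. how a compaction-like rewrite could drop tombstoned intervals; this is not the actual Prometheus TSDB tombstone format or API, just an illustration:

```go
package main

import "fmt"

// tombstone marks an interval of one series (by reference) as deleted.
// Illustrative only; the real TSDB tombstone encoding differs.
type tombstone struct {
	SeriesRef  uint64
	MinT, MaxT int64
}

// dropDeleted filters the sample timestamps of one series during a
// compaction-like rewrite, skipping timestamps covered by a tombstone.
func dropDeleted(ref uint64, timestamps []int64, tombs []tombstone) []int64 {
	kept := make([]int64, 0, len(timestamps))
	for _, ts := range timestamps {
		deleted := false
		for _, t := range tombs {
			if t.SeriesRef == ref && ts >= t.MinT && ts <= t.MaxT {
				deleted = true
				break
			}
		}
		if !deleted {
			kept = append(kept, ts)
		}
	}
	return kept
}

func main() {
	tombs := []tombstone{{SeriesRef: 346654, MinT: 1000, MaxT: 2000}}
	fmt.Println(dropDeleted(346654, []int64{500, 1500, 2500}, tombs)) // [500 2500]
}
```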

@rafaeljesus

Can we first enumerate the use cases for delete series?
I thought the main one is to reduce the amount of unnecessary metrics after they are stored in Prometheus, but with Thanos, space is not that much of a problem.

This is generally true, but there is another dimension to consider: memory. We have recently been experiencing problems with the thanos store processes being OOMkilled. The underlying cause is the number of series we have in the object bucket, which is significant as a result of a long period of time in which we (inadvertently) had been uploading metrics with high-cardinality dimensions. Now that we have identified the issue, we'd like to be able to remove those metrics from the storage in order to reduce the memory footprint of the store processes, but alas, I haven't found a way to do so yet that does not involve removing most of the blocks.

I have the same scenario; I found metrics with high cardinality and need to remove them because:

  • The store is going down due to the amount of data it fetches from S3.
  • Query memory goes pretty high due to the amount of data the store retrieves.

AFAIK the only solution would be either scaling up Thanos resources or deleting data from S3?

@bwplotka
Member Author

Related to #903

@caarlos0

Is it possible to just delete entire TSDB blocks?

For me, the first blocks had a lot of metrics with high cardinality, and if deleting those metrics is not possible, I would be happy to be able to delete entire blocks instead.

@bwplotka
Member Author

bwplotka commented Oct 4, 2019

@caarlos0 you can do that just fine - but better to turn off the compactor temporarily while you do it. The concern is that the compactor might be compacting that very block into a bigger block, in which case the deletion would not help.
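For example, with the AWS CLI (bucket name and block ULID are placeholders), after stopping the compactor, removing the block's directory removes the whole block:

```
# <bucket> and <block-ULID> are placeholders for your bucket and the block's directory.
aws s3 rm --recursive s3://<bucket>/<block-ULID>/
```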

@stale

stale bot commented Jan 28, 2020

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on its status; otherwise the issue will be closed in a week. Thank you for your contributions.
