
Allow setting retention per metric (e.g rule aggregation) #903

Open

bwplotka opened this issue Mar 11, 2019 · 59 comments

Comments

@bwplotka
Member

In an ideal world, retention would not be set at the downsampling/raw level, but at the aggregation level.

We need a way to support this in an LTS system like Thanos.

AC:

  • User can specify certain aggregation to be retained longer than others
  • Design doc (proposal is in place)

It comes down to the fact that you ideally want per-metric retention on the compactor. This is somewhat related to delete_series, as it might involve a block rewrite in edge cases... We need to design this.

Thoughts @improbable-ludwik @brancz @devnev @domgreen

@bwplotka changed the title from "Per rule aggregation retention" to "Allow setting retention per rule aggregation" on Mar 11, 2019
@brancz
Member

brancz commented Mar 11, 2019

There have been discussions about per time-series retention in upstream Prometheus before. I think at least having a discussion with the team is worth it, just to see whether there are any insights from back then.

proposal is in place

What do you mean by this? As far as I can tell, there is no design written up anywhere, but I may very well have missed it.

Off the top of my head, this could be a configuration that combines a Prometheus-style label selector with a corresponding rule for which resolution to keep for how long.

As a whole this is definitely not trivial, but I agree it is much needed.
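A minimal sketch of what such a selector-plus-retention configuration could look like (a hypothetical format for discussion only, not an existing Thanos option; the selectors and durations are just examples):

retention_policies:
  - selector: '{__name__=~"team:.*:rate5m"}'   # recording-rule aggregations, kept long
    raw: 30d
    5m: 1y
    1h: 5y
  - selector: '{job="node-exporter"}'           # raw machine metrics, kept short
    raw: 14d
    5m: 90d
    1h: 1y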

@bwplotka
Member Author

bwplotka commented Mar 14, 2019

Sorry - no, a proposal has to be put in place; that's what I meant.

(: A relabel-like config makes sense, but essentially we are talking about a block rewrite in the compactor for this, right?

@brancz
Member

brancz commented Mar 15, 2019

Configuration is a technicality. I'm not entirely sure relabelling would work exactly, but something close to it probably would. I agree the compactor is the component that needs to take care of this by re-writing blocks.

@ipstatic
Contributor

With the federation system we have in place now, we have trained users that metrics they want preserved (federated) must use a specific recording-rule-style name so that they get federated. We would love to continue this practice with Thanos and retain only metrics that match a specific format for an extended period of time.

@bwplotka
Member Author

bwplotka commented Apr 1, 2019

Some offline discussions revealed that users still do this with Thanos, but using federation + Thanos on top.

The point was to shard ingestion and execute recording rules against each sharded ingestion gateway (which runs with minimal retention). We have some very high-cardinality metrics, and centralized ingestion alone was prohibitively expensive. That Prometheus federation layer allows us to compute/ingest aggregates (without retaining the raw metrics).

I think we should aim to let users avoid this setup. I cannot see an immediate blocker for that, other than a more complex system and queries fetching data with some lag (rule evaluation lag + federated scrape interval).
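For reference, the federation-based workaround described above typically looks something like the following on the central Prometheus (a sketch only; the job name, targets, and the job:.* recording-rule naming convention are just examples):

scrape_configs:
  - job_name: federate-aggregates
    honor_labels: true
    metrics_path: /federate
    params:
      'match[]':
        - '{__name__=~"job:.*"}'   # pull only aggregated recording-rule results
    static_configs:
      - targets:
          - shard-1-prometheus:9090
          - shard-2-prometheus:9090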

@smalldirector

I just saw these discussion threads on the potential requirement for per-metric retention in the compactor. Are we still looking for this feature on the compactor side?

We have the same per-metric retention requirement in our business scenario. We are trying to implement a policy-based retention function to replace the compactor's current retention function. The idea is to provide a policy config file in which users specify a PromQL-style selector to define the retention time for certain metrics. Below is a sample policy config file:

policies:
  - expr: "{}"
    retentions:
      res-raw: 180d
      res-5m: 240d
      res-1h: 400d
  - expr: "{__name__=\"^go_memstats_.*\"}"
    retentions:
      res-raw: 90d
      res-5m: 180d
      res-1h: 360d
  - expr: "{__name__=\"go_memstats_gc_cpu_fraction\"}"
    retentions:
      res-raw: 200d
      res-5m: 300d
      res-1h: 400d

I would like to know your thoughts on this idea.
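A rough sketch of how such a policy file could be evaluated when deciding a series' retention at a given resolution. This is illustrative only and not an existing Thanos API; it uses the Prometheus label and PromQL-parser packages (import paths as in recent Prometheus releases), and it assumes a simple "last matching policy wins" rule, which the proposal above does not define:

package main

import (
	"fmt"
	"time"

	"github.com/prometheus/prometheus/model/labels"
	"github.com/prometheus/prometheus/promql/parser"
)

// RetentionPolicy mirrors one entry of the proposed policy file.
type RetentionPolicy struct {
	Expr       string                   // PromQL-style series selector.
	Retentions map[string]time.Duration // Per-resolution retention, keyed by "res-raw", "res-5m", "res-1h".
}

// retentionFor returns the retention for a series at the given resolution.
// The last matching policy wins (illustrative precedence rule).
func retentionFor(lset labels.Labels, res string, policies []RetentionPolicy) (time.Duration, error) {
	var out time.Duration
	for _, p := range policies {
		matchers, err := parser.ParseMetricSelector(p.Expr)
		if err != nil {
			return 0, fmt.Errorf("parse selector %q: %w", p.Expr, err)
		}
		matches := true
		for _, m := range matchers {
			if !m.Matches(lset.Get(m.Name)) {
				matches = false
				break
			}
		}
		if matches {
			if d, ok := p.Retentions[res]; ok {
				out = d
			}
		}
	}
	return out, nil
}

func main() {
	day := 24 * time.Hour
	policies := []RetentionPolicy{
		// Catch-all default ("{}" is not a valid PromQL selector, so a match-anything regex is used here).
		{Expr: `{__name__=~".+"}`, Retentions: map[string]time.Duration{"res-raw": 180 * day}},
		{Expr: `{__name__=~"go_memstats_.*"}`, Retentions: map[string]time.Duration{"res-raw": 90 * day}},
	}
	lset := labels.FromStrings("__name__", "go_memstats_alloc_bytes", "job", "app")
	d, err := retentionFor(lset, "res-raw", policies)
	fmt.Println(d, err) // 2160h0m0s <nil> -> 90 days under these example policies
}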

@wogri

wogri commented Nov 6, 2019

Just an FYI: not retaining raw data will lead to problems (you won't be able to zoom into your metrics anymore): https://thanos.io/components/compact.md/#downsampling-resolution-and-retention:

In other words, if you set --retention.resolution-raw less than --retention.resolution-5m and --retention.resolution-1h - you might run into a problem of not being able to “zoom in” to your historical data.
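For example, keeping raw data at least as long as the downsampled resolutions avoids that zoom-in problem (the retention values and bucket config filename here are only illustrative):

thanos compact \
  --objstore.config-file=bucket.yaml \
  --retention.resolution-raw=365d \
  --retention.resolution-5m=365d \
  --retention.resolution-1h=365d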

@Reamer

Reamer commented Nov 6, 2019

@wogri I think you are using Grafana for visualization. Check out PR grafana/grafana#19121. At the moment this PR is not in a Grafana release, so I am using master.
I added three Prometheus datasources, each with a different max_source_resolution parameter. Now I can switch to the best resolution and zoom into my metrics. I also disabled auto-downsampling in the Thanos Query component, because it doesn't work well.
If you are using rates, you should make your interval flexible, because at a one-hour resolution you should set the time range to at least two hours.
I think I will write a Thanos PR with some more explanation once Grafana is released with the above PR.
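To illustrate the interval point (the metric name is only an example): against 1h-resolution downsampled data, a rate needs a range wide enough to cover at least two samples, e.g.

rate(http_requests_total[2h])

whereas rate(http_requests_total[5m]) would return no data at that resolution.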

@wogri

wogri commented Nov 6, 2019

Thanks @Reamer!

@smalldirector

(Quotes the policy-based retention proposal from their earlier comment above.)

@bwplotka any thoughts on this idea of supporting per-metric retention in the compactor?

@bwplotka changed the title from "Allow setting retention per rule aggregation" to "Allow setting retention per metric (e.g rule aggregation)" on Dec 10, 2019
@bwplotka
Member Author

Extra context can be found here: prometheus/prometheus#1381

@stale

stale bot commented Jan 11, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Jan 11, 2020
@brancz
Member

brancz commented Jan 13, 2020

This is being worked on in Prometheus; once done there, I would say we can implement it here with the same semantics and configuration.

@stale stale bot removed the stale label Jan 13, 2020
@bwplotka bwplotka removed the pinned label Jan 28, 2020
@bwplotka
Member Author

The current plan is to first tackle #1598 and then try to implement this. We are also putting this issue up as a GSoC project.

> This is being worked on in Prometheus; once done there, I would say we can implement it here with the same semantics and configuration.

The current decision is that Prometheus will not implement this, and the work has to be done externally first. It would be nice, though, if our work could be reused for vanilla Prometheus as well (as usual).

@Jigar3

Jigar3 commented Jan 29, 2020

Hey @bwplotka, I would like to work on this, and it would be very helpful if you could suggest some resources to get started.

@stale

stale bot commented Feb 28, 2020

This issue/PR has been automatically marked as stale because it has not had recent activity. Please comment on status otherwise the issue will be closed in a week. Thank you for your contributions.

@stale stale bot added the stale label Feb 28, 2020
@bwplotka
Member Author

bwplotka commented Feb 28, 2020 via email

@stale stale bot removed the stale label Feb 28, 2020
@stale

stale bot commented Jul 10, 2021

Hello 👋 Looks like there was no activity on this issue for the last two months.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

@stale stale bot added the stale label Jul 10, 2021
@doug-ba

doug-ba commented Jul 10, 2021

I’m still interested in this feature.

@stale stale bot removed the stale label Jul 10, 2021
@yeya24
Contributor

yeya24 commented Jul 13, 2021

I am still interested in this as well. With the new bucket rewrite tool and the compactv2 library, I think this is already doable.
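Roughly, deleting specific series from a block with that tooling looks like this today (the block ID, filenames, and matcher are only examples; check thanos tools bucket rewrite --help for the exact flags and the deletion-config schema):

thanos tools bucket rewrite \
  --objstore.config-file=bucket.yaml \
  --id=<block-ulid> \
  --rewrite.to-delete-config-file=delete.yaml

where delete.yaml lists matchers for the series to drop, e.g.:

- matchers: '{__name__=~"go_memstats_.*"}'

Per-metric retention would then amount to running such rewrites automatically for blocks older than each policy's threshold.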

@stale

stale bot commented Sep 19, 2021

Hello 👋 Looks like there was no activity on this issue for the last two months.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

@stale stale bot added the stale label Sep 19, 2021
@markmsmith

Still needed.

@stale stale bot removed the stale label Sep 19, 2021
@stale

stale bot commented Jan 9, 2022

Hello 👋 Looks like there was no activity on this issue for the last two months.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

@stale stale bot added the stale label Jan 9, 2022
@doug-ba

doug-ba commented Jan 9, 2022

Still needed

@stale stale bot removed the stale label Jan 9, 2022
@stale

stale bot commented Apr 17, 2022

Hello 👋 Looks like there was no activity on this issue for the last two months.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

@stale stale bot added the stale label Apr 17, 2022
@markmsmith

Still needed.

@stale

stale bot commented Sep 21, 2022

Hello 👋 Looks like there was no activity on this issue for the last two months.
Do you mind updating us on the status? Is this still reproducible or needed? If yes, just comment on this PR or push a commit. Thanks! 🤗
If there will be no activity in the next two weeks, this issue will be closed (we can always reopen an issue if we need!). Alternatively, use remind command if you wish to be reminded at some point in future.

@stale stale bot added the stale label Sep 21, 2022
@markmsmith

Still needed :(

@DanielCastronovo

Still needed

@aamargant

Any news about this feature? @bwplotka @csmarchbanks

@kamelohr

kamelohr commented Nov 9, 2023

Chiming in here, as I also have a strong need for this feature for a project.

@NotAFile
Contributor

NotAFile commented Apr 8, 2024

Does the narrower "multi-tenant compactor" case, or more generally matching only on external labels, fall under this too?

As I understand it, while similar, it would require less invasive changes, as each tenant has its own separate TSDB and blocks. With that, it would be possible to take only the external labels in ThanosMeta and look up a deletion policy based on them, right?

I was wondering whether this was actually already possible; at first glance the docs suggested there was no way to compact only specific external labels. There is, the documentation is just a bit confusing. However, you still need one compact instance per retention policy.
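A sketch of what that looks like in practice, with one compactor per retention policy selected via external labels (the tenant label name and retention values are only examples; see thanos compact --help for the exact selector flags):

# Compactor handling blocks with external label tenant="team-a" (longer retention).
thanos compact \
  --objstore.config-file=bucket.yaml \
  --retention.resolution-raw=30d \
  --retention.resolution-5m=180d \
  --retention.resolution-1h=2y \
  --selector.relabel-config='
    - action: keep
      source_labels: [tenant]
      regex: team-a'

# A second compactor with shorter retention for everything else.
thanos compact \
  --objstore.config-file=bucket.yaml \
  --retention.resolution-raw=14d \
  --retention.resolution-5m=30d \
  --retention.resolution-1h=90d \
  --selector.relabel-config='
    - action: drop
      source_labels: [tenant]
      regex: team-a'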

@jahknem

jahknem commented May 13, 2024

Still needed. I guess the work that was to be started as part of GSoC has not been finished? Is there maybe some branch or fork in which work on this has begun?
