This repository has been archived by the owner on Aug 13, 2019. It is now read-only.

Implement vertical query merging and compaction #90

Closed

fabxc opened this issue Jun 6, 2017 · 12 comments

Comments

@fabxc
Contributor

fabxc commented Jun 6, 2017

We should add support to query and compact time-overlapping blocks. Probably no need for sophisticated handling of overlapping chunks.

This is a rather complex one and we have to discuss specifics. But once done, it will make our lives a lot easier for restoring from backups, etc.

@gouthamve
Collaborator

This should be relatively straightforward from what I can see. For us, BlockReader is an interface, and we need to make sure the overlapping blocks turn out to be a single block behind that interface.

What we essentially want is, I hope, clear from the following pictures:
[Screenshots: overlapping blocks being merged into a single block.]

Now if the same series exists in overlapping blocks, we want to merge the data; but if there are samples with the same timestamp and different values, we just drop the "second" one (arbitrarily choosing one). This is okay, because such a conflict already violates the TSDB invariants.

/cc @krasi-georgiev @fabxc

PS: This might take some extra memory, but that is okay given that we don't want to recommend this as a regular workflow.
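
A minimal sketch of the per-series merge described above, using illustrative types rather than the actual tsdb interfaces (a real implementation would operate on chunks behind BlockReader): samples from two overlapping blocks are assumed sorted by timestamp, and on a timestamp collision the sample from the "first" block arbitrarily wins.

```go
package main

import "fmt"

// sample is an illustrative stand-in for a (timestamp, value) pair.
type sample struct {
	t int64
	v float64
}

// mergeSamples merges two timestamp-sorted sample slices from overlapping
// blocks. On a timestamp collision the sample from `a` (the "first" block)
// wins and the conflicting sample from `b` is dropped, as proposed above.
func mergeSamples(a, b []sample) []sample {
	out := make([]sample, 0, len(a)+len(b))
	i, j := 0, 0
	for i < len(a) && j < len(b) {
		switch {
		case a[i].t < b[j].t:
			out = append(out, a[i])
			i++
		case a[i].t > b[j].t:
			out = append(out, b[j])
			j++
		default: // same timestamp: keep a's sample, drop b's.
			out = append(out, a[i])
			i++
			j++
		}
	}
	out = append(out, a[i:]...)
	return append(out, b[j:]...)
}

func main() {
	blockA := []sample{{t: 10, v: 1}, {t: 20, v: 2}, {t: 30, v: 3}}
	blockB := []sample{{t: 20, v: 9}, {t: 25, v: 4}, {t: 40, v: 5}}
	fmt.Println(mergeSamples(blockA, blockB))
	// [{10 1} {20 2} {25 4} {30 3} {40 5}]
}
```

The same two-pointer walk generalizes to more than two overlapping blocks by merging pairwise or with a small heap.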

@gouthamve
Collaborator

From IRC, @brian-brazil on the semantics:

bulk inserted data wins or the insert fails.

We can do the same thing mentioned above, but when inserting data using promtool, we delete the data in the existing blocks that overlaps the inserted range. That is a little more work, but we can start with the proposal above and then implement the deletes when integrating with promtool.
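
A rough sketch of the bookkeeping such a promtool flow would need, assuming the bulk-inserted block covers one contiguous [mint, maxt) range; the types here are illustrative, not the tsdb API. It computes, per existing block, the sub-range that would have to be deleted before the insert so that the bulk-inserted data wins:

```go
package main

import "fmt"

// timeRange is an illustrative [mint, maxt) block time range in milliseconds.
type timeRange struct {
	mint, maxt int64
}

// overlapsToDelete returns, for each existing block range that overlaps the
// incoming bulk-insert range, the sub-range that would have to be deleted
// from that block before the insert, so the bulk-inserted data wins.
func overlapsToDelete(existing []timeRange, incoming timeRange) []timeRange {
	var dels []timeRange
	for _, b := range existing {
		if b.maxt <= incoming.mint || b.mint >= incoming.maxt {
			continue // this block does not overlap the insert
		}
		dels = append(dels, timeRange{
			mint: max64(b.mint, incoming.mint),
			maxt: min64(b.maxt, incoming.maxt),
		})
	}
	return dels
}

func max64(a, b int64) int64 {
	if a > b {
		return a
	}
	return b
}

func min64(a, b int64) int64 {
	if a < b {
		return a
	}
	return b
}

func main() {
	existing := []timeRange{{0, 100}, {100, 200}, {200, 300}}
	incoming := timeRange{150, 250}
	fmt.Println(overlapsToDelete(existing, incoming))
	// [{150 200} {200 250}]
}
```

The actual deletes would presumably go through tsdb's existing deletion path (tombstones), which is why this is a bit more work than the merge-and-drop proposal above.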

@bwplotka
Contributor

bwplotka commented May 8, 2018

I can see on IRC that the main question about this issue is "why do we need that"; can we enumerate the use cases before starting to design this?

For me this issue is also about the assumptions the compaction makes. Basically, I would love to have the compactor be resilient to eventual storage write-read consistency. Why? Because so far the lack of support for it produced a major release issue for 2.2.0, as well as major compaction issues in Thanos a month ago.

Basically:

  • if the compactor sees a block for T and one for T-2, it automatically assumes that T-1 is missing for sure.
  • it compacts T and T-2 together. Now when T-1 comes back we have a broken state.
    As a result, querying is utterly broken (sometimes showing <T, T-2> or things like that) and further compactions drop data (most likely that T-1 block).

Making the compactor aware that overlapping blocks are a thing will basically reduce the damage radius if any of these bugs happen again.
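
A sketch of what such an overlap awareness check could look like: sort the block metadata by minimum time and flag any block that starts before the furthest end time seen so far, rather than assuming a gap means a block never existed. The blockMeta type and function here are illustrative, not the tsdb API.

```go
package main

import (
	"fmt"
	"sort"
)

// blockMeta is an illustrative stand-in for a block's time-range metadata.
type blockMeta struct {
	name       string
	mint, maxt int64 // [mint, maxt)
}

// overlappingBlocks reports pairs of blocks whose time ranges overlap.
// A compactor running this before planning can refuse or specially handle
// vertical compaction instead of silently producing broken blocks.
func overlappingBlocks(metas []blockMeta) [][2]blockMeta {
	if len(metas) < 2 {
		return nil
	}
	sorted := append([]blockMeta(nil), metas...)
	sort.Slice(sorted, func(i, j int) bool { return sorted[i].mint < sorted[j].mint })

	var overlaps [][2]blockMeta
	// Track the block reaching furthest to the right so far, so a long block
	// is compared against every later block whose range it covers.
	widest := sorted[0]
	for _, m := range sorted[1:] {
		if m.mint < widest.maxt {
			overlaps = append(overlaps, [2]blockMeta{widest, m})
		}
		if m.maxt > widest.maxt {
			widest = m
		}
	}
	return overlaps
}

func main() {
	// The broken state described above: T-2 and T were compacted together
	// while T-1 was temporarily invisible, and then T-1 reappeared.
	metas := []blockMeta{
		{name: "compacted(T-2,T)", mint: 0, maxt: 300},
		{name: "T-1", mint: 100, maxt: 200},
	}
	for _, o := range overlappingBlocks(metas) {
		fmt.Printf("%s overlaps %s\n", o[0].name, o[1].name)
	}
	// Output: compacted(T-2,T) overlaps T-1
}
```

With such a check in place, the planner could defer or isolate compaction across the flagged ranges instead of silently producing a block that hides T-1.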

@gouthamve can you explain more how bulk insert can result in overlapping (same) data? (I think I am missing the context on how exactly bulk insert is planned to be done.)

I would also keep in mind that TSDB is a library, and while it is mainly used by Prometheus, others use it as well. For Thanos this feature would be extremely valuable, because:

So yeah, it is needed for Thanos for sure; any other use cases? The issue was created in Jun 2017, so @fabxc had certain use cases in mind for sure (:

@krasi-georgiev
Contributor

I can work on this once we define the why/what/how.
@gouthamve, waiting for your input 🙏

@krasi-georgiev
Contributor

krasi-georgiev commented May 16, 2018

just watched @gouthamve's video, which adds another use case - back-filling:
https://youtu.be/0UvKEHFNu4Q?t=1219

@fabxc
Contributor Author

fabxc commented May 16, 2018 via email

I'd think in general handling this at query time first will be by far sufficient. Compaction adds lots of attack surface for data loss and regressions, and we don't have a use case where we'd be dealing with high fragmentation.

@bwplotka
Contributor

bwplotka commented May 16, 2018

Yeah, but we need to make sure compaction does not add regressions when you put in overlapping blocks. Currently, with the overlap check we added, we just block the compaction flow. It is not safe to simply unblock compaction in its current form in the case of overlapping blocks.

@bwplotka
Contributor

Any ideas for this? It is blocking: thanos-io/thanos#348

@codesome
Contributor

I would like to work on this and will come up with ideas soon.
This would also help implement bulk loading (#24, prometheus/prometheus#535).

codesome mentioned this issue Aug 31, 2018
@krasi-georgiev
Contributor

great, I will find time to review it.

don't forget to sync with @gouthamve and @fabxc, the tsdb gurus, for the best design approach 😜

@codesome
Contributor

@fabxc @gouthamve
Are we looking at only vertical query merging? With bulk import I feel vertical compaction would be helpful.

@bwplotka
Contributor

Again, IMO vertical compaction is a must-have (: It is needed for bulk import if you want to import data further in the past, but also for more resilient compaction overall.

Plus thanos-io/thanos#348
