[WIP] *: Wire up compaction picking with L0Sublevels, enable flush splits #642
Conversation
"existing compaction picker"
This isn't quite the existing compaction picker, since it includes a number of heuristic changes from #563 that complement, but aren't implied by, sub-levels, including how other levels are picked.
I did run without these changes, but I don't have a clean record of how badly those runs went, since after adding the heuristics here I made further changes, like splitting flushes, and some algorithmic improvements to L0SubLevels (such as using knowledge of which files in Lbase were already compacting). It may be worth doing a run with L0SubLevels and a trivial integration with compactionPickerByScore to see what happens, and then using that to justify the heuristics here -- this is the most debatable part of all the changes, and there are multiple magic constants.
Ideally we should try a different stressful workload in addition to the TPCC import -- that may help with tweaking the heuristics, including changing the magic constants. I'm ok with doing that in future PRs, but I am not the one you need to convince :)
This change adds methods to L0SubLevels to help pick, score, and generate L0 -> LBase and L0 -> L0 compactions, based on information captured in the data structure about L0 sublevels. These functions will be called from compaction.go and compaction_picker.go in a future change. Also adds associated datadriven unit tests and a benchmark. Covers a large part of cockroachdb#563. Thanks to Sumeer for his work; most of this was written by him.
This change connects the compaction picking methods in L0SubLevels with the existing compaction picker, replacing the existing logic for picking L0 compactions. It also enables flush splits, and adds a pebble Option (FlushSplitBytes) to set the threshold for the number of bytes that must be observed in L0 files since the last split key before an interval boundary is added as a flush split key.
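As a minimal usage sketch (not part of this PR's diff), the new option might be set when opening a database roughly as below; the directory name and the 4 MB threshold are arbitrary example values, not recommendations from this PR.

```go
package main

import "github.com/cockroachdb/pebble"

func main() {
	// Sketch of configuring the FlushSplitBytes option described above.
	// 4 MB is an arbitrary example threshold.
	opts := &pebble.Options{
		FlushSplitBytes: 4 << 20, // add a flush split key roughly every 4 MB of observed L0 bytes
	}
	db, err := pebble.Open("demo-db", opts)
	if err != nil {
		panic(err)
	}
	defer db.Close()
}
```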
Force-pushed from a6beafb to ab33764
compaction_picker.go, line 324 at r2 (raw file):
func (p *compactionPickerByScore) initL0Score(inProgressCompactions []compactionInfo) {
	if p.vers.L0SubLevels != nil {
Will this ever be nil? This was a temporary integration hack for my experiment -- the "else" code should be deletable.
compaction_picker.go, line 509 at r2 (raw file):
// compaction -- see the comment in the if-condition below.
//
// - We segment compactions into 3 priority levels:
@petermattis, since you were asking today: there are some long comments in this file, but I should have written more justification for these priority levels.
The issue is that we have
- two measures for scoring compactions: the score and the currentByteRatios
- the score for L0 is not really comparable with the other scores since it is computed differently
- I did not increase the max number of concurrent compactions, since I considered that cheating with respect to the experimental comparison, so the import experiment continued to run with 3. We therefore did not have enough compaction capacity to keep the "nice" LSM shape, and the question became whether we can keep an "ok" LSM shape (which is what the long comment preceding this is about). In the real world, given that we have the potential to do many concurrent compactions, we could consider increasing the max concurrent compactions if one is provisioned appropriately. I think we will also need to bring back some of the rate throttling code.
Arranging these into a priority total order is non-obvious. So the heuristic first tries to categorize, so that it can side-step constructing a total ordering. One could potentially argue for 2 categories, high and low. During my experiments I was struggling with how to deal with a very high score using just 2 categories. 3 categories made it simpler for me (a rough sketch follows this list):
- highest was really meant for something that has fallen behind too much -- we don't really care which level this is. Just compact it! Things ended up here occasionally, and quickly got out. Note that the ordering within highest is by decreasing score. Most of the time there was nothing in highest.
- high is where we can start playing more interesting games -- this is where most of the levels are: we reorder by preferring higher levels first. This is what makes L0 => Lbase happen a lot, and then Lbase => Lbase+1.
- low is the boring category: ordered again by decreasing score. If we get to these we don't really care about currentByteRatios -- we have enough resources to maintain a low enough score for all levels.
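For illustration only, here is a minimal sketch of how a three-bucket prioritization like the one described above could be expressed; the priority names, the candidate struct, and the field layout are assumptions invented for this sketch, not the actual compaction picker code in this PR.

```go
package main

import "sort"

// Hypothetical priority buckets mirroring the three categories described
// above; names and fields are invented for this sketch.
type priority int

const (
	prioHighest priority = iota // fallen too far behind: just compact it
	prioHigh                    // prefer higher levels (smaller level numbers) first
	prioLow                     // healthy: plain decreasing-score ordering
)

type candidate struct {
	level int     // LSM level number (0 for L0); illustrative only
	score float64 // level score; note L0's score is computed differently in practice
	prio  priority
}

// sortCandidates orders candidates by bucket first, then applies the
// per-bucket rule: decreasing score within highest and low, and
// smaller level number (i.e. higher in the LSM) within high.
func sortCandidates(cands []candidate) {
	sort.Slice(cands, func(i, j int) bool {
		a, b := cands[i], cands[j]
		if a.prio != b.prio {
			return a.prio < b.prio
		}
		if a.prio == prioHigh {
			return a.level < b.level
		}
		return a.score > b.score
	})
}

func main() {
	cands := []candidate{
		{level: 3, score: 1.2, prio: prioLow},
		{level: 0, score: 2.5, prio: prioHigh},
		{level: 2, score: 9.0, prio: prioHighest},
	}
	sortCandidates(cands)
	// cands[0] is now the L2 candidate from the highest bucket.
	_ = cands
}
```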
@sumeerbhola I've answered the questions you asked in #670. This PR was mostly a reference point for any reviewers of past PRs to be able to tie together the pieces, but I didn't expect it to be reviewed directly. As I mentioned in #670, my experimentation so far shows that the prioritization done here is less impactful than other changes like flush splits and the preIngestDelay changes.
@itsbilal This can be closed, right?
Yes, this can be closed. Thanks for the reminder!
Very much a work-in-progress; first commit is #615.