
db: improve ingested flushable flushing logic #2266

Open

bananabrick opened this issue Jan 23, 2023 · 6 comments

Comments

bananabrick (Contributor) commented Jan 23, 2023

Presently, in #2212, we split a flush into chunks so that ingested flushables have an updated view of the LSM while flushing.

This is unnecessary. Given a queue of flushables where f_i is the flushable containing sstables at position i, we can determine the target level for the files associated with f_i once we've determined the target levels of all flushables up to f_{i-1}. f_0 is the base case: it can use the current version to determine its target level.

This will prevent a potential slowdown of flushes when ingested flushables are present in the flushable queue.
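A minimal sketch of the idea in Go (not Pebble's actual code; the types, the seven-level constant, and the overlap/target-level helpers are simplified placeholders): the overlay is seeded with the current version's files, and a single pass over the flushable queue assigns target levels, so f_i sees both the current version and the files already placed for f_0 through f_{i-1}.

```go
package flushsketch

// Simplified stand-ins for Pebble's real types; hypothetical, for illustration only.
type fileMetadata struct {
	smallest, largest string // user-key bounds of the sstable
}

type flushable struct {
	files []fileMetadata // sstables carried by an ingested flushable
}

// overlay starts from the files in the current version and is extended with
// every file we place, so later flushables see earlier assignments.
type overlay struct {
	levels [7][]fileMetadata // L0..L6
}

func (o *overlay) overlaps(level int, f fileMetadata) bool {
	for _, ex := range o.levels[level] {
		if f.smallest <= ex.largest && ex.smallest <= f.largest {
			return true
		}
	}
	return false
}

// pickTargetLevel descends from L1 and stops above the first level that
// contains overlapping (older) data, mirroring the usual ingest rule in
// simplified form. Overlap with L0 keeps the file in L0.
func (o *overlay) pickTargetLevel(f fileMetadata) int {
	if o.overlaps(0, f) {
		return 0
	}
	target := 0
	for level := 1; level < len(o.levels); level++ {
		if o.overlaps(level, f) {
			break
		}
		target = level
	}
	return target
}

// assignTargetLevels makes one pass over the flushable queue. f_0 is decided
// against the current version (already seeded into the overlay); each later
// flushable also sees the files placed for f_0..f_{i-1}.
func assignTargetLevels(o *overlay, queue []flushable) [][]int {
	levels := make([][]int, len(queue))
	for i, fl := range queue {
		for _, f := range fl.files {
			target := o.pickTargetLevel(f)
			levels[i] = append(levels[i], target)
			o.levels[target] = append(o.levels[target], f)
		}
	}
	return levels
}
```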

Jira issue: PEBBLE-145

jbowens (Collaborator) commented Feb 15, 2023

@bananabrick can this slip to 23.2? I know you've got a lot on your plate for 23.1.

bananabrick (Contributor, Author) commented

@jbowens No, this one can't. I want to do this before enabling flushable ingests.

jbowens (Collaborator) commented Feb 15, 2023

@bananabrick can you explain why it can't wait? My understanding is that this is purely an optimization and is not a regression over 22.2.

bananabrick (Contributor, Author) commented Feb 16, 2023

@jbowens Let's say we have memtables m1, m2 in the queue, followed by ingested flushable i1, followed by memtable m3, followed by ingested flushable i2, followed by memtable m4. So, the queue is m1,m2,i1,m3,i2,m4. If we try to flush this right now, we're going to incur 4 manifest writes + syncs. In 22.2, we won't have i1 and i2, just m1,m2,m3,m4, and flushing m1, m2, and m3 will incur exactly one manifest write + sync.

Maybe writing the memtables to disk as sstables and syncing them will dominate the cost of a few manifest writes + syncs, in which case this is okay. Not 100% sure.
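To make the arithmetic concrete, here's a hypothetical helper (not Pebble's flush code) that splits a flushable queue into the chunks that would each need their own version edit, i.e. manifest write + sync, when ingested flushables are flushed separately from memtables. Assuming m4 is the still-mutable memtable, the flush covers m1,m2,i1,m3,i2 and splits into the four chunks counted above.

```go
package flushsketch

import "strings"

// flushChunks splits the flushable queue into chunks, each of which would
// require its own version edit (manifest write + sync): runs of memtables
// coalesce into one chunk, while every ingested flushable is its own chunk.
// Names prefixed with "i" stand in for ingested flushables; this is an
// illustration, not Pebble's actual flush logic.
func flushChunks(queue []string) [][]string {
	var chunks [][]string
	var memtables []string
	for _, f := range queue {
		if strings.HasPrefix(f, "i") {
			if len(memtables) > 0 {
				chunks = append(chunks, memtables) // run of memtables: one edit
				memtables = nil
			}
			chunks = append(chunks, []string{f}) // each ingest: its own edit
		} else {
			memtables = append(memtables, f)
		}
	}
	if len(memtables) > 0 {
		chunks = append(chunks, memtables)
	}
	return chunks
}

// flushChunks([]string{"m1", "m2", "i1", "m3", "i2"})
// => [[m1 m2] [i1] [m3] [i2]] — four chunks, hence four manifest writes + syncs.
```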

bananabrick (Contributor, Author) commented

Maybe I could just enable it and only do this issue if we see a regression in any benchmarks that perform Pebble ingestions.

jbowens (Collaborator) commented Feb 27, 2023

> So, the queue is m1,m2,i1,m3,i2,m4. If we try to flush this right now, we're going to incur 4 manifest writes + syncs. In 22.2, we won't have i1 and i2, just m1,m2,m3,m4.

In 22.2, wouldn't the same sequence of ingests result in:

  1. flush of m1+m2
  2. ingest of i1
  3. flush of m3
  4. ingest of i2
  5. flush of m4

with corresponding manifest writes and syncs for each?
