
Faster shard loading #5372

Merged: 9 commits merged into master on Mar 29, 2016

Conversation

@jwilder (Contributor) commented Jan 15, 2016

This PR improves startup times for databases with many shards. The main changes are:

  • Load shards concurrently (see the sketch below)
  • Avoid querying the TSM index multiple times to get keys and then types for those keys
  • Avoid some unnecessary allocations
  • Move the database in-memory index locking into the index type to allow shards to be loaded concurrently

Should help #5311.
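
For illustration, here is a minimal sketch of the first bullet: loading shards concurrently with a sync.WaitGroup and collecting errors. Shard, loadShard, and loadAll are hypothetical stand-ins, not the PR's actual types.

package main

import (
	"fmt"
	"sync"
)

// Shard and loadShard are placeholders for a real shard type and the work
// of opening its TSM files and building its index.
type Shard struct{ Path string }

func loadShard(sh *Shard) error {
	fmt.Println("loading", sh.Path)
	return nil
}

// loadAll opens every shard in its own goroutine and returns the first error.
func loadAll(shards []*Shard) error {
	var (
		wg   sync.WaitGroup
		mu   sync.Mutex
		errs []error
	)
	for _, sh := range shards {
		wg.Add(1)
		go func(sh *Shard) {
			defer wg.Done()
			if err := loadShard(sh); err != nil {
				mu.Lock()
				errs = append(errs, err)
				mu.Unlock()
			}
		}(sh)
	}
	wg.Wait()
	if len(errs) > 0 {
		return errs[0]
	}
	return nil
}

func main() {
	_ = loadAll([]*Shard{{Path: "db/rp/1"}, {Path: "db/rp/2"}})
}

The commit notes further down add a cap on how many loads run at once, since each shard load can allocate a lot of memory.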

@jwilder (Contributor, Author) commented Jan 15, 2016

@dswarbrick @toddboom Would be interested to see if this improves startup times of some larger DBs.

@toddboom (Contributor) commented:

@jwilder tested this on a smaller database this morning with inconclusive results (there's one large shard that takes all of the time, so parallelization didn't matter), but will test it on the larger datasets this afternoon.

@thedrow commented Feb 3, 2016

This needs to be rebased FYI.

@jwilder (Contributor, Author) commented Feb 3, 2016

Yes. I may need to revert some changes in here as well.

@@ -203,23 +209,35 @@ func (f *FileStore) Remove(paths ...string) {
 	sort.Sort(tsmReaders(f.files))
 }

-func (f *FileStore) Keys() []string {
+func (f *FileStore) WalkKeys(fn func(key string, typ byte) error) error {
A reviewer (Contributor) commented:

Exported method needs a comment.
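
The diff above swaps a Keys() method that returned every key for a callback-style walk. As a rough illustration of that pattern (the index layout and types below are invented for this sketch, not the PR's actual FileStore internals):

package main

import "fmt"

// fileStore is a stand-in with a toy key -> block-type index.
type fileStore struct {
	index map[string]byte
}

// WalkKeys visits each key once, passing the key and its type to fn, so
// callers no longer materialize a slice of all keys and then query the
// index a second time for each key's type.
func (f *fileStore) WalkKeys(fn func(key string, typ byte) error) error {
	for key, typ := range f.index {
		if err := fn(key, typ); err != nil {
			return err
		}
	}
	return nil
}

func main() {
	fs := &fileStore{index: map[string]byte{"cpu,host=a#!~#value": 0}}
	_ = fs.WalkKeys(func(key string, typ byte) error {
		fmt.Println(key, typ)
		return nil
	})
}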

@e-dard (Contributor) commented Mar 23, 2016

LGTM

		values = getFloat64Values(nvals)
		for i := 0; i < nvals; i++ {
			values[i] = &FloatValue{}
		}
	case integerEntryType:
		values = getIntegerValues(nvals)
A reviewer (Contributor) commented:

delete line:571?

@jwilder (Contributor, Author) replied:

fixed.

Commit messages pushed to the branch:

When loading many shards concurrently, they block trying to acquire a write lock in the sync pool, adding a new source of contention. Since this code flow always needs to allocate a buffer, the pool isn't really buying us much.
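
A rough sketch of the problem that commit describes, assuming a pool guarded by a single mutex as the message suggests (bufPool and getBuf are invented names, not InfluxDB's actual pool code): every goroutine loading a shard serializes on the lock, while a plain allocation does not.

package main

import (
	"fmt"
	"sync"
)

// bufPool is a hypothetical pool guarded by one mutex; concurrent shard
// loads all contend on mu just to obtain a buffer.
type bufPool struct {
	mu   sync.Mutex
	bufs [][]byte
}

func (p *bufPool) get(n int) []byte {
	p.mu.Lock()
	defer p.mu.Unlock()
	if l := len(p.bufs); l > 0 {
		b := p.bufs[l-1]
		p.bufs = p.bufs[:l-1]
		if cap(b) >= n {
			return b[:n]
		}
	}
	return make([]byte, n)
}

// If the call site always needs a fresh buffer anyway, allocating directly
// removes the shared lock from the hot path.
func getBuf(n int) []byte {
	return make([]byte, n)
}

func main() {
	p := &bufPool{}
	fmt.Println(len(p.get(1024)), len(getBuf(1024)))
}
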
Since loading a shard can allocate a lot of memory, running them all
at once could OOM the process.  This limits the number of shards
loaded to 4.  This will be changed to a config option provided the
approach helps.
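
One common way to enforce such a cap in Go is a buffered channel used as a counting semaphore. The sketch below extends the earlier loading sketch with a limit of 4 in-flight loads; the names are placeholders, and the PR notes the value may later become a config option.

package main

import "sync"

type Shard struct{ Path string }

// loadShard stands in for opening a shard's files and index.
func loadShard(sh *Shard) error { return nil }

// loadAllLimited loads shards concurrently but keeps at most 4 in flight,
// using a buffered channel as a counting semaphore.
func loadAllLimited(shards []*Shard) {
	const maxConcurrent = 4
	sem := make(chan struct{}, maxConcurrent)
	var wg sync.WaitGroup
	for _, sh := range shards {
		wg.Add(1)
		sem <- struct{}{} // blocks while 4 loads are already running
		go func(sh *Shard) {
			defer wg.Done()
			defer func() { <-sem }()
			_ = loadShard(sh)
		}(sh)
	}
	wg.Wait()
}

func main() {
	loadAllLimited([]*Shard{{Path: "db/rp/1"}, {Path: "db/rp/2"}})
}
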
Avoids allocating a big map or all keys.
@jwilder merged commit b3c1320 into master on Mar 29, 2016
@jwilder deleted the jw-shard-load branch on March 29, 2016 at 19:11
@jwilder mentioned this pull request on Mar 31, 2016