Shards should rebalance across multiple path.data on one node #11271
Comments
+1, I think instead of doing the average for all the paths we can just use the …
I think so, at least for the logic that notices when a node is "getting full" and triggers a rebalance.
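One reading of the truncated suggestion above (a hypothetical sketch, not Elasticsearch code): instead of averaging free space across all data paths, treat the node as "getting full" as soon as its fullest individual path crosses the high watermark. The paths and the threshold below are made up for illustration.

```java
import java.io.File;

public class FullestPathCheck {

    /**
     * Returns true if ANY single data path has crossed the high watermark,
     * rather than averaging (or summing) free space across all paths.
     */
    public static boolean nodeIsGettingFull(File[] dataPaths, double highWatermarkUsedRatio) {
        for (File path : dataPaths) {
            long total = path.getTotalSpace();
            if (total == 0) {
                continue; // path unavailable; skip it
            }
            double usedRatio = 1.0 - ((double) path.getUsableSpace() / total);
            if (usedRatio >= highWatermarkUsedRatio) {
                return true; // one nearly-full path is enough to trigger a rebalance
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Illustrative paths; a node-level average could look fine while /data1 is at 95%.
        File[] dataPaths = { new File("/data1"), new File("/data2") };
        System.out.println(nodeIsGettingFull(dataPaths, 0.90));
    }
}
```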
I think there are only two possible solutions here: …
If we remove multi-path support, users can run multiple instances per server, and we'd need to provide a tool to migrate data before starting the cluster (or use snapshot/restore).
…ation Today we only guess how big the shard will be that we are allocating on a node. Yet, we have this information on the master, but it is not available on the data nodes when we pick a data path for the shard. We use a rather simple heuristic based on existing shard sizes on the node, which might be completely bogus. This change adds the expected shard size to the ShardRouting for RELOCATING and INITIALIZING shards so it can be used on the actual node to find the best data path for the shard. Closes elastic#11271
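A rough sketch of how a data node could use such an expected shard size when picking a data path; the class and method names below are invented for illustration and are not the actual Elasticsearch implementation.

```java
import java.io.IOException;
import java.nio.file.FileStore;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

public class DataPathPicker {

    /**
     * Picks the data path that would have the most usable space left after
     * placing a shard of the expected size, instead of guessing from the
     * sizes of shards already on the node.
     */
    public static Path pickPath(List<Path> dataPaths, long expectedShardSizeBytes) throws IOException {
        Path best = null;
        long bestRemaining = Long.MIN_VALUE;
        for (Path path : dataPaths) {
            FileStore store = Files.getFileStore(path);
            long remaining = store.getUsableSpace() - expectedShardSizeBytes;
            if (remaining > bestRemaining) {
                bestRemaining = remaining;
                best = path;
            }
        }
        return best; // may still be over-committed if no path has enough room
    }
}
```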
Follow-on from #11185:
As of #9398 we now allocate an entire shard to one path in the node's path.data, instead of file by file.
Even though we do the initial allocation to the path.data with the most free space, if the shards then grow in lopsided ways (an "adversary"), we can get to a state where one path.data on a single node is nearly full while the others are nearly empty. I suspect such adversarial growth would not be uncommon in practice...
Yet, DiskThresholdDecider only looks at total usable space on the node (not per-path on path.data) so it won't notice when only one path is close to full... we need to fix that? Also, the shard allocation process needs to see/address each path.data separately somehow?
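For illustration only (not the real DiskThresholdDecider), a per-path version of that check could look roughly like this: the node-level total may appear healthy even though no single path can take the shard without crossing the watermark.

```java
import java.io.File;

public class PerPathThresholdCheck {

    /**
     * Returns true only if at least one individual data path could hold the
     * incoming shard while staying under the high watermark, instead of
     * checking the summed usable space of the whole node.
     */
    public static boolean canAllocate(File[] dataPaths, long shardSizeBytes, double highWatermarkUsedRatio) {
        for (File path : dataPaths) {
            long total = path.getTotalSpace();
            if (total == 0) {
                continue; // path unavailable; skip it
            }
            long usableAfter = path.getUsableSpace() - shardSizeBytes;
            double usedRatioAfter = 1.0 - ((double) usableAfter / total);
            if (usedRatioAfter < highWatermarkUsedRatio) {
                return true; // this single path can hold the shard within limits
            }
        }
        return false;
    }
}
```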
Sometimes a shard would just move from one path.data to another path.data on the same node (maybe we should bias toward that, all other criteria being nearly equal, since we save on network traffic).
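A tiny sketch of that bias (the names and the epsilon are invented): when a local path-to-path move on the same node scores almost as well as relocating to another node, prefer the local move, since it avoids the network copy.

```java
public class MovePreference {

    record Candidate(String targetNodeId, String targetDataPath, double score) {}

    /**
     * Prefers the same-node, path-to-path move when its score is within
     * epsilon of the best cross-node relocation, because it needs no
     * network transfer.
     */
    static Candidate choose(Candidate localPathMove, Candidate remoteMove, double epsilon) {
        if (localPathMove.score() + epsilon >= remoteMove.score()) {
            return localPathMove;
        }
        return remoteMove;
    }
}
```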
I think it's important to fix this, but it is beyond my knowledge of ES for now...