Avoid Needless Cache Status Fetches in SearchableSnapshotAllocator #66433

original-brownbear · 2020-12-16T09:36:10Z

We shouldn't fetch cache status if no allocation is possible to begin with.
Also, this surfaced an issue with using the Client to reroute since that
won't retry stale shards (failed the invalid license IT for example) so I moved
to using the RerouteService like we do in the GatewayAllocator.
(Plus, dried up one method that was 100% the same as in the replica allocator)

We shouldn't fetch cache status if no allocation is possible to begin with. Also, this surfaced an issue with using the `Client` to `reroute` since that won't retry stale shards (failed the invalid license IT for example) so I moved to using the `RerouteService` like we do in the `GatewayAllocator`. (Plus, dried up one method that was 100% the same as in the replica allocator)

elasticmachine · 2020-12-16T09:36:13Z

Pinging @elastic/es-distributed (Team:Distributed)

elasticmachine · 2020-12-16T09:36:13Z

Pinging @elastic/es-distributed (Team:Distributed)

original-brownbear · 2020-12-16T09:36:35Z

server/src/main/java/org/elasticsearch/action/admin/cluster/reroute/ClusterRerouteResponse.java

@@ -44,7 +44,7 @@
        explanations = RoutingExplanations.readFrom(in);
    }

-    public ClusterRerouteResponse(boolean acknowledged, ClusterState state, RoutingExplanations explanations) {
+    ClusterRerouteResponse(boolean acknowledged, ClusterState state, RoutingExplanations explanations) {


Just a revert of making this public yesterday now that it's not needed for tests any longer.

henningandersen

LGTM, though it would be good to add a test to verify the problem.

henningandersen · 2020-12-16T12:34:12Z

...s/src/main/java/org/elasticsearch/xpack/searchablesnapshots/SearchableSnapshotAllocator.java

        // pre-check if it can be allocated to any node that currently exists, so we won't list the cache sizes for it for nothing
        // TODO: in the following logic, we do not account for existing cache size when handling disk space checks, should and can we
        // reliably do this in a world of concurrent cache evictions or are we ok with the cache size just being a best effort hint
        // here?
-        Tuple<Decision, Map<String, NodeAllocationResult>> result = canBeAllocatedToAtLeastOneNode(shardRouting, allocation);
+        Tuple<Decision, Map<String, NodeAllocationResult>> result = ReplicaShardAllocator.canBeAllocatedToAtLeastOneNode(


Can we add a test that we do not trigger any reads or reroutes when deciders say no?

Sure thing, I pushed 0cb0017 :)

…fetches

original-brownbear · 2020-12-16T13:47:58Z

Thanks Henning!

…lastic#66433) We shouldn't fetch cache status if no allocation is possible to begin with. Also, this surfaced an issue with using the `Client` to `reroute` since that won't retry stale shards (failed the invalid license IT for example) so I moved to using the `RerouteService` like we do in the `GatewayAllocator`. (Plus, dried up one method that was 100% the same as in the replica allocator)

…66433) (#66444) We shouldn't fetch cache status if no allocation is possible to begin with. Also, this surfaced an issue with using the `Client` to `reroute` since that won't retry stale shards (failed the invalid license IT for example) so I moved to using the `RerouteService` like we do in the `GatewayAllocator`. (Plus, dried up one method that was 100% the same as in the replica allocator)

elasticmachine added Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. labels Dec 16, 2020

original-brownbear commented Dec 16, 2020

View reviewed changes

original-brownbear requested review from henningandersen and tlrx December 16, 2020 10:23

henningandersen approved these changes Dec 16, 2020

View reviewed changes

original-brownbear added 2 commits December 16, 2020 13:36

Merge remote-tracking branch 'elastic/master' into avoid-unnecessary-…

75819e1

…fetches

add test

0cb0017

original-brownbear merged commit 7c65a2b into elastic:master Dec 16, 2020

original-brownbear deleted the avoid-unnecessary-fetches branch December 16, 2020 13:48

original-brownbear mentioned this pull request Dec 16, 2020

Avoid Needless Cache Status Fetches in SearchableSnapshotAllocator (#66433) #66444

Merged

original-brownbear restored the avoid-unnecessary-fetches branch January 4, 2021 01:10

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid Needless Cache Status Fetches in SearchableSnapshotAllocator #66433

Avoid Needless Cache Status Fetches in SearchableSnapshotAllocator #66433

original-brownbear commented Dec 16, 2020

elasticmachine commented Dec 16, 2020

elasticmachine commented Dec 16, 2020

original-brownbear Dec 16, 2020

henningandersen left a comment

henningandersen Dec 16, 2020

original-brownbear Dec 16, 2020

original-brownbear commented Dec 16, 2020

Avoid Needless Cache Status Fetches in SearchableSnapshotAllocator #66433

Avoid Needless Cache Status Fetches in SearchableSnapshotAllocator #66433

Conversation

original-brownbear commented Dec 16, 2020

elasticmachine commented Dec 16, 2020

elasticmachine commented Dec 16, 2020

original-brownbear Dec 16, 2020

Choose a reason for hiding this comment

henningandersen left a comment

Choose a reason for hiding this comment

henningandersen Dec 16, 2020

Choose a reason for hiding this comment

original-brownbear Dec 16, 2020

Choose a reason for hiding this comment

original-brownbear commented Dec 16, 2020