Add support for merging multiple search responses into one #37566
Conversation
This will be used in cross-cluster search when reduction is performed locally on each cluster. The CCS coordinating node will send one search request per remote cluster involved and will get one search response back from each of them. Such responses contain all the info needed to perform an additional reduction and return results to the user. Relates to elastic#32125
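As a minimal sketch of the fan-out the description outlines (not the actual TransportSearchAction code; ClusterClient, execute and merge are hypothetical stand-ins):

import java.util.ArrayList;
import java.util.List;
import org.elasticsearch.action.search.SearchRequest;
import org.elasticsearch.action.search.SearchResponse;

interface ClusterClient {
    // hypothetical per-cluster client used only for illustration
    SearchResponse search(SearchRequest perClusterRequest);
}

final class CcsFanOutSketch {
    SearchResponse execute(List<ClusterClient> remoteClusters, SearchRequest original) {
        List<SearchResponse> responses = new ArrayList<>();
        for (ClusterClient cluster : remoteClusters) {
            // each cluster reduces locally (non-final reduce) and returns from + size hits,
            // so the coordinating node can still build the final page after merging
            responses.add(cluster.search(original));
        }
        // one additional, final reduction happens on the CCS coordinating node
        return merge(responses);
    }

    private SearchResponse merge(List<SearchResponse> responses) {
        throw new UnsupportedOperationException("stand-in for the merge step, illustration only");
    }
}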
Pinging @elastic/es-search
I left some comments regarding the TODOs in the code, LGTM otherwise
Can you add a comment regarding the limitations (the reduce does not handle inner_hits with collapsing, ...)?
 * and pipeline aggregations have not yet been executed. Also, from+size search hits need to be requested to each cluster.
 */
//TODO it may make sense to investigate reusing existing merge code in SearchPhaseController#reducedQueryPhase, the logic is similar
//yet there are substantial differences in terms of the objects exchanged and logic in the sortDocs method.
It's more than just reusing reducedQueryPhase: as we discussed earlier, we could integrate the remote cluster response as a shard response in the initial search phase and ignore hits coming from the remote cluster in the fetch phase. This would be identical to the removed QueryAndFetch strategy except that only the remote cluster response would have the fetch results. This is really a nice-to-have so no need to follow up on this now, but it would be nice if the TODO mentioned it.
/**
 * Add a search response to the list of responses to be merged together into one.
 * Merges currently happen at once when all responses are available and {@link #getMergedResponse()} is called. That may change
 * in the future as it's possible to introduce incremental merges as responses come in if necessary.
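A short usage sketch of the accumulate-then-merge semantics the javadoc above describes; getMergedResponse() comes from the snippet, while the add method name, the import of SearchResponseMerger and the surrounding class are assumptions for illustration:

import java.util.List;
import org.elasticsearch.action.search.SearchResponse;

final class MergerUsageSketch {
    SearchResponse mergeAll(SearchResponseMerger merger, List<SearchResponse> clusterResponses) {
        for (SearchResponse response : clusterResponses) {
            merger.add(response);   // responses are only accumulated here, no merge work yet
        }
        // the single merge happens now, once every cluster has responded
        return merger.getMergedResponse();
    }
}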
I don't think incremental merges are appealing here. The number of remote clusters should be low, so there is no benefit to doing the reduce incrementally.
I think we may come back to this and have a laugh one day about the "number of remote clusters should be low" assumption :) but I agree we are not talking about hundreds at the moment. I thought it might be useful to incrementally reduce given the size of the responses from each cluster, but we should first measure what the benefit is, if any.
TotalHits totalHits = null;
if (totalHitsRelation != null) {
    //TODO totalHits may overflow if each cluster reports a very high number?
@jimczi do you have thoughts on this one? Is it paranoia on my end?
totalHits.value is a long so I doubt that it will overflow unless you have millions of remote clusters ;). However this made me think that this PR doesn't handle track_total_hits when it is set as a number. In the normal execution we'll merge all the topdocs and if the resulting total hits is greater than track_total_hits we set the final value as the value in track_total_hits and the final relation to gte. You can check the logic here:
elasticsearch/server/src/main/java/org/elasticsearch/action/search/SearchPhaseController.java, line 759 in 95479f1:
TotalHits getTotalHits() {
We'll also need to change the code when #37466 is merged since the default for track_total_hits is going to change.
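For illustration, a small sketch of the capping behaviour described above, using Lucene's TotalHits; the method and class names are hypothetical, and the real logic lives in SearchPhaseController's TopDocsStats (linked above):

import org.apache.lucene.search.TotalHits;

final class TotalHitsCapSketch {
    // clamp the merged hit count to track_total_hits and switch the relation to gte,
    // mirroring what the normal single-cluster execution does after merging topdocs
    static TotalHits cap(long mergedHitCount, int trackTotalHitsUpTo) {
        if (mergedHitCount > trackTotalHitsUpTo) {
            return new TotalHits(trackTotalHitsUpTo, TotalHits.Relation.GREATER_THAN_OR_EQUAL_TO);
        }
        return new TotalHits(mergedHitCount, TotalHits.Relation.EQUAL_TO);
    }
}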
Good point, I addressed that in a325129. I think that since I reuse TopDocsStats, this will not need to change once the default trackTotalHitsUpTo changes.
I also went a step further in 8f1b063 and expanded TopDocsStats. Just an experiment, but it saves some duplicated logic; let me know what you think.
Thanks, the change looks good. Feel free to push when CI is green.
Adapt TopDocsStats so it can be reused.
This allows sharing more code between SearchResponseMerger and SearchPhaseController
… known once all responses have been obtained from the remote clusters
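An illustrative accumulator in the spirit of the TopDocsStats reuse mentioned in the commits above: it sums per-response hit counts and tracks the max score, which is the kind of bookkeeping both the regular search path and the CCS response merge need. All names here are hypothetical, not the actual class:

final class HitStatsSketch {
    private long totalHits;
    private float maxScore = Float.NEGATIVE_INFINITY;

    void add(long hits, float responseMaxScore) {
        totalHits += hits;                       // a long, so overflow is not a practical concern
        maxScore = Math.max(maxScore, responseMaxScore);
    }

    long totalHits() { return totalHits; }
    float maxScore() { return maxScore; }
}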
retest this please
retest this please
I am going to merge this PR despite the failed builds. There are problems in the elasticsearch CI infra that are causing a lot of build failures. I have run this PR multiple times on my own CI and everything was green, which gives me confidence that I can merge it.
With #37566 we have introduced the ability to merge multiple search responses into one. That makes it possible to expose a new way of executing cross-cluster search requests, one that makes CCS much faster whenever there is network latency between the CCS coordinating node and the remote clusters. The coordinating node can now send a single search request to each remote cluster, which gets reduced by each one of them. from + size results are requested from each cluster, and the reduce phase in each cluster is non-final (meaning that buckets are not pruned and pipeline aggs are not executed). The CCS coordinating node performs an additional, final reduction, which produces one search response out of the multiple responses received from the different clusters. This new execution path is activated by default for any CCS request unless a scroll is provided or inner hits are requested as part of field collapsing. The search API now accepts a new parameter called ccs_minimize_roundtrips that allows opting out of the default behaviour. Relates to #32125
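A client-side sketch of opting out of the new behaviour; the setter name is assumed to mirror the ccs_minimize_roundtrips REST parameter described above:

import org.elasticsearch.action.search.SearchRequest;

final class CcsOptOutSketch {
    static SearchRequest buildRequest() {
        // indices prefixed with a cluster alias target remote clusters
        SearchRequest request = new SearchRequest("remote_one:logs-*", "remote_two:logs-*");
        // opt out of the default: fall back to the per-shard fan-out across clusters
        request.setCcsMinimizeRoundtrips(false);
        return request;
    }
}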