
Introduce executor for concurrent search #98204

Merged

Conversation

@javanna (Member) commented Aug 4, 2023

Enable concurrent search across segments for knn queries

Elasticsearch has historically performed search sequentially across segments. Lucene supports parallelizing search across segments both when collecting hits (via collector managers) and when rewriting certain queries (e.g. knn queries).

This commit also enables concurrent search execution in the DFS phase, which improves resource usage as well as the performance of knn queries, which benefit from both concurrent rewrite and concurrent collection.

Enable offloading of sequential collection to search worker thread pool

We will enable concurrent execution for the query phase in a subsequent commit. While this commit does not introduce parallelism for the query phase, it does introduce offloading of sequential computation to the newly introduced executor. This applies both when a single slice needs to be searched and when a specific request does not support concurrency (currently only the DFS phase supports concurrency regardless of the request). Sequential collection is not offloaded only if the request includes aggregations that don't support offloading: composite, nested and cardinality. Their post-collection method must be executed in the same thread as the collection, or we'll trip a Lucene assertion that verifies that doc_values are pulled and consumed from the same thread.
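For illustration, a minimal sketch of what offloading a sequential collection might look like; `workerPool` and the `Runnable` wrapping the single-slice search are assumed names, not the PR's actual code:

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.FutureTask;

// Sketch only: run the whole sequential (single-slice) search as one task on
// the worker pool, while the search thread waits for it to complete.
static void offloadSequential(ExecutorService workerPool, Runnable singleSliceSearch) throws Exception {
    FutureTask<Void> task = new FutureTask<>(singleSliceSearch, null);
    workerPool.execute(task);
    task.get(); // the search (caller) thread blocks until the offloaded work finishes
}
```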

Technical details

Elasticsearch is now ready to support concurrency within a single shard. Search is already performed using collector managers, and the last missing piece is providing an executor to the index searcher so that it can offload the concurrent computation to it.
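Lucene exposes this directly: IndexSearcher accepts an executor at construction time and uses it to parallelize work across leaf slices. A minimal illustration:

```java
import java.util.concurrent.ExecutorService;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.search.IndexSearcher;

// Passing an executor lets the searcher offload per-slice work instead of
// searching all segments sequentially on the calling thread.
static IndexSearcher newConcurrentSearcher(IndexReader reader, ExecutorService executor) {
    return new IndexSearcher(reader, executor);
}
```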

This commit introduces a secondary executor, used exclusively to execute the concurrent bits of search. The search threads are still the ones that coordinate the search (where the search call originates), but the actual work is offloaded to the newly introduced executor.

We are offloading not only parallel execution but also sequential execution, to make the workload more predictable: it would be surprising to have bits of search run sometimes in one thread pool and sometimes in the other. It would also make it possible to suddenly run a higher number of heavy operations overall (some on the caller thread and some on the separate threads), which could overload the system and make sizing of the thread pools more difficult.

Note that fetch, together with other actions, is still executed in the search thread pool. This commit does not make the search thread pool a merely coordinating thread pool; it does so only for the IndexSearcher#search operation itself, which is nevertheless a big portion of the different phases of search API execution.

Given that the searcher blocks waiting for all tasks to complete, we take a simple approach: we introduce a thread pool executor that has the same size as the existing search thread pool but relies on an unbounded queue. This simplifies handling of the thread pool queue and rejections. In fact, we'd like to guarantee that the secondary thread pool never rejects, and delegate queuing entirely to the search thread pool, which is the entry point for every search operation anyway. The principle behind this is that if you got a slot in the search thread pool, you should be able to complete your search, and rather quickly.
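A sketch of such an executor using plain java.util.concurrent primitives (an assumption for illustration, not the actual Elasticsearch code):

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

// Fixed-size pool matching the search pool, backed by an unbounded queue:
// submissions are never rejected, so admission control (queueing, rejections)
// stays entirely with the search thread pool.
static ThreadPoolExecutor newSearchWorkerPool(int searchPoolSize) {
    return new ThreadPoolExecutor(
        searchPoolSize, searchPoolSize,   // same size as the search thread pool
        0L, TimeUnit.MILLISECONDS,
        new LinkedBlockingQueue<>()       // unbounded queue: never rejects
    );
}
```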

As part of this commit we are also introducing the ability to cancel tasks that have not started yet, so that if any task throws an exception, other tasks are prevented from starting needless computation.
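A sketch of that cancellation behaviour with plain FutureTasks (illustrative names; cancel(false) never interrupts a task that is already running, but prevents a queued one from ever starting):

```java
import java.util.List;
import java.util.concurrent.FutureTask;

// If one task fails, stop the not-yet-started ones from doing needless work.
static void cancelPending(List<FutureTask<?>> tasks) {
    for (FutureTask<?> task : tasks) {
        task.cancel(false); // no-op for completed or currently running tasks
    }
}
```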

Relates to #80693
Relates to #90700

@javanna added the >enhancement, :Search/Search, and v8.10.0 labels Aug 4, 2023
@elasticsearchmachine added the Team:Search label Aug 4, 2023
@elasticsearchmachine (Collaborator) commented:

Pinging @elastic/es-search (Team:Search)

@elasticsearchmachine (Collaborator) commented:

Hi @javanna, I've created a changelog YAML for you.

@benwtrent (Member) left a comment:

I am by no means an expert here and my review doesn't count for much; I had some minor things.

```java
try {
    semaphore.acquire();
} catch (InterruptedException e) {
    throw new ThreadInterruptedException(e);
}
```

If the thread is interrupted, shouldn't we bubble it up via Thread.currentThread().interrupt();?
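Applied to the quoted snippet, the suggested pattern would be (a sketch, assuming the surrounding context of the snippet):

```java
try {
    semaphore.acquire();
} catch (InterruptedException e) {
    Thread.currentThread().interrupt(); // restore the interrupt flag for callers
    throw new ThreadInterruptedException(e);
}
```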

@henningandersen (Contributor) left a comment:

Started on this and left a few initial comments.

```diff
@@ -502,7 +502,7 @@ public void apply(Settings value, Settings current, Settings previous) {
             ResourceWatcherService.RELOAD_INTERVAL_LOW,
             SearchModule.INDICES_MAX_CLAUSE_COUNT_SETTING,
             SearchModule.INDICES_MAX_NESTED_DEPTH_SETTING,
-            SearchModule.SEARCH_CONCURRENCY_ENABLED,
+            SearchService.SEARCH_WORKER_THREADS_ENABLED,
```
@javanna (Member, Author) commented:
I have renamed the escape hatch to disable concurrency to align it with the name of the new thread pool, as it effectively affects whether the new thread pool is enabled or not.

Setting.Property.NodeScope,
Setting.Property.Dynamic
);

@javanna (Member, Author) commented:
I moved this to SearchService together with the other existing search settings (including minimum docs per slice, which affects search concurrency).
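For illustration, a renamed escape hatch along these lines might be declared as follows; only the field name comes from the diff above, while the setting key and default shown here are assumptions:

```java
import org.elasticsearch.common.settings.Setting;

// Hypothetical declaration: a dynamic, node-scoped boolean acting as the
// escape hatch that enables or disables the search worker thread pool.
public static final Setting<Boolean> SEARCH_WORKER_THREADS_ENABLED = Setting.boolSetting(
    "search.worker_threads.enabled",  // assumed key, not verbatim from the PR
    true,
    Setting.Property.NodeScope,
    Setting.Property.Dynamic
);
```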

@javanna (Member, Author) commented Aug 7, 2023

@henningandersen @DaveCTurner I pushed an update to replace the bounded executor with a plain thread pool executor that relies on a synchronous queue. I added some questions for you two that I could use your help with.

I am still working on the additional ContextIndexSearcherTests, which I just realized are not using the synchronous queue. I need to update those; work in progress.

@DaveCTurner (Contributor) left a comment:

I don't think a SynchronousQueue has the right semantics here. AIUI it means that each .execute() call will wait for the task to start executing on a worker thread, but I think we want to block the thread calling .execute() until the task it submits is complete. Am I missing something there?

@henningandersen (Contributor) commented:

> I don't think a SynchronousQueue has the right semantics here. AIUI it means that each .execute() call will wait for the task to start executing on a worker thread, but I think we want to block the thread calling .execute() until the task it submits is complete. Am I missing something there?

Blocking waiting for completion should be done by calling Future.get() on the result of execute. What SynchronousQueue helps with is to avoid any queuing and block execute calls until a free thread is available in the search worker pool.
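For illustration, one way to get those hand-off semantics with a ThreadPoolExecutor (a sketch, not the PR's code): a SynchronousQueue holds no elements, so a rejection handler that re-offers the task via put() blocks the submitting thread until a worker becomes free.

```java
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.SynchronousQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

static ThreadPoolExecutor newHandOffPool(int size) {
    return new ThreadPoolExecutor(
        size, size,
        0L, TimeUnit.MILLISECONDS,
        new SynchronousQueue<>(),              // no queuing: direct hand-off to an idle worker
        (task, executor) -> {                  // all workers busy: block the caller instead of rejecting
            try {
                executor.getQueue().put(task); // blocks until an idle worker polls the queue
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new RejectedExecutionException(e);
            }
        }
    );
}
```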

@DaveCTurner (Contributor) commented:

.execute() doesn't return anything, but yes to the more general point that we'll need to wrap each task with some kind of Future on which we wait after calling execute().

> What SynchronousQueue helps with is to avoid any queuing and block execute calls until a free thread is available in the search worker pool.

Sure, but can you help me understand why we need that? Is it just performance or are there semantic differences I'm missing? If it's performance, how much time does it save in practice? I'd prefer not to introduce yet another kind of queue (and another source of potential blocking to trip up the unwary) without a strong justification.

@javanna (Member, Author) commented Aug 7, 2023

> we'll need to wrap each task with some kind of Future on which we wait after calling execute().

This is already the case: it's how the FutureTask instances are created in ContextIndexSearcher, and Future#get is called on them.
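For reference, the wrap-and-wait pattern described here looks roughly like this (a sketch with assumed names, not the actual ContextIndexSearcher code):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.Executor;
import java.util.concurrent.FutureTask;

// Wrap each unit of work in a FutureTask, hand it to the executor, then have
// the caller block on get() until every task has completed.
static <T> List<T> executeAll(Executor executor, List<Callable<T>> work) throws Exception {
    List<FutureTask<T>> tasks = new ArrayList<>();
    for (Callable<T> callable : work) {
        FutureTask<T> task = new FutureTask<>(callable);
        tasks.add(task);
        executor.execute(task);
    }
    List<T> results = new ArrayList<>();
    for (FutureTask<T> task : tasks) {
        results.add(task.get()); // blocks the coordinating (search) thread
    }
    return results;
}
```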

javanna added a commit to javanna/elasticsearch that referenced this pull request Aug 11, 2023
We have introduced a search worker thread pool with elastic#98204 that is responsible for the heavy workloads of the query and DFS phases, whether or not they are parallelized across segments/slices. TSDB aggregations are still executed in the search thread pool, and this commit moves their computation to the search worker thread pool, even though the corresponding search thread blocks and waits for that computation to complete before returning.
javanna added a commit that referenced this pull request Aug 14, 2023
csoulios pushed a commit to csoulios/elasticsearch that referenced this pull request Aug 18, 2023
csoulios pushed a commit to csoulios/elasticsearch that referenced this pull request Aug 18, 2023
…98414)

@elasticsearchmachine (Collaborator) commented:

@javanna according to this PR's labels, I need to update the changelog YAML, but I can't because the PR is closed. Please either update the changelog yourself on the appropriate branch, or adjust the labels. Specifically:

  • The PR is labelled release highlight but the changelog has no highlight section

javanna added a commit to javanna/elasticsearch that referenced this pull request Aug 21, 2023
quux00 pushed a commit that referenced this pull request Aug 22, 2023
javanna added a commit to javanna/elasticsearch that referenced this pull request Aug 22, 2023
quux00 pushed a commit that referenced this pull request Aug 22, 2023
mayya-sharipova added a commit to mayya-sharipova/rally-tracks that referenced this pull request Sep 2, 2023
With the introduction of concurrent search across multiple segments (elastic/elasticsearch#98204), there is a need to measure search across multiple segments before force merge.

This PR adds this operation.
mayya-sharipova added a commit to elastic/rally-tracks that referenced this pull request Sep 27, 2023
mayya-sharipova added a commit to mayya-sharipova/elasticsearch that referenced this pull request Oct 11, 2023
mayya-sharipova added a commit that referenced this pull request Oct 11, 2023
mayya-sharipova added a commit that referenced this pull request Oct 11, 2023
mayya-sharipova added a commit that referenced this pull request Oct 11, 2023
inqueue pushed a commit to inqueue/rally-tracks that referenced this pull request Dec 6, 2023
Labels
>feature, release highlight, :Search/Search (Search-related issues that do not fall into other categories), Team:Search (Meta label for search team), v8.10.0
7 participants