[Date Histogram] Apply the optimization to Composite aggregation with source as DateHistogram #11301

bowenlan-amzn · 2023-11-22T07:25:13Z

This issue tracks the investigation into how to apply the fast filter optimization proposed in #9310 to composite aggregation (comp-agg).

The targeted scenario is a composite aggregation with only one source, and that source is a date histogram.
Note that composite aggregation supports pagination: the default page size is 10, which can be customized with the size param, and the next page is requested with the after key.
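For reference, the targeted request shape can be sketched as below. This is only an illustration of the single-source case and of size/after pagination; the helper name, the aggregation name `by_date`, the source name `date`, the field `@timestamp`, and the epoch-millis after key are all made-up placeholders, not taken from this issue.

```python
import json


def composite_date_histogram_request(field, interval, size=10, after_key=None):
    """Build a search body with one composite agg whose only source is a date_histogram.

    Hypothetical helper for illustration; names like "by_date" and "date" are arbitrary.
    """
    composite = {
        "size": size,  # page size; the composite aggregation defaults to 10
        "sources": [
            {"date": {"date_histogram": {"field": field, "calendar_interval": interval}}}
        ],
    }
    if after_key is not None:
        # Continue from the last bucket key returned by the previous page.
        composite["after"] = after_key
    # "size": 0 suppresses hits so that only aggregation results are returned.
    return {"size": 0, "aggs": {"by_date": {"composite": composite}}}


first_page = composite_date_histogram_request("@timestamp", "1d")
next_page = composite_date_histogram_request(
    "@timestamp", "1d", after_key={"date": 1700611200000}  # placeholder after key
)
print(json.dumps(next_page, indent=2))
```

In practice the after key for the next page would be copied from the `after_key` of the previous response rather than hard-coded.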

How does composite aggregation work?

Optimization using the leading source (first aggregation in comp-agg)
When the leading source is backed by a sorted data structure (Point or TermsEnum), we can terminate early once enough buckets have been processed.
Optimization using the index sorting
If the sources of comp-agg match the index sorting and an afterKey is provided, we can use SearchAfterSortedDocQuery to produce an iterator that includes only the documents from which the aggregation should continue.
Normal case
We try to collect the values of every source for every document and add them to the composite aggregation queue if they are competitive. Without an existing sorted structure there is not much to optimize: we can only collect documents one by one and try to push each into a priority queue (heap).
Deferring collection for sub aggregation
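To make the contrast between the "normal case" and the leading-source early termination concrete, here is a toy sketch. It is not OpenSearch code: values are plain integers, bucketing is a simple rounding down to a multiple of `interval`, and the function names are invented for illustration. The first function visits every value and keeps the `size` smallest bucket keys in a bounded heap; the second assumes the values arrive already sorted, so it can stop as soon as `size` buckets are complete.

```python
import heapq


def collect_with_queue(values, size, interval):
    """Normal case: visit every value and keep only the `size` smallest bucket keys."""
    heap = []    # max-heap of kept keys, stored negated for heapq's min-heap
    counts = {}  # doc count per kept bucket key
    for v in values:
        key = v - v % interval  # round down to the bucket start
        if key in counts:
            counts[key] += 1
        elif len(heap) < size:
            heapq.heappush(heap, -key)
            counts[key] = 1
        elif key < -heap[0]:
            # Competitive: evict the current largest key to make room.
            evicted = -heapq.heapreplace(heap, -key)
            del counts[evicted]
            counts[key] = 1
        # Otherwise the key is not competitive and the value is skipped.
    return sorted(counts.items())


def collect_sorted(sorted_values, size, interval, after=None):
    """Sorted leading source: stop once `size` buckets past `after` are complete."""
    buckets = []
    for v in sorted_values:
        key = v - v % interval
        if after is not None and key <= after:
            continue  # belongs to a previous page
        if buckets and buckets[-1][0] == key:
            buckets[-1][1] += 1
        elif len(buckets) == size:
            break  # early termination: every later value falls in a later bucket
        else:
            buckets.append([key, 1])
    return [tuple(b) for b in buckets]


values = [5, 12, 3, 27, 14, 31, 8, 22]
page_unsorted = collect_with_queue(values, size=2, interval=10)
page_sorted = collect_sorted(sorted(values), size=2, interval=10)
next_page = collect_sorted(sorted(values), size=2, interval=10, after=page_sorted[-1][0])
print(page_unsorted, page_sorted, next_page)
```

Both functions return the same first page, but the sorted variant never looks at values beyond the first bucket that does not fit, which is the property the leading-source optimization relies on.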


jainankitk added this to Performance Roadmap Nov 3, 2023

bowenlan-amzn self-assigned this Nov 22, 2023

bowenlan-amzn converted this from a draft issue Nov 22, 2023

github-actions bot added the untriaged label Nov 22, 2023

bowenlan-amzn added Search:Performance Search:Aggregations and removed untriaged labels Nov 22, 2023

bowenlan-amzn mentioned this issue Dec 7, 2023

Apply the fast filter optimization to composite aggregation #11505

Merged

8 tasks

getsaurabh02 added the v2.12.0 Issues and PRs related to version 2.12.0 label Dec 13, 2023

msfroh closed this as completed in #11505 Jan 17, 2024

github-project-automation bot moved this from In Progress to Done in Performance Roadmap Jan 17, 2024
