
Error upper bound may be wrong when performing incremental reductions #40005

Closed

javanna opened this issue Mar 13, 2019 · 16 comments · Fixed by #43874
Labels
:Analytics/Aggregations, >bug, help wanted, adoptme, Team:Analytics

Comments

javanna (Member) commented Mar 13, 2019

When reducing terms aggs results, we check whether we have already computed a doc count error for a certain bucket by looking at its error value (see https://github.com/elastic/elasticsearch/blob/master/server/src/main/java/org/elasticsearch/search/aggregations/bucket/terms/InternalTerms.java#L245): if it is greater than zero, we have already calculated it; if it is zero, we assume we have not, so we ignore the value and use the doc count of the last returned bucket instead.

When performing incremental reductions, though, 0 may mean either that the error was not previously calculated, or that it was indeed previously calculated and its value was genuinely 0. We end up rejecting true values of 0 this way.
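
To illustrate the ambiguity, here is a minimal sketch of the reduce-time check (simplified, not the actual InternalTerms code; the accessors are illustrative):

// With a primitive long there is no way to distinguish "never
// calculated" from "calculated and the error is exactly 0".
long docCountError = bucket.getDocCountError();
if (docCountError > 0) {
  // already calculated on a previous (incremental) reduction: reuse it
} else {
  // 0 is ambiguous: either never calculated, or a true error of 0.
  // We currently assume "never calculated" and fall back to the doc
  // count of the last returned bucket, discarding a true 0.
  docCountError = lastReturnedBucketDocCount;
}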

elasticmachine (Collaborator) commented

Pinging @elastic/es-analytics-geo

javanna (Member, Author) commented Mar 13, 2019

Ping @colings86 @polyfractal, as we have chatted about this.

colings86 (Contributor) commented

IMO the best way to fix this is to use null as the indication that the error was not previously calculated, and to make the variable a Long. Note that -1 already has a meaning here: it indicates that the error cannot be calculated because of the sort order used. I think it would be a bad idea to start encoding more negative numbers with meanings (e.g. -2 means "not calculated"), since that would make an already complex series of error calculations even more confusing, and it doesn't save us much here: a null check and a check for -2 amount to basically the same thing.
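
A rough sketch of what that could look like (illustrative only; the field name follows the existing code but the logic is simplified):

// docCountError becomes a boxed Long so that three states are expressible:
//   null -> not calculated yet
//   -1   -> cannot be calculated because of the sort order (existing sentinel)
//   >= 0 -> calculated; a value of 0 is a real error of zero
private Long docCountError;

if (docCountError == null) {
  // first time we see this bucket in a reduction: compute the error now
} else if (docCountError == -1) {
  // error is not computable for this sort order
} else {
  // already calculated; trust the value, including 0
}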

javanna (Member, Author) commented Mar 14, 2019

Sounds good @colings86, I can look into this.

javanna added the help wanted and adoptme labels on Mar 26, 2019
javanna (Member, Author) commented Mar 26, 2019

Actually, I am not sure I will get to this anytime soon, so I have marked this issue "help wanted".

Hohol (Contributor) commented Jun 27, 2019

I'd like to take this.

Hohol (Contributor) commented Jun 27, 2019

As far as I can see, there is binary serialization/deserialization of InternalTerms.Bucket values.
Can we assume serialization will always be performed when docCountError is non-null?
If not, we need to change the binary format. What about cross-version compatibility?

Hohol (Contributor) commented Jul 1, 2019

@javanna, @colings86, could you answer, please?
Is this issue still relevant at all?

polyfractal (Contributor) commented

> As far as I can see, there is binary serialization/deserialization of InternalTerms.Bucket values.
> Can we assume serialization will always be performed when docCountError is non-null?

Are you referring to how it is serialized today? Currently, docCountError is a primitive long, so it is not possible for the variable to be null.

> If not, we need to change the binary format. What about cross-version compatibility?

We will indeed need to change the serialization, and worry about cross-version compatibility (clusters can be heterogeneous, so a new version might need to serialize a response to an older version and vice versa).

This is done by checking the input/output stream versions and serializing appropriately. The pattern is:

if (in.getVersion().onOrAfter(Version.V_7_3_0)) {
  this.docCountError = in.readOptionalLong();
} else {
  this.docCountError = in.readLong();
}

Similar for the output stream: when talking to an older node, we use the old serialization method, and when talking to newer nodes, we can use the "optional" methods to instantiate a Long instead of a long.
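
The output side would mirror this (a sketch under the same assumption that docCountError becomes a nullable Long; the exact version constant depends on when the change ships):

if (out.getVersion().onOrAfter(Version.V_7_3_0)) {
  out.writeOptionalLong(docCountError);
} else {
  // older nodes expect a primitive long, so we have to pick a fallback
  // value (e.g. 0) when the error has not been calculated yet
  out.writeLong(docCountError == null ? 0 : docCountError);
}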

You can find more examples of this pattern scattered around the code if you look at the input/output streams.

Hohol (Contributor) commented Jul 1, 2019

If I read from an old-version stream, the meaning of a 0 value is ambiguous.
Should I treat such a value as "0 error" or "not calculated"?

polyfractal (Contributor) commented

I think we'll want to treat it as "not calculated", since that's the less-bad way to resolve the ambiguity. E.g. if we interpret it as "0 error", we might assign a zero error to something that was simply not calculated yet and actually has a large error, leading to very incorrect error reports.

(@javanna is on holiday right now, but he can confirm when he's back)
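
Concretely, the old-version read branch could map the ambiguous 0 back to null (a sketch, assuming the nullable-Long approach discussed above):

if (in.getVersion().onOrAfter(Version.V_7_3_0)) {
  this.docCountError = in.readOptionalLong();
} else {
  long error = in.readLong();
  // 0 from an old node is ambiguous; resolve it as "not calculated"
  // rather than risk reporting a spuriously small error
  this.docCountError = error == 0 ? null : error;
}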

Hohol (Contributor) commented Jul 2, 2019

I have two questions regarding this piece of code:

if (terms.getBuckets().size() < getShardSize() || InternalOrder.isKeyOrder(order)) {
  thisAggDocCountError = 0;
}

1. Wouldn't <= instead of < be better in the first check?

2. How does key order guarantee that the error is 0? I found this mentioned in the documentation:

   > When the aggregation is ordered by the terms values themselves (either ascending or descending) there is no error in the document count since if a shard does not return a particular term which appears in the results from another shard, it must not have that term in its index.

   I don't understand what happens in this case when terms.getBuckets().size() > getShardSize(). Wouldn't some terms be skipped in that case regardless of ordering?

polyfractal (Contributor) commented Jul 2, 2019

> Wouldn't <= instead of < be better in the first check?

I believe it is < because, if the returned size is smaller than what we requested, we know definitively that we have all the terms from that shard. But if it is the same as the requested size, we're not sure whether that was everything the shard had or whether the list was truncated before being sent to the coordinator.
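
For example (hypothetical numbers, not the actual code path):

int shardSize = 5;                          // what the coordinator asked for
int returned = terms.getBuckets().size();   // what the shard sent back
// returned == 4 -> the shard had nothing more to send: its list is exhaustive
// returned == 5 -> the list may have been truncated at exactly shardSize
boolean exhaustive = returned < shardSize;  // hence '<', not '<='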

> How does key order guarantee that the error is 0?

Ordering by the term itself ("_key": "asc") is basically lexicographic sorting. So imagine we have three shards, each asked for its top 5 (and for simplicity every term has count: 1 on each shard):

Shard A | Shard B | Shard C
------- | ------- | -------
A: 1    | B: 1    | D: 1
B: 1    | C: 1    | E: 1
C: 1    | D: 1    | F: 1
D: 1    | E: 1    | G: 1
E: 1    | F: 1    | H: 1

We are sorting alphabetically, so since Shard B starts with "B" we know it doesn't have an "A" term (otherwise it would have sent a count for "A"). Similarly, we know Shard C doesn't have "A", "B", or "C".

When merging, we take the "top" 5 alphabetically from the returned results, meaning A through E in this case. After merging doc counts we get:

Top 5
A: 1
B: 2
C: 2
D: 3
E: 3

We know the counts are exact because we're not relying on doc_counts for ordering, but on the total lexicographic ordering. "A" is the "top" term because it was returned and sorts to the first position, regardless of its document count. Whether only one shard returns that value or all shards do doesn't matter, because it will sort to the "top" regardless of count.
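
A toy version of that merge (self-contained and illustrative only; the real reduction is far more involved):

import java.util.List;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

public class KeyOrderMerge {
  public static void main(String[] args) {
    // Each shard's key-ordered top 5 from the example above.
    List<Map<String, Integer>> shards = List.of(
        Map.of("A", 1, "B", 1, "C", 1, "D", 1, "E", 1),  // Shard A
        Map.of("B", 1, "C", 1, "D", 1, "E", 1, "F", 1),  // Shard B
        Map.of("D", 1, "E", 1, "F", 1, "G", 1, "H", 1)); // Shard C

    // Membership in the final top 5 depends only on the key, never on the
    // (partial) counts, so the merged counts are exact.
    SortedMap<String, Integer> merged = new TreeMap<>();
    for (Map<String, Integer> shard : shards) {
      shard.forEach((term, count) -> merged.merge(term, count, Integer::sum));
    }
    merged.entrySet().stream().limit(5)
        .forEach(e -> System.out.println(e.getKey() + ": " + e.getValue()));
    // prints A: 1, B: 2, C: 2, D: 3, E: 3
  }
}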

Hohol (Contributor) commented Jul 2, 2019

Got it, thanks!

Hohol (Contributor) commented Jul 3, 2019

The PR is ready for review: #43874

rjernst added the Team:Analytics label on May 4, 2020
imotov added a commit that referenced this issue Jul 22, 2021
…43874)

When performing incremental reductions, 0 value of docCountError may mean that 
the error was not previously calculated, or that the error was indeed previously 
calculated and its value was 0. We end up rejecting true values set to 0 this 
way. This may lead to wrong upper bound of error in result. To fix it, this PR 
makes docCountError nullable. null values mean that error was not calculated 
yet.

Fixes #40005

Co-authored-by: Igor Motov <[email protected]>
Co-authored-by: Elastic Machine <[email protected]>
ywangd pushed a commit to ywangd/elasticsearch that referenced this issue Jul 30, 2021
…lastic#43874)

imotov added a commit to imotov/elasticsearch that referenced this issue Aug 12, 2021
…lastic#43874)
imotov added a commit that referenced this issue Aug 16, 2021
…ons (#43874) (#76475)

Fixes #40005, #75667

Co-authored-by: Nikita Glashenko <[email protected]>
cbuescher (Member) commented

I'm reopening this issue: while removing an "@AwaitsFix" in CCSDuelIT in #85538 that was still pointing to this issue, I ran into a reproducible test failure that still seems to point at this problem. Removing the "@AwaitsFix" and running

./gradlew ':qa:multi-cluster-search:v8.3.0#multi-cluster' -Dtests.class="org.elasticsearch.search.CCSDuelIT" -Dtests.method="testTermsAggsWithProfile" -Dtests.seed=FC39998A856BD0CF

on master shows that the response to a search request with a terms aggregation on a CCS setup differs depending on whether it is run with or without round-trip minimization, and I think this touches incremental reductions.

With the above reproduction line I get the following mismatch in the doc_count_error_upper_bound of some terms aggregation buckets:

 2> java.lang.AssertionError: Didn't match expected value:
                         _clusters:
                             skipped: same [0]
                          successful: same [2]
                               total: same [2]
                           _shards:
                              failed: same [0]
                          successful: same [4]
                               total: same [4]
                      aggregations:
                      filter#answers:
                             doc_count: same [221]
            sterms#answer_per_question:
                                 buckets:
                                         0:
                                   doc_count: same [6]
                 doc_count_error_upper_bound: expected Integer [3] but was Integer [1]
                                         key: same [multi_cluster-27]
                                         1:
                                   doc_count: same [5]
                 doc_count_error_upper_bound: same [2]
                                         key: same [remote_cluster-47]
                                         2:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: expected Integer [4] but was Integer [2]
                                         key: same [multi_cluster-0]
                                         3:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: expected Integer [4] but was Integer [2]
                                         key: same [multi_cluster-11]
                                         4:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: expected Integer [3] but was Integer [1]
                                         key: same [multi_cluster-64]
                                         5:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: expected Integer [3] but was Integer [1]
                                         key: same [multi_cluster-86]
                                         6:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: same [2]
                                         key: same [remote_cluster-17]
                                         7:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: same [2]
                                         key: same [remote_cluster-29]
                                         8:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: same [2]
                                         key: same [remote_cluster-45]
                                         9:
                                   doc_count: same [4]
                 doc_count_error_upper_bound: same [2]
                                         key: same [remote_cluster-46]
             doc_count_error_upper_bound: expected Integer [4] but was Integer [2]
                     sum_other_doc_count: same [178]
