Add circuit breaker for aggregation memory limit #67474
Pinging @elastic/es-analytics-geo (Team:Analytics)
@not-napoleon A request circuit breaker trip is more of a final protective check and is slow to react. What's more, after the breaker trips it rejects all requests, no matter whether they are small or large. In most cases users need a rapid response from ES; if a request is too large, they still want to get the rejection message quickly. @williamrandolph Is this a cluster-level setting that would apply to each request context, like …?
I see. Essentially, you're asking for a way to cap a large aggregation's memory usage without exhausting the available cluster resources. For example, you might want to say that one aggregation can never use more than 10% of memory (just making up a number, that's not a recommendation). This would then kill large requests while allowing smaller ones to complete, which is not the case with the current request circuit breaker. I'm not sure this would give a rejection message faster than the current approach, though: in general, aggregation memory usage is highest during the reduce phase, which happens fairly late in the request life cycle, and adding a per-request breaker doesn't really change that.
@not-napoleon Thanks for reading my comment and replying. I think you totally get my point.

**Datanode breaker trip:** In our production, we have ES nodes with 30GB heaps. When we run a high-cardinality query and rely on the circuit breaker, it takes 90 seconds before the parent circuit breaker trips (real memory exceeds 29.4GB). If we rely on `search.max_buckets` instead, it only takes a few seconds to raise the exception.
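The two behaviours contrasted above map onto existing cluster settings; a minimal sketch, assuming the default setting names (the values are illustrative, not recommendations):

```
# Proactive limit: fail a search quickly once its aggregations
# would create more than N buckets
PUT _cluster/settings
{
  "persistent": {
    "search.max_buckets": 10000
  }
}

# Reactive limit: the parent breaker only trips after real heap
# usage crosses this fraction of the configured heap
PUT _cluster/settings
{
  "persistent": {
    "indices.breaker.total.limit": "95%"
  }
}
```

The parent breaker has to wait for actual heap pressure, which is why it can take tens of seconds to react, while `search.max_buckets` is checked as buckets are allocated.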
**Coordinator breaker trip:** As you said, aggregation memory usage is highest during the reduce phase, which happens fairly late in the request life cycle, so adding a per-request breaker doesn't really change that. According to our tests, though, the part that consumes the most memory is holding the shard responses from all shards, especially when `batched_reduce_size` is not tuned. #67478 is already trying to fix this issue, but it is not a per-request breaker. And if we could break only the big query and let small queries run smoothly, that would be a benefit too.
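On the coordinator side, the buffering described above can already be bounded with the `batched_reduce_size` search parameter, which controls how many shard responses are buffered before a partial reduce; a sketch (the index name, field, and sizes are made up):

```
GET my-index/_search?batched_reduce_size=64
{
  "size": 0,
  "aggs": {
    "by_user": {
      "terms": { "field": "user.id", "size": 10000 }
    }
  }
}
```

Lowering `batched_reduce_size` trades extra partial-reduce passes for a smaller peak number of buffered shard responses on the coordinating node.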
Thanks for taking the time to reply. We discussed this as a team, and currently we do not want to head in this direction. It's not at all clear that killing large queries in favor of smaller queries is a good general policy; in our opinion, the existing aggregation circuit breaker covers the majority of cases. If we were to take up something like this in the future, I think it would have to come from a different direction. Elasticsearch doesn't currently support (or have a plan to support) per-user usage limiting, but I could see that as a more reasonable direction for something like this. Overall, we want to support the ability to run very large queries as needed, and don't think killing a query just because it is large, when we might otherwise be able to serve it, is a good policy. The assumption that small queries are more important than large ones doesn't hold in all cases.
Add a circuit breaker that limits the amount of memory an aggregation can use, for example:

`breaker.search.aggregation_memory.limit`

This should work better than `search.max_buckets`, as it would be more dynamic.
This issue was first raised as part of #62457 .
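For illustration only: the setting proposed above does not exist in Elasticsearch, but if it did, usage might look something like this (the setting name comes from the proposal; the value is hypothetical):

```
PUT _cluster/settings
{
  "persistent": {
    "breaker.search.aggregation_memory.limit": "10%"
  }
}
```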