retryOnTimeout / retryBackoff: #110

SSANSH · 2024-07-16T09:38:07Z

In the latest version of the elastic-transport library, 2 new features were introduced:

"retryOnTimeout" = false
d2956c2
"retryBackoff" =
(min: number, max: number, attempt: number): number {
  const ceiling = Math.min(max, 2 ** attempt) / 2
  return ceiling + ((Math.random() * (ceiling - min)) + min)
}

https://github.com/elastic/elastic-transport-js/pull/101/files
This breaks how retries should work on a cluster.
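To make the delays produced by the quoted retryBackoff formula easier to see, here is a minimal sketch that reuses the same arithmetic but makes the random source injectable. The `random` parameter, the `backoffRange` helper, and the `min = 0` / `max = 4` values are assumptions for illustration only; they are not taken from the library.

```javascript
// Same arithmetic as the retryBackoff function quoted in this issue.
// The `random` parameter is an addition for illustration so the bounds
// can be computed deterministically; it is not part of the library signature.
function retryBackoff(min, max, attempt, random = Math.random) {
  const ceiling = Math.min(max, 2 ** attempt) / 2;
  return ceiling + (random() * (ceiling - min) + min);
}

// Hypothetical helper: lower bound (random = 0) and upper bound (random = 1,
// which Math.random never actually reaches) of the delay for one attempt.
function backoffRange(min, max, attempt) {
  const low = retryBackoff(min, max, attempt, () => 0);
  const high = retryBackoff(min, max, attempt, () => 1);
  return [low, high];
}

// Illustrative values only: min = 0, max = 4.
for (let attempt = 1; attempt <= 3; attempt++) {
  const [low, high] = backoffRange(0, 4, attempt);
  console.log(`attempt ${attempt}: delay in [${low}, ${high})`);
}
```

With these assumed values, every retry waits a non-zero amount, including the very first one, which is the delay being questioned here when other nodes are still untried.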

🐛 Bug Report

If you have 3 Elasticsearch nodes with maxRetries = 4 and a query times out on node 1, the connection pool retries the timed-out query on node 2 and, if needed, on node 3.
With this update, the query is run only on node 1.
Moreover, this parameter cannot be changed on the Elastic client to restore the previous behavior.
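The difference described above can be sketched with a small simulation. This is not the library's code: `simulateRequest`, the round-robin node selection, and the `failing` set are all hypothetical simplifications, used only to show how a retryOnTimeout flag changes which nodes get tried.

```javascript
// Hypothetical simulation, not the elastic-transport implementation.
// Nodes are picked round-robin; any node listed in `failing` times out.
function simulateRequest({ nodes, failing, maxRetries, retryOnTimeout }) {
  const tried = [];
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    const node = nodes[attempt % nodes.length];
    tried.push(node);
    if (!failing.has(node)) {
      return { ok: true, tried };
    }
    // A timeout is retried only when retryOnTimeout is enabled.
    if (!retryOnTimeout) break;
  }
  return { ok: false, tried };
}

const cluster = {
  nodes: ['node1', 'node2', 'node3'],
  failing: new Set(['node1']),
  maxRetries: 4,
};

// retryOnTimeout = false: the request stops after node1 times out.
console.log(simulateRequest({ ...cluster, retryOnTimeout: false }));
// retryOnTimeout = true: the request moves on to node2 and succeeds.
console.log(simulateRequest({ ...cluster, retryOnTimeout: true }));
```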

Your Environment

node version: v20.15.0
@elastic/elasticsearch 8.14.0
os: Linux


SSANSH · 2024-07-16T10:01:12Z

I found a way to disable this by overriding the Transport class with a custom class:

class CustomTransport extends _elasticsearch.Transport {
  constructor(options) {
    // Modify the transport options or add custom settings
    const customOptions = {
      ...options,
      retryBackoff: () => 0,
      retryOnTimeout: true,
    };

    super(customOptions);
  }
}

However, I would like to understand whether the added features are really correct.

SSANSH · 2024-07-16T10:36:52Z

From my point of view, retryBackoff should apply only when we retry on the same failing node, and the retry should always be applied.

JoshMock · 2024-07-17T16:42:25Z

Creating a custom transport is indeed the appropriate way to override these features for now.

Are you saying that you would expect timeouts to be retried when there are nodes in the pool on which the request has not yet been tried?

SSANSH · 2024-07-18T06:57:36Z

Yes, exactly, and in this case we don't need to apply backoff.
You have 3 nodes and node 1 fails.
We have to try node 2 and then node 3 without backoff.

JoshMock · 2024-07-22T20:51:31Z

I'm not sure we should ever retry on timeout unless retryOnTimeout is true, just to make that option as explicit as possible. If you know your cluster has multiple nodes, and you've set maxRetries to >= the number of nodes, it seems reasonable to suggest that you explicitly set retryOnTimeout to true in that case.

But what I do see you saying is that retryBackoff creates an unnecessary delay when retries are enabled, when there are multiple nodes. To address this concern, it might make sense for us to update the logic to not start using retryBackoff until a retry has been attempted on every healthy node in the pool.

Does that solution make sense to you?
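One possible reading of this proposal, sketched under stated assumptions: `deferredBackoff` and `nodeCount` are hypothetical names, `attempt` is assumed to count retries starting at 1, and the initial request is assumed to have used one node already. The actual change in the library may be implemented differently.

```javascript
// Hypothetical sketch of "no backoff until every node has been tried".
// `nodeCount` (healthy nodes in the pool) and the injectable `random`
// are assumptions for illustration, not library parameters.
function deferredBackoff(min, max, attempt, nodeCount, random = Math.random) {
  // The initial request used one node, so retries 1..nodeCount-1 can each
  // go to a node that has not been tried yet: no wait for those.
  if (attempt < nodeCount) return 0;

  // Once every node has been tried, start the backoff sequence from 1,
  // using the same arithmetic as the retryBackoff formula in this issue.
  const backoffAttempt = attempt - nodeCount + 1;
  const ceiling = Math.min(max, 2 ** backoffAttempt) / 2;
  return ceiling + (random() * (ceiling - min) + min);
}

// Lower bound of the delay (random fixed at 0) for a 3-node pool,
// with illustrative values min = 0 and max = 4.
const delays = [1, 2, 3, 4].map((attempt) => deferredBackoff(0, 4, attempt, 3, () => 0));
console.log(delays);
```

In this sketch the retries that land on node 2 and node 3 are immediate, and waiting only begins when the request would return to a node that has already failed.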

SSANSH · 2024-07-23T15:33:21Z

Changing the retryOnTimeout default to false is a breaking change from my point of view, so it needs to be documented in the changelog at a minimum.
For retryBackoff, I agree with the proposed solution.

JoshMock · 2024-08-28T19:25:23Z

The retryBackoff issue discussed here is fixed in #135 and will be published to npm soon.

JoshMock self-assigned this Jul 24, 2024

JoshMock added the enhancement New feature or request label Jul 24, 2024

JoshMock mentioned this issue Aug 28, 2024

Don't use exponential backoff until each node has been tried #135

Merged

JoshMock closed this as completed in #135 Aug 28, 2024
