Infinite ingestion retry when batches are too large and using GuaranteedSend #14350

simitt · 2019-10-31T09:24:00Z

Elasticsearch returns status code 413 when a bulk request exceeds the size limit. A user can either increase the http.max_content_length in ES or decrease the bulk_max_size in the Beat to overcome such failures.
However, when this error happens and the beat is using a GuaranteedSend publisher method the current implementation can lead to an infinite retry, sending the same request to ES.
This might result in not being able to ingest any more events.

It might be worth exploring to use a special handling for the batch when the request size exceeds a limit, e.g. split it in half.

The text was updated successfully, but these errors were encountered:

ph · 2019-10-31T12:47:22Z

Related issue #3688

ph · 2019-10-31T12:48:30Z

Prior experience with that from LS logstash-plugins/logstash-output-elasticsearch#497

ph · 2019-10-31T13:27:24Z

Linked to #6749

faec · 2021-10-06T16:01:53Z

This issue probably still exists, but seems rare, is fixable with proper configuration, and was never allocated time in a release cycle -- unassigning so it can be re-triaged.

elasticmachine · 2021-10-06T16:02:12Z

Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane)

jlind23 · 2021-12-17T12:11:53Z

Ping @mukeshelastic @nimarezainia as you were both interested by this issue. It will be fixed to 8.1 thanks To @rdner

mukeshelastic · 2021-12-20T17:14:16Z

Thanks @jlind23

dikshachauhan-qasource · 2022-02-15T11:57:48Z

Hi @simitt

Could you please help us on this Ticket validation with below points:

How can we create bulk requests for elasticsearch?
Can it be covered under manual testing.

Thanks
QAS

simitt · 2022-02-15T13:30:29Z

@rdner given that you implemented the fix, can you please provide guidance for the testers.
It's been a long time since I created the issue, and don't believe this is reproducible anymore with the latest apm-server version, as it is not using libbeat output to ES anymore.

rdner · 2022-02-15T14:14:17Z

@dikshachauhan-qasource I described the testing process in my PR #29368

Let me know if it's missing something.

ph added the libbeat label Oct 31, 2019

urso assigned faec Nov 1, 2019

simitt added the bug label Nov 19, 2019

faec added the Team:Elastic-Agent-Data-Plane Label for the Agent Data Plane team label Oct 6, 2021

faec removed their assignment Oct 6, 2021

jlind23 added 8.1-candidate good first issue Indicates a good issue for first-time contributors labels Nov 30, 2021

jlind23 assigned rdner and faec Nov 30, 2021

jlind23 added v8.1.0 and removed 8.1-candidate labels Dec 3, 2021

rdner mentioned this issue Dec 9, 2021

Drop event batch when get HTTP status 413 from ES #29368

Merged

4 tasks

rdner closed this as completed in #29368 Dec 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Infinite ingestion retry when batches are too large and using GuaranteedSend #14350

Infinite ingestion retry when batches are too large and using GuaranteedSend #14350

simitt commented Oct 31, 2019 •

edited

Loading

ph commented Oct 31, 2019

ph commented Oct 31, 2019

ph commented Oct 31, 2019

faec commented Oct 6, 2021

elasticmachine commented Oct 6, 2021

jlind23 commented Dec 17, 2021 •

edited

Loading

mukeshelastic commented Dec 20, 2021

dikshachauhan-qasource commented Feb 15, 2022

simitt commented Feb 15, 2022

rdner commented Feb 15, 2022

Infinite ingestion retry when batches are too large and using GuaranteedSend #14350

Infinite ingestion retry when batches are too large and using GuaranteedSend #14350

Comments

simitt commented Oct 31, 2019 • edited Loading

ph commented Oct 31, 2019

ph commented Oct 31, 2019

ph commented Oct 31, 2019

faec commented Oct 6, 2021

elasticmachine commented Oct 6, 2021

jlind23 commented Dec 17, 2021 • edited Loading

mukeshelastic commented Dec 20, 2021

dikshachauhan-qasource commented Feb 15, 2022

simitt commented Feb 15, 2022

rdner commented Feb 15, 2022

simitt commented Oct 31, 2019 •

edited

Loading

jlind23 commented Dec 17, 2021 •

edited

Loading