Specify when flush returns unsuccessfully #623

felixbarny · 2022-03-24T10:13:25Z

Create PR as draft
Approval by at least one other agent
Mark as Ready for Review (automatically requests reviews from all agents and PM via CODEOWNERS)
- Remove PM from reviewers if impact on product is negligible
- Remove agents from reviewers if the change is not relevant for them
Merge after 2 business days passed without objections
To auto-merge the PR, add /schedule YYYY-MM-DD to the PR description.

apmmachine · 2022-03-24T10:16:51Z

💚 Build Succeeded

the below badges are clickable and redirect to their specific view in the CI or DOCS

Expand to view the summary

Build stats

Start Time: 2023-03-31T05:18:00.028+0000
Duration: 3 min 25 sec

specs/agents/tracing-instrumentation-aws-lambda.md

Co-authored-by: Emily S <[email protected]>

basepi · 2022-03-24T16:53:10Z

specs/agents/tracing-instrumentation-aws-lambda.md

+In the edge case where the extension takes too much time to respond (e.g. if there's a lenghy GC pause),
+the `flush` method should return after a timeout.
+
+The default timeout is 1s.


As a data point for what is currently happening in the agents, we use api_request_time as our flush timeout, which defaults to 10s.

That's likely too long, especially for Lambda. But I'm thinking we should create a new config option so this is configurable (and divorced from api_request_time).

Suggested change

The default timeout is 1s.

| | |

|----------------|---|

| Type | [duration](configuration.md#configuration-value-types) |

| Default | `1s` |

| Dynamic | `true` |

specs/agents/tracing-instrumentation-aws-lambda.md

trentm

Sounds good to me as an improvement over #613

Co-authored-by: Trent Mick <[email protected]>

felixbarny · 2022-03-24T20:20:13Z

specs/agents/tracing-instrumentation-aws-lambda.md

@@ -291,3 +291,17 @@ Therefore, the Lambda instrumentation has to ensure that data is flushed in a bl

 Some Lambda functions will use the custom-built Lambda extension that allows the agent to send its data locally. The extension asynchronously forwards the data it receives from the agent to the APM server so the Lambda function can return its result with minimal delay. In order for the extension to know when it can flush its data, it must receive a signal indicating that the lambda function has completed. There are two possible signals: one is via a subscription to the AWS Lambda Logs API and the other is an agent intake request with the query param `flushed=true`. A signal from the agent is preferrable because there is an inherent delay with the sending of the Logs API signal.
 Therefore, the agent must send its final intake request at the end of the function invocation with the query param `flushed=true`. In case there is no more data to send at the end of the function invocation, the agent must send an empty intake request with this query param.
+
+### Flush timeout


This option is used by the Java agent already. See https://www.elastic.co/guide/en/apm/agent/java/current/config-serverless.html#config-data-flush-timeout

Suggested change

### Flush timeout

### Configuration option `data_flush_timeout`

felixbarny · 2022-03-24T20:23:34Z

specs/agents/tracing-instrumentation-aws-lambda.md

+In the edge case where the extension takes too much time to respond (e.g. if there's a lenghy GC pause),
+the `flush` method should return after a timeout.
+
+The default timeout is 1s.


Suggested change

The default timeout is 1s.

| | |

|----------------|---|

| Type | [duration](configuration.md#configuration-value-types) |

| Default | `1s` |

| Dynamic | `true` |

Specify when flush returns unsuccessfully

54fd027

felixbarny requested a review from trentm March 24, 2022 10:13

felixbarny mentioned this pull request Mar 24, 2022

Spec that agents in Lambda should *not* do back-off #613

Closed

4 tasks

estolfo reviewed Mar 24, 2022

View reviewed changes

specs/agents/tracing-instrumentation-aws-lambda.md Outdated Show resolved Hide resolved

Apply suggestions from code review

d0b99da

Co-authored-by: Emily S <[email protected]>

estolfo approved these changes Mar 24, 2022

View reviewed changes

basepi reviewed Mar 24, 2022

View reviewed changes

trentm reviewed Mar 24, 2022

View reviewed changes

specs/agents/tracing-instrumentation-aws-lambda.md Outdated Show resolved Hide resolved

trentm approved these changes Mar 24, 2022

View reviewed changes

Fix typo

dc5350a

Co-authored-by: Trent Mick <[email protected]>

felixbarny commented Mar 24, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specify when flush returns unsuccessfully #623

Specify when flush returns unsuccessfully #623

felixbarny commented Mar 24, 2022

apmmachine commented Mar 24, 2022 •

edited by jenkins-apm-ci bot

Loading

Build stats

basepi Mar 24, 2022

felixbarny Mar 24, 2022

trentm left a comment

felixbarny Mar 24, 2022

felixbarny Mar 24, 2022

-The default timeout is 1s.
+|                |   |
+|----------------|---|
+| Type  | [duration](configuration.md#configuration-value-types) |
+| Default        | `1s` |
+| Dynamic        | `true` |

	### Flush timeout
	### Configuration option `data_flush_timeout`

Specify when flush returns unsuccessfully #623

Are you sure you want to change the base?

Specify when flush returns unsuccessfully #623

Conversation

felixbarny commented Mar 24, 2022

apmmachine commented Mar 24, 2022 • edited by jenkins-apm-ci bot Loading

💚 Build Succeeded

Build stats

basepi Mar 24, 2022

Choose a reason for hiding this comment

felixbarny Mar 24, 2022

Choose a reason for hiding this comment

trentm left a comment

Choose a reason for hiding this comment

felixbarny Mar 24, 2022

Choose a reason for hiding this comment

felixbarny Mar 24, 2022

Choose a reason for hiding this comment

apmmachine commented Mar 24, 2022 •

edited by jenkins-apm-ci bot

Loading