Lambda exec wrapper calls upstream OTel Python auto instr script #164

NathanielRN · 2021-10-25T21:48:47Z

Description

@wangzlei and I worked together to figure out how to mostly auto-instrument Lambda functions using the upstream opentelemetry-instrument auto-instrumentation package.

In doing so, we found it best to re-write otel-instrument as a bash script.

We made two major updates:

Update the PYTHONPATH in `otel-instrument` so we can call `opentelemetry-instrument`

The auto-instrumentation package depends on two things:

finding the opentelemetry package
locating all the Python packages the user wants to instrument

Both require us to modify the PYTHONPATH environment variable right away in the otel-instrument script. AWS Lambda adds the correct paths, but it does so too late: only once it calls its originally intended entry point, python3 /var/runtime/bootstrap.py.
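To illustrate why the path has to be in place before the interpreter starts, here is a minimal sketch that uses only the standard library. It is not the otel-instrument script: the temporary directory stands in for the layer's package directory, and `DEMO_PATCHED` and `run_child` are hypothetical names made up for the demo. Python imports `sitecustomize` during startup, so the child only picks it up when the directory is already on PYTHONPATH at launch.

```python
import os
import subprocess
import sys
import tempfile


def run_child(extra_path=None):
    """Start a fresh interpreter and report whether the demo sitecustomize ran."""
    # Start from the current environment, minus anything that would skew the demo.
    env = {k: v for k, v in os.environ.items() if k not in ("PYTHONPATH", "DEMO_PATCHED")}
    if extra_path is not None:
        env["PYTHONPATH"] = extra_path
    result = subprocess.run(
        [sys.executable, "-c", "import os; print(os.environ.get('DEMO_PATCHED', 'no'))"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


with tempfile.TemporaryDirectory() as layer_dir:
    # Hypothetical stand-in for the sitecustomize.py that performs instrumentation.
    with open(os.path.join(layer_dir, "sitecustomize.py"), "w") as f:
        f.write("import os\nos.environ['DEMO_PATCHED'] = 'yes'\n")

    with_path = run_child(layer_dir)  # directory on PYTHONPATH before startup
    without_path = run_child()        # directory never added

print(with_path, without_path)
```

The second child never sees the file, which mirrors the "too late" problem: adding the directory after the interpreter has started would not re-run the startup import.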

Update the `otel_wrapper.py` to only call the `AwsLambdaInstrumentor`

All of the instrumentation is done in sitecustomize.py by opentelemetry-instrument. However, the AWS Lambda bootstrap.py file runs after sitecustomize.py, and the way it imports the Lambda handler CLEARS any instrumentation done on lambda_function.lambda_handler. That's why we keep otel_wrapper.py around: bootstrap.py can call it and do any destructive imports it needs to do, and the wrapper then calls AwsLambdaInstrumentor.instrument() explicitly. This way we know the Lambda function is instrumented, and we can import the Lambda handler and give bootstrap.py a handler that we know is instrumented.
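The effect described above can be sketched with the standard library alone. This is an illustration under assumptions, not the real code path: `importlib.reload` stands in for whatever destructive import bootstrap.py performs (its mechanism is not shown in this thread), `instrument` is a hypothetical stand-in for `AwsLambdaInstrumentor.instrument()`, and `demo_lambda_function` is a made-up module name.

```python
import importlib
import os
import sys
import tempfile

HANDLER_SOURCE = "def lambda_handler(event, context):\n    return 'ok'\n"


def instrument(module):
    """Hypothetical stand-in for the instrumentor: wrap the handler in place."""
    original = module.lambda_handler

    def traced_handler(event, context):
        return {"traced": True, "result": original(event, context)}

    module.lambda_handler = traced_handler


def is_instrumented(module):
    return module.lambda_handler.__name__ == "traced_handler"


with tempfile.TemporaryDirectory() as task_dir:
    with open(os.path.join(task_dir, "demo_lambda_function.py"), "w") as f:
        f.write(HANDLER_SOURCE)
    sys.path.insert(0, task_dir)
    importlib.invalidate_caches()
    try:
        # Early instrumentation, as sitecustomize.py would do it.
        module = importlib.import_module("demo_lambda_function")
        instrument(module)
        patched_early = is_instrumented(module)

        # A re-executing import rebinds lambda_handler to a fresh, unpatched function.
        module = importlib.reload(module)
        survived_reimport = is_instrumented(module)

        # The wrapper's job: instrument again after the import, then hand over the handler.
        instrument(module)
        response = module.lambda_handler({}, None)
    finally:
        sys.path.remove(task_dir)
        sys.modules.pop("demo_lambda_function", None)

print(patched_early, survived_reimport, response)
```

Instrumenting after the import is the only ordering in this sketch that leaves a patched handler, which is the reason given for keeping otel_wrapper.py.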

Future Work

We would ideally like to modify AWS Lambda's bootstrap.py, or investigate how we can get rid of otel_wrapper.py and have all the instrumentation finish in sitecustomize.py, so that bootstrap.py can just import the normal Lambda handler named by the _HANDLER environment variable without destroying the instrumentation at all.

Fixes #152

anuraaga · 2021-10-26T02:30:41Z

python/src/otel/otel_sdk/otel-instrument

+#   script can find them (it needs to find `opentelemetry` to find the auto
+#   instrumentation `run()` method later)
+
+if [ -z ${PYTHONPATH} ]; then


For any path-type variable it's conventional not to check for presence; empty strings are fine (they should be made fine for the resource attributes variable in the Python SDK separately too at some point).

Okay sounds good! Yeah I didn't know if users would dislike us modifying a variable they already set, but then again they're probably using this script BECAUSE they want us to handle all these little settings 😛

Changed it to this!

export PYTHONPATH="$LAMBDA_LAYER_PKGS_DIR:$PYTHONPATH";

And I guess you mean in the future someone can do this:

export OTEL_RESOURCE_ATTRIBUTES="$LAMBDA_RESOURCE_ATTRIBUTES,$OTEL_RESOURCE_ATTRIBUTES";

Thanks again for the prompt!

I looked into this and am fairly certain OTel Python SDK currently does handle empty Resource Attributes.

if not (key and isinstance(key, str)):
    _logger.warning("invalid key `%s`. must be non-empty string.", key)
    return None

They even have a test to protect against this:

def test_invalid_resource_attribute_values(self):
    resource = resources.Resource(
        {
            resources.SERVICE_NAME: "test",
            "non-primitive-data-type": {},
            "invalid-byte-type-attribute": b"\xd8\xe1\xb7\xeb\xa8\xe5 \xd2\xb7\xe1",
            "": "empty-key-value",
            None: "null-key-value",
            "another-non-primitive": uuid.uuid4(),
        }
    )
    self.assertEqual(
        resource.attributes,
        {
            resources.SERVICE_NAME: "test",
        },
    )
    self.assertEqual(len(resource.attributes), 1)

So I'll simplify this even further!

I actually was wrong about this... I've made a PR to fix this upstream: open-telemetry/opentelemetry-python#2256

And will follow up with a fix in #173
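For context, this is roughly the kind of sanitizing at stake when a value such as `"$LAMBDA_RESOURCE_ATTRIBUTES,$OTEL_RESOURCE_ATTRIBUTES"` expands with one side empty and leaves a stray comma. It is a minimal sketch, not the upstream SDK code or the fix in either PR; `parse_resource_attributes` is a hypothetical helper written for illustration.

```python
def parse_resource_attributes(raw):
    """Parse 'k1=v1,k2=v2' into a dict, skipping empty or malformed pairs."""
    attributes = {}
    for item in raw.split(","):
        item = item.strip()
        if not item:
            # A leading, trailing, or doubled comma produces an empty item.
            continue
        key, separator, value = item.partition("=")
        key = key.strip()
        if not separator or not key:
            # No '=' at all, or an empty key: not a usable pair.
            continue
        attributes[key] = value.strip()
    return attributes


print(parse_resource_attributes("service.name=demo,"))
print(parse_resource_attributes(",cloud.region=us-west-2"))
```

Skipping bad pairs instead of raising means an unset variable on either side of the comma does not break startup.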

NathanielRN · 2021-10-27T14:34:04Z

python/src/otel/tests/test_otel.py

+    # NOTE: Because we run as a subprocess, the python packages are NOT patched
+    # with instrumentation. In this test we just make sure we can complete auto
+    # instrumentation without error and the correct environment variabels are
+    # set. A future improvement might have us run `opentelemetry-instrument` in
+    # this process to imitate `otel-instrument`, but our lambda handler does not
+    # call other instrumented libraries so we have no use for it for now.
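One way a test can observe the environment variables a child script sets is to have the child print its environment and parse that output in the parent. The sketch below uses a stand-in Python child rather than the real `otel-instrument` script; `DEMO_CHILD_VAR`, `CHILD_CODE`, and `env_after_child` are hypothetical names, and JSON is used only to keep the parsing simple.

```python
import json
import os
import subprocess
import sys

# Stand-in child: sets a variable in its own process, then dumps its environment.
CHILD_CODE = """
import json, os
os.environ["DEMO_CHILD_VAR"] = "set-by-child"
print(json.dumps(dict(os.environ)))
"""


def env_after_child():
    """Run the child and return the environment it ended up with."""
    result = subprocess.run(
        [sys.executable, "-c", CHILD_CODE],
        capture_output=True,
        text=True,
        check=True,
    )
    return json.loads(result.stdout)


os.environ.pop("DEMO_CHILD_VAR", None)
child_env = env_after_child()

# The child's change never reaches the parent on its own.
parent_sees_change = "DEMO_CHILD_VAR" in os.environ

# The parent has to copy over what it wants from the parsed output.
os.environ["DEMO_CHILD_VAR"] = child_env["DEMO_CHILD_VAR"]
print(parent_sees_change, os.environ["DEMO_CHILD_VAR"])
```

The same isolation applies to monkey-patched libraries: patches made inside the child process are not visible to the process running the test.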


Just want to flag this comment. I tested it, and the subprocess does not patch the libraries in the parent process as I initially assumed 😞 I think we should merge as is, and the only suggestion I can think of is to call opentelemetry-instrument ourselves in this process if we really want the packages to be instrumented.

Either way the check=True below will help us make sure we can run both otel-instrument and opentelemetry-instrument through to completion without any errors.
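As a quick illustration of what `check=True` buys us, the sketch below runs a stand-in child command rather than the real scripts. `run_checked` is a hypothetical helper; `subprocess.run(..., check=True)` raises `CalledProcessError` when the child exits with a non-zero status, so a failing script fails the test instead of passing silently.

```python
import subprocess
import sys


def run_checked(code):
    """Run a Python snippet in a child process and report how it ended."""
    try:
        subprocess.run([sys.executable, "-c", code], check=True)
        return "ok"
    except subprocess.CalledProcessError as error:
        # Raised only because check=True was passed.
        return f"failed with exit code {error.returncode}"


print(run_checked("pass"))
print(run_checked("import sys; sys.exit(3)"))
```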

What's weird about this is that the TracerProvider must be getting initialized by the child opentelemetry-instrument call, but when I make a call to botocore for example I should be seeing more spans and yet I do not...

Actually I understand why now. We use from opentelemetry.test.test_base import TestBase, which initializes the TracerProvider at the setUpClass stage.

We can fix this in another PR. See Issue #168.

wangzlei

LGTM!

NathanielRN requested review from codeboten and wangzlei as code owners October 25, 2021 21:48

NathanielRN force-pushed the exec-script-calls-upstream-auto-instrument branch 2 times, most recently from 4e5fa9a to 5916901 Compare October 25, 2021 22:02

NathanielRN mentioned this pull request Oct 25, 2021

Only test Auto Instrumentation related Python Tests #165

Closed

Lambda exec wrapper calls upstream OTel Python auto instr script

e4276be

NathanielRN force-pushed the exec-script-calls-upstream-auto-instrument branch from 5916901 to e4276be Compare October 25, 2021 22:10

NathanielRN marked this pull request as draft October 25, 2021 22:36

NathanielRN mentioned this pull request Oct 25, 2021

Add instrumentation for AWS Lambda Service - pkg metadata files (Part 1/2) open-telemetry/opentelemetry-python-contrib#739

Merged

7 tasks

anuraaga reviewed Oct 26, 2021

NathanielRN force-pushed the exec-script-calls-upstream-auto-instrument branch from 3973c17 to a3badc1 Compare October 26, 2021 22:50

Update testing file to use bash script

f5b9cdf

NathanielRN force-pushed the exec-script-calls-upstream-auto-instrument branch from a3badc1 to f5b9cdf Compare October 26, 2021 22:52

NathanielRN commented Oct 26, 2021

NathanielRN marked this pull request as ready for review October 26, 2021 22:56

NathanielRN requested a review from anuraaga October 26, 2021 22:56

anuraaga approved these changes Oct 27, 2021

Parse subprocess output to modify environment

a897770

NathanielRN force-pushed the exec-script-calls-upstream-auto-instrument branch from de19b2a to a897770 Compare October 27, 2021 14:28

NathanielRN commented Oct 27, 2021

wangzlei approved these changes Oct 27, 2021

NathanielRN mentioned this pull request Oct 27, 2021

[Python] Tests that init OTel with otel-instrument script should not use TestBase setup method. #168

Closed

Set OTEL_RESOURCE_ATTRIBUTES without checking for previous value

2e3df55

wangzlei merged commit d58ef1e into open-telemetry:main Oct 28, 2021

NathanielRN mentioned this pull request Oct 28, 2021

Update patch of upstream OTel Python otel-instrument script aws-observability/aws-otel-lambda#158

Merged

NathanielRN deleted the exec-script-calls-upstream-auto-instrument branch November 1, 2021 18:11

This was referenced Nov 1, 2021

Make sure env var Python Resource Attr pairs are valid #173

Merged

Sanitize resource attribute pairs to avoid exception open-telemetry/opentelemetry-python#2256

Merged

wangzlei mentioned this pull request Jan 27, 2023

Improve nodejs layer by auto-instrumentations-node #448

Closed
