[PROF-4535] Report code provenance metadata with Ruby profiles #1813

ivoanjo · 2021-12-16T12:38:20Z

The code provenance metadata will be used to power grouping and categorization of stack traces, and is basically a list of gem names, version and paths that have been loaded into a Ruby app that is being profiled. (Gems that have not been loaded are not reported)

This PR is pretty-much feature-complete, but I'm marking it as a draft because:

I still need to fix the benchmark that is broken
The profiling backend needs to be updated to correctly accept this data (currently it seems to reject the entire profile)

In terms of the profiling architecture, the code provenance metadata doesn't quite fit very much with the existing structure, and so the current approach seems somewhat tacked on.

I left a comment in the Recorder discussing this in detail, but TL;DR the Ruby profiler will be switched to report data through libddprof which will be a big architectural shift and mean many classes will probably change quite a lot (including the Recorder) and so it's not worth doing a huge refactoring now that we'll throw away in Q1.

codecov-commenter · 2021-12-16T18:21:23Z

Codecov Report

Merging #1813 (b931c45) into master (0aeb038) will increase coverage by 0.00%.
The diff coverage is 98.86%.

@@           Coverage Diff            @@
##           master    #1813    +/-   ##
========================================
  Coverage   98.21%   98.21%            
========================================
  Files         931      933     +2     
  Lines       44920    45048   +128     
========================================
+ Hits        44120    44246   +126     
- Misses        800      802     +2

Impacted Files	Coverage Δ
spec/ddtrace/profiling/integration_spec.rb	`97.31% <ø> (ø)`
spec/ddtrace/configuration/components_spec.rb	`99.40% <83.33%> (-0.40%)`	⬇️
lib/ddtrace/configuration/components.rb	`98.26% <100.00%> (+0.04%)`	⬆️
lib/ddtrace/configuration/settings.rb	`100.00% <100.00%> (ø)`
lib/ddtrace/ext/profiling.rb	`100.00% <100.00%> (ø)`
lib/ddtrace/profiling.rb	`100.00% <100.00%> (ø)`
...ib/ddtrace/profiling/collectors/code_provenance.rb	`100.00% <100.00%> (ø)`
lib/ddtrace/profiling/flush.rb	`100.00% <100.00%> (ø)`
lib/ddtrace/profiling/recorder.rb	`97.95% <100.00%> (+0.08%)`	⬆️
...b/ddtrace/profiling/transport/http/api/endpoint.rb	`100.00% <100.00%> (ø)`
... and 7 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0aeb038...b931c45. Read the comment docs.

marcotc

Looks good so far!

Although not stated explicitly, the Ruby profiler was previously using the 1.2 intake format. The 1.3 format (lightly documented [here](https://github.com/DataDog/profiling-backend/blob/prod/README.md#v3-intake-format-used-by-go-net-native) [Datadog-internal link, apologies]) shuffles around the fields a bit: * `recording-start` => `start` * `recording-end` => `end` * `data[0]` + `types[0]` => `data[somefilename]` * `runtime` => `family` * `format` is removed * `version` is added This change is not observable to customers; but is a requirement to submitting extra files along with profiles, as we plan to do in #1813.

ivoanjo · 2022-01-05T17:18:51Z

I've previously stated that this was marked as a draft because

This PR is pretty-much feature-complete, but I'm marking it as a draft because:

I still need to fix the benchmark that is broken

The profiling backend needs to be updated to correctly accept this data (currently it seems to reject the entire profile)

Item 1. has since been fixed, and 2. is fixed by #1820 . After that is merged in, I'll rebase this PR again, and this should be good to go.

Although not stated explicitly, the Ruby profiler was previously using the 1.2 intake format. The 1.3 format (lightly documented [here](https://github.com/DataDog/profiling-backend/blob/prod/README.md#v3-intake-format-used-by-go-net-native) [Datadog-internal link, apologies]) shuffles around the fields a bit: * `recording-start` => `start` * `recording-end` => `end` * `data[0]` + `types[0]` => `data[somefilename]` * `runtime` => `family` * `format` is removed * `version` is added This change is not observable to customers; but is a requirement to submitting extra files along with profiles, as we plan to do in #1813.

The `CodeProvenance` collector collects library metadata for loaded files in the Ruby VM. This data powers grouping and categorization of stack trace data. Also updated the `ProfilingDevelopment.md` with the new class and removed classes/modules that no longer exist.

Adding new arguments becomes really awkward and error-prone with this many positional arguments (and many of them being optional), so I decided to switch the Flush class to use keyword arguments. Lots of support-for-older-rubies boilerplate here :(

I'm not quite happy with how complex wiring this in is, and also not with how it looks (see also TODO on `Recording`), but I think it strikes a good balance between respecting the current architecture and also not requiring a massive refactoring.

The benchmark was broken by the addition of a `code_provenance` field to the flush object, which is not relevant to this benchmark. I did a bit of magic in a REPL to update the marshalled data to not break the benchmark.

I ran into this issue in the tests being run on GitHub Actions, since it installs our dependencies inside the dd-trace-rb folder. It's unclear to me if it can happen in actual customer setups, but I've decided to fix it anyway.

ivoanjo · 2022-01-07T10:39:39Z

All set, ready for review/re-review! :)

The "code provenance" metadata was added in #1813 but is not yet in use (and was never in any released version of ddtrace), so it's OK/safe to rename this field.

…e.json` The profiling team decided to rename this file for consistency. The code provenance feature (#1813) is not yet exposed to customers, and the only release made with the old file name is 1.0.0.beta1 so this does not cause any regression.

ivoanjo requested a review from a team December 16, 2021 12:38

ivoanjo marked this pull request as draft December 16, 2021 12:38

ivoanjo changed the title ~~Report code provenance metadata with Ruby profiles~~ Draft: Report code provenance metadata with Ruby profiles Dec 16, 2021

ivoanjo changed the title ~~Draft: Report code provenance metadata with Ruby profiles~~ Draft: [PROF-4535] Report code provenance metadata with Ruby profiles Dec 16, 2021

ivoanjo self-assigned this Dec 16, 2021

ivoanjo force-pushed the ivoanjo/prof-4535-report-code-provenance branch from 1e659f9 to b931c45 Compare December 16, 2021 18:11

marcotc approved these changes Dec 16, 2021

View reviewed changes

ivoanjo mentioned this pull request Jan 5, 2022

[PROF-4535] Switch profiling to use intake 1.3 format #1820

Merged

ivoanjo force-pushed the ivoanjo/prof-4535-report-code-provenance branch from b931c45 to 29ac298 Compare January 5, 2022 17:17

ivoanjo added 5 commits January 7, 2022 10:33

Fix profiler_submission benchmark to work with new Flush object

9af7949

The benchmark was broken by the addition of a `code_provenance` field to the flush object, which is not relevant to this benchmark. I did a bit of magic in a REPL to update the marshalled data to not break the benchmark.

Fix issue where gem is installed inside another gem's path

d3b464b

I ran into this issue in the tests being run on GitHub Actions, since it installs our dependencies inside the dd-trace-rb folder. It's unclear to me if it can happen in actual customer setups, but I've decided to fix it anyway.

ivoanjo force-pushed the ivoanjo/prof-4535-report-code-provenance branch from 29ac298 to d3b464b Compare January 7, 2022 10:34

ivoanjo changed the title ~~Draft: [PROF-4535] Report code provenance metadata with Ruby profiles~~ [PROF-4535] Report code provenance metadata with Ruby profiles Jan 7, 2022

ivoanjo marked this pull request as ready for review January 7, 2022 10:39

marcotc approved these changes Jan 20, 2022

View reviewed changes

ivoanjo merged commit 93cd757 into master Jan 21, 2022

ivoanjo deleted the ivoanjo/prof-4535-report-code-provenance branch January 21, 2022 15:11

github-actions bot added this to the 0.55.0 milestone Jan 21, 2022

ivoanjo mentioned this pull request Jan 24, 2022

Namespacing: Profiling #1849

Merged

1 task

ivoanjo mentioned this pull request Feb 1, 2022

Rename "type" field to "kind" in profiling code provenance metadata #1880

Closed

ivoanjo mentioned this pull request Feb 25, 2022

Rename code_provenance.json reported by profiler to code-provenance.json #1919

Merged

ivoanjo modified the milestones: 0.55.0, 1.0.0.beta1 Mar 4, 2022

ivoanjo mentioned this pull request May 26, 2022

Memory leak and memory bloat in resque processes when profiling is enabled (suspected to impact at least 0.54.2 and above) #2045

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PROF-4535] Report code provenance metadata with Ruby profiles #1813

[PROF-4535] Report code provenance metadata with Ruby profiles #1813

ivoanjo commented Dec 16, 2021

codecov-commenter commented Dec 16, 2021

marcotc left a comment

ivoanjo commented Jan 5, 2022

ivoanjo commented Jan 7, 2022

[PROF-4535] Report code provenance metadata with Ruby profiles #1813

[PROF-4535] Report code provenance metadata with Ruby profiles #1813

Conversation

ivoanjo commented Dec 16, 2021

codecov-commenter commented Dec 16, 2021

Codecov Report

marcotc left a comment

Choose a reason for hiding this comment

ivoanjo commented Jan 5, 2022

ivoanjo commented Jan 7, 2022