[PROF-9476] Managed string storage for interning over several profiles #414

AlexJF · 2024-05-06T09:17:08Z

What does this PR do?

PoC for allowing string interning to survive across several profiles. This is used in DataDog/dd-trace-rb#3628 to reduce the overhead of heap profiling.

THIS CODE IS NOT PRODUCTION READY AND IS SIMPLY A HACKISH PoC!

Motivation

Heap profiling is a stateful mode of profiling where samples associated with live objects may be emitted across several profiles (those where the live object stays alive). Because of this, information such as the allocation class and allocation stacktrace needs to be preserved alongside the tracked object so heap samples can be re-inserted in subsequent profiles.

libdatadog already does a good job of handling and deduplicating strings in the span of a single profile. Exposing an API to allow doing this work across several profiles would prevent users from having to re-implement this outside of libdatadog and duplicating a lot of work.

Initial testing with dd-trace-rb shows great promise, allowing us to increase heap sampling rate by 10x with negligible overhead compared to the currently released implementation:

Additional Notes

Anything else we should know when reviewing?

How to test the change?

Describe here in detail how the change can be validated.

For Reviewers

If this PR touches code that signs or publishes builds or packages, or handles credentials of any kind, I've requested a review from @DataDog/security-design-and-guidance.
This PR doesn't touch any of that.

github-actions · 2024-10-23T19:42:45Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. To override this behavior, add the keep-open label or update the PR.

ivoanjo · 2024-10-24T08:59:07Z

I've added keep-open for this one, as I plan to pick it up soon ™️

ivoanjo · 2024-11-05T11:55:18Z

Closing in favor of #607

[PROF-9476] Managed string storage for interning over several profiles

bf637d9

github-actions bot added profiling Relates to the profiling* modules. ci-build labels May 6, 2024

AlexJF mentioned this pull request May 6, 2024

[PROF-9476] Managed string storage PoC DataDog/dd-trace-rb#3628

Draft

github-actions bot added the stale Used by actions/stale to identify PRs that have been inactive for 90+ days label Oct 23, 2024

ivoanjo added keep-open Overrides actions/stale auto-closing stale PRs and removed stale Used by actions/stale to identify PRs that have been inactive for 90+ days labels Oct 24, 2024

ivoanjo closed this Nov 5, 2024

ivoanjo self-assigned this Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PROF-9476] Managed string storage for interning over several profiles #414

[PROF-9476] Managed string storage for interning over several profiles #414

AlexJF commented May 6, 2024 •

edited

Loading

github-actions bot commented Oct 23, 2024

ivoanjo commented Oct 24, 2024

ivoanjo commented Nov 5, 2024

[PROF-9476] Managed string storage for interning over several profiles #414

[PROF-9476] Managed string storage for interning over several profiles #414

Conversation

AlexJF commented May 6, 2024 • edited Loading

What does this PR do?

Motivation

Additional Notes

How to test the change?

For Reviewers

github-actions bot commented Oct 23, 2024

ivoanjo commented Oct 24, 2024

ivoanjo commented Nov 5, 2024

AlexJF commented May 6, 2024 •

edited

Loading