[RFC] [ACIX-453] Implement Shared Agent 6 / 7 Tasks #31176

CelianR · 2024-11-18T13:27:10Z

What does this PR do?

Note

Documentation.
Please refer to this RFC.
Tasks will be migrated to shared tasks in future PRs.

This PR implements the RFC so that shared tasks can be created: tasks that can be executed against a specific branch.

Such a task can be created like this:

from invoke import task
from tasks.libs.common.worktree import agent_context

@task
def sharedtask(ctx, branch):
    with agent_context(ctx, branch):
        # Shared code to run against the given branch
        ...

Changes:

Implemented context switching
Added worktree tasks (init worktree context / run a command from this context)
Replaced the DEFAULT_BRANCH constant with get_default_branch(), which returns the default branch of the current context. Also refactored the code where main was hardcoded
GoModules are no longer cached (their content can change even when the path does not)
Tested agent context
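
To illustrate why a constant such as DEFAULT_BRANCH had to become a function: a module-level constant is fixed at import time, whereas a function can look up the active context on every call. The sketch below is only an illustration of that idea. The `_active_branch` variable and the `branch_context` helper are hypothetical names and are not the implementation from this PR.

```python
from contextlib import contextmanager
from contextvars import ContextVar

# Hypothetical: branch of the context we are currently in (None outside any context)
_active_branch = ContextVar("active_branch", default=None)


def get_default_branch() -> str:
    """Return the default branch of the current context, 'main' outside of any context."""
    return _active_branch.get() or "main"


@contextmanager
def branch_context(branch: str):
    """Hypothetical helper: make `branch` the default branch inside the `with` block."""
    token = _active_branch.set(branch)
    try:
        yield
    finally:
        # Restore whatever was active before, so nested contexts unwind correctly
        _active_branch.reset(token)
```

Code that reads `get_default_branch()` inside `with branch_context("6.53.x"):` sees `6.53.x`, while the same code outside the block sees `main`. A constant imported once could not change this way.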

Motivation

Describe how to test/QA your changes

Tasks

$ inv worktree.run 6.53.x pwd
# ~/dd/datadog-agent-worktree

$ inv worktree.run 6.53.x 'git branch'
# 6.53.x

$ inv worktree.invoke 6.53.x modules.show-all
# Will show fewer modules than `inv modules.show-all`

Possible Drawbacks / Trade-offs

Additional Notes


chouetz

TBD if we want to have the same behaviour (workdir) for agent6 & agent7 or not

chouetz · 2024-11-18T16:21:24Z

tasks/agent6.py

+    """Enters the agent 6 environment in order to invoke tasks in this context.
+
+    Note:
+        This task should be avoided when a --version, --major-version or --agent-version argument is available in the task.


I suppose we should update the release coordinator guide with what we must call in the agent6 context

chouetz · 2024-11-18T16:27:18Z

tasks/libs/common/agent6.py

+
+@contextmanager
+def agent_context(ctx, version: str | int | None):
+    """Runs code from the agent6 environment if the version is 6.


Is it only for agent6, or will/should we also create a worktree for agent7? This is a bit overkill since the tools are still in the datadog-agent repo, but it makes sense with the potential migration to devtool, wdyt?

chouetz · 2024-11-18T16:35:49Z

tasks/libs/releasing/notes.py

@@ -42,7 +43,7 @@ def _add_dca_prelude(ctx, agent7_version, agent6_version=""):
            f"""prelude:
    |
    Released on: {date.today()}
-    Pinned to datadog-agent v{agent7_version}: `CHANGELOG <https://github.com/{GITHUB_REPO_NAME}/blob/{DEFAULT_BRANCH}/CHANGELOG.rst#{agent7_version.replace('.', '')}{agent6_version}>`_."""
+    Pinned to datadog-agent v{agent7_version}: `CHANGELOG <https://github.com/{GITHUB_REPO_NAME}/blob/{get_default_branch()}/CHANGELOG.rst#{agent7_version.replace('.', '')}{agent6_version}>`_."""


Not related to this PR, but we might need to adapt this if we want it to work properly for both agent6 and agent7. Something like:

Suggested change

-    Pinned to datadog-agent v{agent7_version}: `CHANGELOG <https://github.com/{GITHUB_REPO_NAME}/blob/{get_default_branch()}/CHANGELOG.rst#{agent7_version.replace('.', '')}{agent6_version}>`_."""
+    Pinned to datadog-agent v{agent_version}: `CHANGELOG <https://github.com/{GITHUB_REPO_NAME}/blob/{get_default_branch()}/CHANGELOG.rst#{agent_version.replace('.', '')}>`_."""

cit-pr-commenter · 2024-11-18T16:45:21Z

Regression Detector

Regression Detector Results

Metrics dashboard
Target profiles
Run ID: 4ead2161-7c05-48d1-adb6-855be2577adc

Baseline: c1ac65c
Comparison: 259efe7
Diff

Optimization Goals: ✅ No significant changes detected

Fine details of change detection per experiment

perf	experiment	goal	Δ mean %	Δ mean % CI	trials	links
➖	tcp_syslog_to_blackhole	ingress throughput	+1.05	[+0.99, +1.12]	1	Logs
➖	uds_dogstatsd_to_api_cpu	% cpu utilization	+0.14	[-0.59, +0.88]	1	Logs
➖	file_to_blackhole_500ms_latency	egress throughput	+0.09	[-0.69, +0.86]	1	Logs
➖	file_to_blackhole_300ms_latency	egress throughput	+0.03	[-0.59, +0.66]	1	Logs
➖	uds_dogstatsd_to_api	ingress throughput	+0.01	[-0.10, +0.12]	1	Logs
➖	tcp_dd_logs_filter_exclude	ingress throughput	-0.00	[-0.01, +0.01]	1	Logs
➖	file_to_blackhole_1000ms_latency	egress throughput	-0.01	[-0.78, +0.77]	1	Logs
➖	file_to_blackhole_0ms_latency	egress throughput	-0.01	[-0.77, +0.75]	1	Logs
➖	file_to_blackhole_100ms_latency	egress throughput	-0.01	[-0.79, +0.76]	1	Logs
➖	quality_gate_idle_all_features	memory utilization	-0.18	[-0.31, -0.05]	1	Logs bounds checks dashboard
➖	file_to_blackhole_1000ms_latency_linear_load	egress throughput	-0.22	[-0.69, +0.25]	1	Logs
➖	quality_gate_idle	memory utilization	-0.45	[-0.53, -0.38]	1	Logs bounds checks dashboard
➖	otel_to_otel_logs	ingress throughput	-0.94	[-1.63, -0.25]	1	Logs
➖	file_tree	memory utilization	-1.26	[-1.40, -1.12]	1	Logs
➖	pycheck_lots_of_tags	% cpu utilization	-1.88	[-5.39, +1.62]	1	Logs
➖	basic_py_check	% cpu utilization	-4.42	[-8.28, -0.57]	1	Logs

Bounds Checks: ✅ Passed

perf	experiment	bounds_check_name	replicates_passed	links
✅	file_to_blackhole_0ms_latency	lost_bytes	10/10
✅	file_to_blackhole_0ms_latency	memory_usage	10/10
✅	file_to_blackhole_1000ms_latency	memory_usage	10/10
✅	file_to_blackhole_1000ms_latency_linear_load	memory_usage	10/10
✅	file_to_blackhole_100ms_latency	lost_bytes	10/10
✅	file_to_blackhole_100ms_latency	memory_usage	10/10
✅	file_to_blackhole_300ms_latency	lost_bytes	10/10
✅	file_to_blackhole_300ms_latency	memory_usage	10/10
✅	file_to_blackhole_500ms_latency	lost_bytes	10/10
✅	file_to_blackhole_500ms_latency	memory_usage	10/10
✅	quality_gate_idle	memory_usage	10/10	bounds checks dashboard
✅	quality_gate_idle_all_features	memory_usage	10/10	bounds checks dashboard

Explanation

Confidence level: 90.00%
Effect size tolerance: |Δ mean %| ≥ 5.00%

Performance changes are noted in the perf column of each table:

✅ = significantly better comparison variant performance
❌ = significantly worse comparison variant performance
➖ = no significant change in performance

A regression test is an A/B test of target performance in a repeatable rig, where "performance" is measured as "comparison variant minus baseline variant" for an optimization goal (e.g., ingress throughput). Due to intrinsic variability in measuring that goal, we can only estimate its mean value for each experiment; we report uncertainty in that value as a 90.00% confidence interval denoted "Δ mean % CI".

For each experiment, we decide whether a change in performance is a "regression" -- a change worth investigating further -- if all of the following criteria are true:

Its estimated |Δ mean %| ≥ 5.00%, indicating the change is big enough to merit a closer look.
Its 90.00% confidence interval "Δ mean % CI" does not contain zero, indicating that if our statistical model is accurate, there is at least a 90.00% chance there is a difference in performance between baseline and comparison variants.
Its configuration does not mark it "erratic".
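
The three criteria above can be sketched as a small predicate. This is an illustrative reading of the explanation, not the Regression Detector's actual code. The `Experiment` class and `is_regression` function are hypothetical names, and because the "erratic" flag is not shown in the tables, it is assumed to default to false here.

```python
from dataclasses import dataclass

EFFECT_SIZE_TOLERANCE = 5.00  # percent, taken from "Effect size tolerance" in the report


@dataclass
class Experiment:
    """Hypothetical record for one row of the change-detection table."""
    name: str
    delta_mean_pct: float
    ci_low: float
    ci_high: float
    erratic: bool = False  # assumption: not marked erratic unless stated


def is_regression(exp: Experiment) -> bool:
    """Apply the three criteria: effect size, CI excluding zero, not erratic."""
    big_enough = abs(exp.delta_mean_pct) >= EFFECT_SIZE_TOLERANCE
    ci_excludes_zero = exp.ci_low > 0 or exp.ci_high < 0
    return big_enough and ci_excludes_zero and not exp.erratic


# Row from the table above: the CI excludes zero, but |Δ mean %| is below 5.00
basic_py_check = Experiment("basic_py_check", -4.42, -8.28, -0.57)
print(is_regression(basic_py_check))
```

Under this reading, basic_py_check is not flagged even though its interval excludes zero, because its change is smaller than the effect size tolerance.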

CI Pass/Fail Decision

✅ Passed. All Quality Gates passed.

quality_gate_idle_all_features, bounds check memory_usage: 10/10 replicas passed. Gate passed.
quality_gate_idle, bounds check memory_usage: 10/10 replicas passed. Gate passed.

sabrina-datadog

looks good so far, had some minor suggestions! :)

sabrina-datadog

looks good to me! 👍

chouetz

Minor comment

chouetz · 2024-11-27T09:22:13Z

tasks/libs/common/worktree.py

+    """
+
+    if not WORKTREE_DIRECTORY.is_dir():
+        if not ctx.run(f"git worktree add '{WORKTREE_DIRECTORY}' origin/{branch or 'main'}", warn=True):


What's the point of initialising the worktree to main by default? Should we make the branch a mandatory argument?

In some cases, we want to enter the environment without changing the branch; main is only the fallback used to create the worktree. For example, some release tasks do not require the branch argument, so that we can:

Switch to the target branch once

Apply tasks such as tag_modules without specifying the branch again

In the release tasks, it is also possible that a task calls an inner function that reuses the current context without explicitly switching the branch.
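
The behaviour described in this reply (enter the worktree, but only switch branches when one is given) can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from tasks/libs/common/worktree.py. `enter_worktree` is a hypothetical stand-in for agent_context, and `run` stands for whatever executes a shell command (ctx.run in an invoke task).

```python
import os
from contextlib import contextmanager
from pathlib import Path
from typing import Callable, Optional


@contextmanager
def enter_worktree(run: Callable[[str], object], worktree_dir: Path, branch: Optional[str] = None):
    """Hypothetical stand-in for agent_context: run the block from inside the worktree."""
    previous_dir = Path.cwd()
    os.chdir(worktree_dir)
    try:
        if branch is not None:
            # Only switch branches when one is requested explicitly
            run(f"git checkout {branch}")
        yield
    finally:
        # Always return to the original directory, even if the block raises
        os.chdir(previous_dir)
```

With this shape, an outer task can call `enter_worktree(run, path, "6.53.x")` once, and inner helpers can call `enter_worktree(run, path)` to reuse the same checkout without naming the branch again.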

CelianR · 2024-11-27T10:30:05Z

/merge

dd-devflow · 2024-11-27T10:30:19Z

Devflow running: `/merge`

View all feedbacks in Devflow UI.

2024-11-27 10:30:18 UTC ℹ️ MergeQueue: pull request added to the queue

The median merge time in main is 23m.

CelianR added 5 commits November 15, 2024 14:48

[rfc-update-tasks-a6-a7] tmp

5e81855

[rfc-update-tasks-a6-a7] tmp

88d022c

[rfc-update-tasks-a6-a7] Implemented context + agent 6 switch for mod…

b36cef8

…ules commands [skip ci]

[rfc-update-tasks-a6-a7] tmp

7cc0f50

[rfc-update-tasks-a6-a7] Fixed refactoring [skip ci]

1f8ed02

CelianR added the changelog/no-changelog, qa/no-code-change (No code change in Agent code requiring validation) and team/agent-devx-infra labels Nov 18, 2024

CelianR self-assigned this Nov 18, 2024

CelianR added 3 commits November 18, 2024 14:40

[rfc-update-tasks-a6-a7] modules: Updated cache

e50350e

[rfc-update-tasks-a6-a7] Added agent6.invoke

728a0c0

[rfc-update-tasks-a6-a7] Updated all "main"

aa21a8e

github-actions bot added the component/system-probe and long review (PR is complex, plan time to review it) labels Nov 18, 2024

CelianR added 3 commits November 18, 2024 15:37

[rfc-update-tasks-a6-a7] Fixes, cleaning

d90e347

[rfc-update-tasks-a6-a7] Added tests

f195804

[rfc-update-tasks-a6-a7] unit-tests: Added agent 6 tests

6376156

CelianR marked this pull request as ready for review November 18, 2024 15:53

CelianR requested review from a team as code owners November 18, 2024 15:53

chouetz reviewed Nov 18, 2024

brycekahle reviewed Nov 18, 2024

CelianR added 2 commits November 19, 2024 11:12

[rfc-update-tasks-a6-a7] Applied suggestions, moved functions, remove…

1905845

…d agent 6 context ctx mgnr

[rfc-update-tasks-a6-a7] Removed system probe branch

5c37de7

github-actions bot removed the component/system-probe label Nov 19, 2024

CelianR added 2 commits November 20, 2024 10:46

[rfc-update-tasks-a6-a7] agent6: Add remove env

76a6bd1

[rfc-update-tasks-a6-a7] agent 6 -> worktree

6c12470

CelianR added 2 commits November 21, 2024 12:08

[rfc-update-tasks-a6-a7] switch -> checkout

210e96c

[rfc-update-tasks-a6-a7] switch -> checkout

7e61258

spencergilbert requested a review from sabrina-datadog November 21, 2024 12:27

CelianR added 5 commits November 21, 2024 18:12

[rfc-update-tasks-a6-a7] remove: Fixed

c0b62e1

[rfc-update-tasks-a6-a7] Added notes

bd9dd21

[rfc-update-tasks-a6-a7] Fix error note

2466d27

Merge branch 'main' into celian/rfc-update-tasks-a6-a7-acix-453

61706e3

Merge branch 'main' into celian/rfc-update-tasks-a6-a7-acix-453

29ca9d1

sabrina-datadog requested changes Nov 22, 2024

CelianR and others added 3 commits November 25, 2024 03:51

Apply suggestions from code review [skip ci]

a1290b5

Co-authored-by: sabrina lu

[rfc-update-tasks-a6-a7] Applied Sabrina's suggestions

1869b01

[rfc-update-tasks-a6-a7] skip_checkout: Verified that the branch is a…

c127193

…lready checked out

pducolin approved these changes Nov 25, 2024

CelianR and others added 6 commits November 25, 2024 08:01

Update tasks/libs/common/worktree.py

0ffb4d4

Co-authored-by: pducolin

[rfc-update-tasks-a6-a7] default_modules: Refactored since it is not …

1e57702

…cached anymore

[rfc-update-tasks-a6-a7] Fixed missing arg

7daf382

[rfc-update-tasks-a6-a7] Added various git commands to worktree and s…

14ad3d7

…kip_checkout option

[rfc-update-tasks-a6-a7] Updated error message

3d2e68b

[rfc-update-tasks-a6-a7] agent_context: If branch is None, then it wo…

3a2147e

…nt switch branch but will enter context

sabrina-datadog approved these changes Nov 25, 2024

CelianR and others added 3 commits November 26, 2024 10:08

[rfc-update-tasks-a6-a7] invoke: Added warning

74155c3

Apply suggestions from code review

bc87c2d

Co-authored-by: sabrina lu

Merge branch 'main' into celian/rfc-update-tasks-a6-a7-acix-453

259efe7

CelianR mentioned this pull request Nov 27, 2024

[ACIX-453] Agent 6 release tasks support #30062

Merged

chouetz approved these changes Nov 27, 2024

dd-mergequeue bot merged commit 4451a3f into main Nov 27, 2024
193 checks passed

dd-mergequeue bot deleted the celian/rfc-update-tasks-a6-a7-acix-453 branch November 27, 2024 11:00

github-actions bot added this to the 7.61.0 milestone Nov 27, 2024
