[CSM] Track image tags of syscalls in activity trees #27483

Gui774ume · 2024-07-10T16:17:05Z

What does this PR do?

This PR isolates syscall entries in activity trees into their own SyscallNode struct, so that we can start tracking syscalls per image tag. This is a two-step change: the next step is to update the protobuf to record these image tags.
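To make the idea concrete, here is a minimal sketch of what a per-syscall node carrying image tags could look like. It is not the code from this PR: the field names, the simplified `ProcessNode`, and the `InsertSyscall` and `SyscallsForTag` helpers are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"sort"
)

// SyscallNode is a hypothetical stand-in for the struct this PR introduces.
// The field names are assumptions, not the agent's actual definition.
type SyscallNode struct {
	Syscall   int      // syscall number
	ImageTags []string // image tags under which this syscall was observed
}

// ProcessNode is a simplified, hypothetical process node that owns its
// syscall nodes, keyed by syscall number.
type ProcessNode struct {
	Syscalls map[int]*SyscallNode
}

// NewProcessNode returns an empty process node.
func NewProcessNode() *ProcessNode {
	return &ProcessNode{Syscalls: make(map[int]*SyscallNode)}
}

// InsertSyscall records that syscall nr was seen under imageTag.
// It returns true if a new node or a new tag was recorded.
func (pn *ProcessNode) InsertSyscall(nr int, imageTag string) bool {
	node, found := pn.Syscalls[nr]
	if !found {
		node = &SyscallNode{Syscall: nr}
		pn.Syscalls[nr] = node
	}
	for _, tag := range node.ImageTags {
		if tag == imageTag {
			return false // already tracked for this image tag
		}
	}
	node.ImageTags = append(node.ImageTags, imageTag)
	return true
}

// SyscallsForTag lists, in ascending order, the syscalls seen under imageTag.
func (pn *ProcessNode) SyscallsForTag(imageTag string) []int {
	var out []int
	for nr, node := range pn.Syscalls {
		for _, tag := range node.ImageTags {
			if tag == imageTag {
				out = append(out, nr)
				break
			}
		}
	}
	sort.Ints(out)
	return out
}

func main() {
	pn := NewProcessNode()
	pn.InsertSyscall(59, "v1")  // execve on x86_64
	pn.InsertSyscall(59, "v2")  // same syscall, new image tag
	pn.InsertSyscall(257, "v2") // openat on x86_64
	fmt.Println(pn.SyscallsForTag("v1")) // [59]
	fmt.Println(pn.SyscallsForTag("v2")) // [59 257]
}
```

Keeping the tags on the node, rather than in a flat list of syscall numbers on the process, is what lets a syscall be attributed to the image versions that produced it.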

Motivation

Make sure syscalls are tracked like any other resource.

Additional Notes

Possible Drawbacks / Trade-offs

Describe how to test/QA your changes

pr-commenter · 2024-07-10T17:24:31Z

Regression Detector

Regression Detector Results

Metrics dashboard
Target profiles
Run ID: 7368438c-8f78-421d-979d-b7759f69bd3d

Baseline: 0a2d548
Comparison: ba2ea25
Diff

Optimization Goals: ✅ No significant changes detected

Fine details of change detection per experiment

perf	experiment	goal	Δ mean %	Δ mean % CI	trials	links
➖	quality_gate_idle_all_features	memory utilization	+0.79	[+0.68, +0.91]	1	Logs bounds checks dashboard
➖	uds_dogstatsd_to_api_cpu	% cpu utilization	+0.77	[+0.04, +1.51]	1	Logs
➖	quality_gate_idle	memory utilization	+0.56	[+0.51, +0.61]	1	Logs bounds checks dashboard
➖	basic_py_check	% cpu utilization	+0.46	[-3.39, +4.31]	1	Logs
➖	otel_to_otel_logs	ingress throughput	+0.41	[-0.26, +1.08]	1	Logs
➖	file_to_blackhole_500ms_latency	egress throughput	+0.31	[-0.45, +1.08]	1	Logs
➖	file_tree	memory utilization	+0.15	[+0.03, +0.28]	1	Logs
➖	file_to_blackhole_300ms_latency	egress throughput	+0.11	[-0.52, +0.73]	1	Logs
➖	file_to_blackhole_1000ms_latency_linear_load	egress throughput	+0.06	[-0.40, +0.53]	1	Logs
➖	uds_dogstatsd_to_api	ingress throughput	+0.01	[-0.09, +0.10]	1	Logs
➖	tcp_dd_logs_filter_exclude	ingress throughput	-0.00	[-0.02, +0.02]	1	Logs
➖	file_to_blackhole_100ms_latency	egress throughput	-0.01	[-0.77, +0.76]	1	Logs
➖	file_to_blackhole_0ms_latency	egress throughput	-0.04	[-0.85, +0.77]	1	Logs
➖	file_to_blackhole_1000ms_latency	egress throughput	-0.49	[-1.29, +0.31]	1	Logs
➖	tcp_syslog_to_blackhole	ingress throughput	-0.61	[-0.69, -0.53]	1	Logs
➖	pycheck_lots_of_tags	% cpu utilization	-1.08	[-4.45, +2.29]	1	Logs

Bounds Checks: ❌ Failed

perf	experiment	bounds_check_name	replicates_passed	links
❌	file_to_blackhole_1000ms_latency	lost_bytes	0/10
❌	file_to_blackhole_300ms_latency	lost_bytes	9/10
✅	file_to_blackhole_0ms_latency	lost_bytes	10/10
✅	file_to_blackhole_0ms_latency	memory_usage	10/10
✅	file_to_blackhole_1000ms_latency	memory_usage	10/10
✅	file_to_blackhole_1000ms_latency_linear_load	memory_usage	10/10
✅	file_to_blackhole_100ms_latency	lost_bytes	10/10
✅	file_to_blackhole_100ms_latency	memory_usage	10/10
✅	file_to_blackhole_300ms_latency	memory_usage	10/10
✅	file_to_blackhole_500ms_latency	lost_bytes	10/10
✅	file_to_blackhole_500ms_latency	memory_usage	10/10
✅	quality_gate_idle	memory_usage	10/10	bounds checks dashboard
✅	quality_gate_idle_all_features	memory_usage	10/10	bounds checks dashboard

Explanation

Confidence level: 90.00%
Effect size tolerance: |Δ mean %| ≥ 5.00%

Performance changes are noted in the perf column of each table:

✅ = significantly better comparison variant performance
❌ = significantly worse comparison variant performance
➖ = no significant change in performance

A regression test is an A/B test of target performance in a repeatable rig, where "performance" is measured as "comparison variant minus baseline variant" for an optimization goal (e.g., ingress throughput). Due to intrinsic variability in measuring that goal, we can only estimate its mean value for each experiment; we report uncertainty in that value as a 90.00% confidence interval denoted "Δ mean % CI".

For each experiment, we decide whether a change in performance is a "regression" -- a change worth investigating further -- if all of the following criteria are true:

Its estimated |Δ mean %| ≥ 5.00%, indicating the change is big enough to merit a closer look.
Its 90.00% confidence interval "Δ mean % CI" does not contain zero, indicating that if our statistical model is accurate, there is at least a 90.00% chance there is a difference in performance between baseline and comparison variants.
Its configuration does not mark it "erratic".
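The three criteria above can be sketched as a small predicate. This is an illustration only, not the Regression Detector's implementation: the `Experiment` type, its field names, and `IsRegression` are assumptions, and the `Erratic` flag is assumed to be false for the rows shown because the table does not report it.

```go
package main

import (
	"fmt"
	"math"
)

// Experiment is a hypothetical record for one row of the results table.
type Experiment struct {
	Name         string
	DeltaMeanPct float64 // "Δ mean %"
	CILow        float64 // lower bound of "Δ mean % CI"
	CIHigh       float64 // upper bound of "Δ mean % CI"
	Erratic      bool    // whether the experiment's configuration marks it "erratic"
}

// effectSizeTolerance is the |Δ mean %| threshold quoted in the explanation.
const effectSizeTolerance = 5.0

// IsRegression applies the three criteria: large enough effect,
// confidence interval excluding zero, and not marked erratic.
func IsRegression(e Experiment) bool {
	bigEnough := math.Abs(e.DeltaMeanPct) >= effectSizeTolerance
	excludesZero := e.CILow > 0 || e.CIHigh < 0
	return bigEnough && excludesZero && !e.Erratic
}

func main() {
	rows := []Experiment{
		// Values copied from the table above; Erratic is assumed false.
		{Name: "tcp_syslog_to_blackhole", DeltaMeanPct: -0.61, CILow: -0.69, CIHigh: -0.53},
		{Name: "pycheck_lots_of_tags", DeltaMeanPct: -1.08, CILow: -4.45, CIHigh: 2.29},
		// Hypothetical row, not from this run, that meets every criterion.
		{Name: "hypothetical_example", DeltaMeanPct: 6.20, CILow: 5.10, CIHigh: 7.30},
	}
	for _, r := range rows {
		fmt.Printf("%s: regression=%v\n", r.Name, IsRegression(r))
	}
}
```

Note that tcp_syslog_to_blackhole has an interval that excludes zero, yet it is not flagged because its effect size is well under the 5.00% tolerance.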

CI Pass/Fail Decision

✅ Passed. All Quality Gates passed.

quality_gate_idle, bounds check memory_usage: 10/10 replicas passed. Gate passed.
quality_gate_idle_all_features, bounds check memory_usage: 10/10 replicas passed. Gate passed.

pr-commenter · 2024-07-11T10:03:04Z

Test changes on VM

Use this command from test-infra-definitions to manually test this PR's changes on a VM:

inv create-vm --pipeline-id=49357010 --os-family=ubuntu

Note: This applies to commit ba2ea25

spikat

Just a small NIT, otherwise LGTM :)

pkg/security/security_profile/activity_tree/process_node.go

spikat · 2024-07-12T08:51:07Z

It would be great to add some functional tests too

lebauce · 2024-07-24T15:42:50Z

@Gui774ume Can you please and merge if applicable ?

Gui774ume · 2024-11-18T13:57:49Z

/merge

dd-devflow · 2024-11-18T13:57:59Z

Devflow running: `/merge`

View all feedbacks in Devflow UI.

2024-11-18 13:57:58 UTC ℹ️ MergeQueue: waiting for PR to be ready

This merge request is not mergeable yet, because of pending checks/missing approvals. It will be added to the queue as soon as checks pass and/or get approvals.
Note: if you pushed new commits since the last approval, you may need additional approval.
You can remove it from the waiting list with /remove command.

2024-11-18 13:59:00 UTC ⚠️ MergeQueue: This merge request was unqueued

This merge request was unqueued

Gui774ume · 2024-11-18T13:58:53Z

/merge -c

Gui774ume · 2024-11-18T14:10:43Z

/merge

dd-devflow · 2024-11-18T14:10:54Z

Devflow running: `/merge`

View all feedbacks in Devflow UI.

2024-11-18 14:10:53 UTC ℹ️ MergeQueue: waiting for PR to be ready

This merge request is not mergeable yet, because of pending checks/missing approvals. It will be added to the queue as soon as checks pass and/or get approvals.
Note: if you pushed new commits since the last approval, you may need additional approval.
You can remove it from the waiting list with /remove command.

2024-11-18 18:10:59 UTC ⚠️ MergeQueue: This merge request was unqueued

This merge request was unqueued

Gui774ume · 2024-11-19T12:22:31Z

/merge

dd-devflow · 2024-11-19T12:22:40Z

Devflow running: `/merge`

View all feedbacks in Devflow UI.

2024-11-19 12:22:40 UTC ℹ️ MergeQueue: waiting for PR to be ready

This merge request is not mergeable yet, because of pending checks/missing approvals. It will be added to the queue as soon as checks pass and/or get approvals.
Note: if you pushed new commits since the last approval, you may need additional approval.
You can remove it from the waiting list with /remove command.

2024-11-19 12:26:45 UTC ℹ️ MergeQueue: merge request added to the queue

The median merge time in main is 26m.


Gui774ume requested a review from a team as a code owner July 10, 2024 16:17

Gui774ume force-pushed the will/syscalls-node branch from 3bda2c7 to e3884f3 Compare July 10, 2024 16:17

github-actions bot added the component/system-probe label Jul 10, 2024

Gui774ume added changelog/no-changelog team/agent-security labels Jul 10, 2024

Gui774ume added this to the 7.57.0 milestone Jul 10, 2024

Gui774ume force-pushed the will/syscalls-node branch 2 times, most recently from f0ba9b9 to 6377ea5 Compare July 11, 2024 09:35

spikat approved these changes Jul 12, 2024


pkg/security/security_profile/activity_tree/process_node.go (outdated)

Gui774ume force-pushed the will/syscalls-node branch 4 times, most recently from 2acfd3d to 606e0ff Compare July 31, 2024 09:58

paulcacheux modified the milestones: 7.57.0, 7.58.0 Aug 9, 2024

paulcacheux modified the milestones: 7.58.0, Triage Sep 4, 2024

Gui774ume force-pushed the will/syscalls-node branch from 606e0ff to 011087a Compare November 18, 2024 13:56

github-actions bot added the medium review PR review might take time label Nov 18, 2024

Gui774ume modified the milestones: Triage, 7.60.0 Nov 18, 2024

Gui774ume added the qa/done QA done before merge and regressions are covered by tests label Nov 18, 2024

YoannGh approved these changes Nov 18, 2024


Gui774ume modified the milestones: 7.60.0, 7.61.0 Nov 18, 2024

Gui774ume force-pushed the will/syscalls-node branch 3 times, most recently from 48cab3d to 148ec03 Compare November 19, 2024 10:47

[CSM] Track image tags of syscalls in activity trees

ba2ea25

Gui774ume force-pushed the will/syscalls-node branch from 148ec03 to ba2ea25 Compare November 19, 2024 11:35

dd-mergequeue bot merged commit 83e319e into main Nov 19, 2024
220 of 222 checks passed

dd-mergequeue bot deleted the will/syscalls-node branch November 19, 2024 12:51

paulcacheux added a commit that referenced this pull request Nov 19, 2024

Revert "[CSM] Track image tags of syscalls in activity trees (#27483)"

44fae30

This reverts commit 83e319e.

paulcacheux mentioned this pull request Nov 19, 2024

Revert "[CSM] Track image tags of syscalls in activity trees" #31232

Merged
