Oss merge libs 0.10.1 #874

greyhame-s · 2023-02-10T06:07:25Z

First attempt at 0.10.1 merge from upstream. I also specifically picked up Greg's completed fix for adding euid to execve exit events. I had to make a few agent changes to get the build to succeed, so that will be a separate PR that will need to go in soon after this one.

fixup commits

b562ea53 HEAD@{16}: commit: fixup! struct definitions moved by 53aad03
fac67317 HEAD@{18}: rebase (continue): !fixup - removed dead code, or code that will be added in a subsequent cherry pick
87c6c2fc HEAD@{34}: commit: fixup! Merge with sprintf - snprintf cleanup from 08f078c
c4d7129f HEAD@{37}: commit: fixup! Allow enabling/disabling individual container engines on startup (docker windows support dropped by 07acb8c9)
c523c796 HEAD@{39}: commit: fixup! Make sinsp remove_inactive_threads() method public (simple merge against 2f235d7)
b54e415c HEAD@{43}: commit: fixup! Add procfs_utils.ut.cpp to the test binary (simple merge against 4b0cf1e)
75e96eb8 HEAD@{48}: commit: fixup! Enhancements to initial scan of /proc, for supportability (Joe, your old changes are split across 3 linux specific files now. Greg, please check that 08fd40e and 0fedd76 are merged properly)
d7a5fe84 HEAD@{50}: commit: fixup! Fill thread loginuid with default value -1, if /proc loginuid is unavailable (Joe, please check if your changes were correctly moved to the new linux/scap_procs.c file)

Signed-off-by: Andrea Terzolo <[email protected]>

This change increases the number of retries to retrieve container information from CRI API from 3 to 5, as several failures were observed with the maximum number of attempts set to 3. Signed-off-by: Iacopo Rozzo <[email protected]>

Signed-off-by: Melissa Kilby <[email protected]>

Co-authored-by: Andrea Terzolo <[email protected]> Signed-off-by: Melissa Kilby <[email protected]>

Signed-off-by: Melissa Kilby <[email protected]>

This change makes sure that 5 maximum retries to retrieve container information are used with CRI only. It puts back the number of retries to 3 for all the other container runtimes. It also adjusts the maximum time to complete all their attempts to take into account the increased of retries. Signed-off-by: Iacopo Rozzo <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]> Co-authored-by: Hendrik Brueckner <[email protected]> Co-authored-by: Mauro Ezequiel Moltrasio <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]> Co-authored-by: Hendrik Brueckner <[email protected]>

Signed-off-by: Grzegorz Nosek <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]>

Signed-off-by: Jason Dellaluce <[email protected]>

…m target Signed-off-by: Andrea Terzolo <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]>

… seen by drivers Signed-off-by: Andrea Terzolo <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]>

Signed-off-by: Luca Guerra <[email protected]>

Signed-off-by: Adnan Ali <[email protected]>

Signed-off-by: Luca Guerra <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]>

…name Signed-off-by: Jason Dellaluce <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]>

Signed-off-by: Andrea Terzolo <[email protected]> Co-authored-by: Hendrik Brueckner <[email protected]>

Have sinsp_container_lookup with what was sinsp_container_lookup_state inside. Also introduce convenience methods. Signed-off-by: Angelo Puglisi <[email protected]>

This is still used in analyzer_thread.cpp so keep it in our fork.

* fix(container_engine): Only return on success or all retries failed Instead of always returning a result on the first attempt, only return results on success or when all retries have failed. This prevents spurious "container" events for incomplete results. This is especially important when both docker and cri are enabled, when both must be tried due to the cgroup pattern overlapping, but only one actually holds the container. Signed-off-by: Mark Stemm <[email protected]> * Log a warning when empty container infos are returned When empty container infos are passed up due to all attempts failing, log a warining. This will help highlight cases when the communication with the container runtime isn't working properly. Signed-off-by: Mark Stemm <[email protected]> * Add debug log to note when a lookup is async or sync The "async_xxx" refers to the code that performs the lookup (we used to have a separate "docker" engine, but it's been removed. To make it more clear about whether a lookup is synchronous or asynchronous, add a debug log. Signed-off-by: Mark Stemm <[email protected]> * Use bundled valijson for "regular" build valijson doesn't really have an ubuntu package, so it can't be preinstalled. Use the bundled valijson instead. * Add RE2 to container used for builds + tests This way it will be present when building with -DUSE_BUNDLED_DEPS=False Signed-off-by: Mark Stemm <[email protected]>

This reverts commit 35d80de. It was probably causing some container runtime tests to fail.

Userspace workaround for Linux kernel behaviors on ARM and zLinux, was not fully effective, and has since been obviated by kernel driver/eBPF probe logic to generate missing scap events by other means. So this changeset removes the userspace workaround.

* Revert "Revert "Merge upstream pr 688 (#121)" (#122)" This reverts commit c8dbbf3. This adds the fix back. I'll test with an agent PR that updates/removes the tests. * Add the ability to "defer" an async lookup In some cases, the "server" code running run_impl might want to retry its work until later. The current version can't do that--once a key is dequeued using deque_next_key, it has to call store_value or lose the request. To make retries easier, add a method defer_lookup that pushes the key (and optional value) back onto the request queue with a configurable delay. After delay, the key will be pulled again with a call to dequeue_next_key(). Signed-off-by: Mark Stemm <[email protected]> * Use defer_lookup for container info retry instead of lookup_delayed When the container async lookup class wants to retry a lookup, the current version tries to use lookup_delayed to initiate a new request. It turns out that that doesn't work--if there's already an existing request in m_value_map, it assumes that the "server" doing run_impl will eventually return an answer, and doesn't add a request to the queue. The solution is to use the newly added lookup_delayed instead, which pushes the request back onto the queue with a short delay. Signed-off-by: Mark Stemm <[email protected]> * Use a separate max_wait_ms instead of re-using s_cri_timeout Now that timeouts are working, it may take several seconds for subsequent retries to complete. However, s_cri_timeout (typically 1 second) was being used for the max_wait_ms in cri_async_source. That would mean that a lookup would expire before the server side had retried the lookup. The solution is to use a separate 10 second max_wait_ms, which matches docker. Signed-off-by: Mark Stemm <[email protected]> Signed-off-by: Mark Stemm <[email protected]>

See falcosecurity/libs#677

It isn't on Windows. Signed-off-by: Grzegorz Nosek <[email protected]>

The function sinsp_thread_manager::reinit_thread_from_proc() was added to draios/agent-libs as part of a now-obsolete workaround for an ARM/zLinux platform bug. That workaround has been removed, so now we need to remove this no-longer-used function from sinsp_thread_manager.

…er_group_manager API. Port falcosecurity/libs#753

In some cases, we want to ensure that a visitor does *not* change the ast. This includes cases where the ast pointer used by the visitor is read-only. To support these use cases, add a const_expr_visitor interface where all the visit() methods take a const argument. Also add variants of accept() that take const_expr_visitor arguments and call the const_expr_visitors visit() method. Compiling, cloning, and stringifying asts are all cases that should not change the underlying ast, so switch those to use const_expr_visitor instead of expr_visitor. A couple of compile private methods had to be changed to take const arguments. They already didn't modify those arguments, so it was a safe change. Signed-off-by: Mark Stemm <[email protected]> Signed-off-by: Mark Stemm <[email protected]>

We need them when building with hayabusa

We can't prevent losing setuid events completely and the uid is pretty important for some execve-related rules, so explicitly pass the uid in execve/at exit events Signed-off-by: Grzegorz Nosek <[email protected]> Co-authored-by: Angelo Puglisi <[email protected]> Co-authored-by: Andrea Terzolo <[email protected]>

poiana · 2023-02-10T06:07:27Z

Adding the "do-not-merge/release-note-label-needed" label because no release-note block was detected, please follow our release note process to remove it.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

poiana · 2023-02-10T06:07:29Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: greyhame-s
Once this PR has been reviewed and has the lgtm label, please assign incertum for approval by writing /assign @incertum in a comment. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

poiana · 2023-02-10T06:07:30Z

Thanks for your pull request. Before we can look at it, you'll need to add a 'DCO signoff' to your commits.

📝 Please follow instructions in the contributing guide to update your commits with the DCO

Full details of the Developer Certificate of Origin can be found at developercertificate.org.

The list of commits missing DCO signoff:

dc041af Conflicts for ff5aca4
d7a5fe8 fixup! Fill thread loginuid with default value -1, if /proc loginuid is unavailable (Joe, please check if your changes were correctly moved to the new linux/scap_procs.c file)
30060ff Conflicts for 89a8a08
75e96eb fixup! Enhancements to initial scan of /proc, for supportability (Joe, your old changes are split across 3 linux specific files now. Greg, please check that 08fd40e and 0fedd76 are merged properly)
0367ac2 Conflicts for 3b0e3ef
b54e415 fixup! Add procfs_utils.ut.cpp to the test binary (simple merge against 4b0cf1e)
3b8b0a1 Remove valijson support
ee41927 Restore setters used in tests
b0bf860 Conflicts for a90ea20
c523c79 fixup! Make sinsp remove_inactive_threads() method public (simple merge against 2f235d7)
2448648 Conflicts for 9b4c4e8
c4d7129 fixup! Allow enabling/disabling individual container engines on startup (docker windows support dropped by 07acb8c)
4af5cfc Add special case code to work around syscall default behavior
f24a7f7 Conflicts for 3c125ff
87c6c2f fixup! Merge with sprintf - snprintf cleanup from 08f078c
5bde7d4 Compile eBPF probe with -Wno-unknown-attributes
ba0f396 perf(sinsp): populate cmdline when setting threadinfo command args to eliminate repeated string concats.
5100c94 Fix after 9768501
5867cb5 Get CRI image metadata both from image and imageRef
a73eedd Fix CRI image tag detection (new: userfaultfd support #50)
10897e9 Workaround Linux on ARM event-generation deficiencies
520f659 Enable CLONE_EXIT_TO_CHILD workaround on s390x
974f9fb Fix logic to recognize and avoid reporting expected TID collisions
60504be CI with github actions
87bc52c Workaround for fatal: unsafe repository (REPO is owned by someone else)
78727ad Fix __STDC_FORMAT_MACROS issue
dbb523a remove redundant procfs_utils.ut.cpp
4e2cc63 Turn off gvisor support when building libs
826524d Conflicts for ab8be1c
fac6731 !fixup - removed dead code, or code that will be added in a subsequent cherry pick
6fdf17b Conflicts for c98bf05
b562ea5 fixup! struct definitions moved by 53aad03
2909f7b Additional build changes
9c8464d Retain m_sysdig_agent_conf, was removed upstream
766a7ee Revert "Merge upstream pr 688 (Fix link for libraries contibutions in README.md #121)" (update: in nodriver mode, avoid loading proc, users and interfaces related informations #122)
1d15025 [SMAGENT-4237] Remove dead LIBSINSP_CPUARCH_THREAD_EVENT_BUG code (new(driver): correctly use WEXITSTATUS set of macros to retrieve procexit return code #126)
04e1f66 [container users and groups from process root #677] container users and groups from process root (Add Mark Stemm as libs OWNER #127)
388e871 [SMAGENT-4309] Remove obsolete function reinit_thread_from_proc() (Updated protobuf version for s390x #134)
f73eed7 [cleanup(userspace/libsinsp): small cleanups in user_group_manager API. #753] cleanup(userspace/libsinsp): small cleanups in user_group_manager API.
0459b52 Add extra include directories
2ae70f4 Resolve build errors from bad merges

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

Andreagit97 and others added 30 commits December 15, 2022 17:11

fix(driver): use extract__egid instread of extract__euid helper

ac2e18a

Signed-off-by: Andrea Terzolo <[email protected]>

fix(driver-modern-bpf): optimize exctract__tty lookups

ff9a370

Signed-off-by: Melissa Kilby <[email protected]>

cleanup(driver-bpf): optimize tty lookup

352837d

Signed-off-by: Melissa Kilby <[email protected]>

cleanup(driver-modern-bpf): re-use inode lookup

1a02109

Signed-off-by: Melissa Kilby <[email protected]>

cleanup(driver-modern-bpf): add comment to tty extraction

01830c3

Co-authored-by: Andrea Terzolo <[email protected]> Signed-off-by: Melissa Kilby <[email protected]>

cleanup(driver-modern-bpf): re-use inode lookup for sched_process_exec

76bff97

Signed-off-by: Melissa Kilby <[email protected]>

chore(driver): support external skeleton build for modern bpf

1d3f296

Signed-off-by: Andrea Terzolo <[email protected]>

doc: improve cmake comments

4535cd8

Signed-off-by: Andrea Terzolo <[email protected]> Co-authored-by: Hendrik Brueckner <[email protected]> Co-authored-by: Mauro Ezequiel Moltrasio <[email protected]>

docs: add documentation for the MODERN_BPF_SKEL_DIR option

b82bc3b

Signed-off-by: Andrea Terzolo <[email protected]> Co-authored-by: Hendrik Brueckner <[email protected]>

fix(sinsp): format PT_ABSTIME values

14f0137

Signed-off-by: Grzegorz Nosek <[email protected]>

update(ci): enable gh actions jobs on maintainers/ branches

e8ea980

Signed-off-by: Andrea Terzolo <[email protected]>

update(userspace/libscap): avoid owning events offset in test engine

d937062

Signed-off-by: Jason Dellaluce <[email protected]>

fix(userspace/libsinsp/test): own events offset in test engine

82f2f4c

Signed-off-by: Jason Dellaluce <[email protected]>

chore(userspace): manage not bundled libelf dependency adding a custo…

725732a

…m target Signed-off-by: Andrea Terzolo <[email protected]>

update(userspace): compute the sum of all drops in modern probe

258ec63

Signed-off-by: Andrea Terzolo <[email protected]>

fix(driver): drops should be considered in the total number of events…

04a0aa8

… seen by drivers Signed-off-by: Andrea Terzolo <[email protected]>

update(driver): improve logging in case of failed bpf loading

0fac704

Signed-off-by: Andrea Terzolo <[email protected]>

update(build): update libcurl to 7.87.0

772397f

Signed-off-by: Luca Guerra <[email protected]>

fix: handle capset_x missing thread_info

6f9569d

Signed-off-by: Adnan Ali <[email protected]>

update(build): update openssl to 1.1.1q

13800c9

Signed-off-by: Luca Guerra <[email protected]>

new(driver): add a new bpf map to retrieve PPM_SC codes

0c3d243

Signed-off-by: Andrea Terzolo <[email protected]>

new: implement generic events support in modern bpf probe

3e825d4

Signed-off-by: Andrea Terzolo <[email protected]>

fix(userspace/libsinsp): avoid exception failure on unknown k8s node …

1b54028

…name Signed-off-by: Jason Dellaluce <[email protected]>

fix: correctly free the state in modern bpf probe

cea6078

Signed-off-by: Andrea Terzolo <[email protected]>

new: support multiple CPUs per buffer

453cd0e

Signed-off-by: Andrea Terzolo <[email protected]>

update: propagate support to scap-open

8b38418

Signed-off-by: Andrea Terzolo <[email protected]>

update: propagate support to sinsp

ff44778

Signed-off-by: Andrea Terzolo <[email protected]>

update: set online_only as default in scap-open

88c7af6

Signed-off-by: Andrea Terzolo <[email protected]> Co-authored-by: Hendrik Brueckner <[email protected]>

greyhame-s and others added 17 commits February 9, 2023 10:27

Conflicts for c98bf05

6fdf17b

fixup! struct definitions moved by 53aad03

b562ea5

refactor(libsinsp/container): introduce sinsp_container_lookup class

ab27b1d

Have sinsp_container_lookup with what was sinsp_container_lookup_state inside. Also introduce convenience methods. Signed-off-by: Angelo Puglisi <[email protected]>

Additional build changes

2909f7b

Retain m_sysdig_agent_conf, was removed upstream

9c8464d

This is still used in analyzer_thread.cpp so keep it in our fork.

Revert "Merge upstream pr 688 (#121)" (#122)

766a7ee

This reverts commit 35d80de. It was probably causing some container runtime tests to fail.

[falcosecurity#677] container users and groups from process root (#127)

04e1f66

See falcosecurity/libs#677

fix(scap): don't assume __always_inline is defined

571330b

It isn't on Windows. Signed-off-by: Grzegorz Nosek <[email protected]>

[falcosecurity#753] cleanup(userspace/libsinsp): small cleanups in us…

f73eed7

…er_group_manager API. Port falcosecurity/libs#753

Add extra include directories

0459b52

We need them when building with hayabusa

Resolve build errors from bad merges

2ae70f4

poiana added do-not-merge/work-in-progress do-not-merge/release-note-label-needed dco-signoff: no labels Feb 10, 2023

poiana added the size/XXL label Feb 10, 2023

poiana requested review from hbrueckner and Molter73 February 10, 2023 06:07

greyhame-s closed this Feb 10, 2023

jasondellaluce deleted the oss-merge-libs-0.10.1-2023-02-02 branch February 10, 2023 10:45

jasondellaluce restored the oss-merge-libs-0.10.1-2023-02-02 branch February 10, 2023 10:46

greyhame-s deleted the oss-merge-libs-0.10.1-2023-02-02 branch March 24, 2023 16:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Oss merge libs 0.10.1 #874

Oss merge libs 0.10.1 #874

greyhame-s commented Feb 10, 2023

poiana commented Feb 10, 2023

poiana commented Feb 10, 2023

poiana commented Feb 10, 2023

Oss merge libs 0.10.1 #874

Oss merge libs 0.10.1 #874

Conversation

greyhame-s commented Feb 10, 2023

poiana commented Feb 10, 2023

poiana commented Feb 10, 2023

poiana commented Feb 10, 2023