
update: in nodriver mode, avoid loading proc, users and interfaces related information #122

Conversation

FedeDP
Contributor

@FedeDP FedeDP commented Nov 9, 2021

Signed-off-by: Federico Di Pierro [email protected]

What type of PR is this?

Uncomment one (or more) /kind <> lines:

/kind cleanup

Any specific area of the project related to this PR?

/area libscap

/area libsinsp

What this PR does / why we need it:

In nodriver mode, skip loading proc, users and interfaces related information, as all event sources will be system-external.

Which issue(s) this PR fixes:

Possibly: falcosecurity/falco#1757

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

NONE

@poiana
Contributor

poiana commented Nov 9, 2021

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: FedeDP
To complete the pull request process, please assign gnosek after the PR has been reviewed.
You can assign the PR to them by writing /assign @gnosek in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@poiana
Contributor

poiana commented Nov 9, 2021

Hi @FedeDP. Thanks for your PR.

I'm waiting for a falcosecurity member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@poiana poiana added the size/M label Nov 9, 2021
@leogr
Member

leogr commented Nov 10, 2021

/ok-to-test

Contributor

@mstemm mstemm left a comment

Don't you still want the initial process/user/interface lists in nodriver mode? I'm not sure I understand why we would not want them.

@FedeDP
Contributor Author

FedeDP commented Nov 11, 2021

Don't you still want the initial process/user/interface lists in nodriver mode? I'm not sure I understand why we would not want them.

What's the use for that? I mean, is there any particular use case where we want that in nodriver mode?

EDIT: surely I could be missing a piece there :)

… loading proc, users and interfaces related information.

Signed-off-by: Federico Di Pierro <[email protected]>
@FedeDP FedeDP force-pushed the nodriver_no_scan_procs_users_interfaces branch from 11f87d3 to a70dd4a on November 12, 2021 16:13
@mstemm
Contributor

mstemm commented Nov 12, 2021

I thought nodriver mode still attempted to get a reconstructed view of the world and a minimal set of events that could be written to a capture file. (I don't remember the details). Do you know?

@FedeDP
Contributor Author

FedeDP commented Nov 15, 2021

I thought nodriver mode still attempted to get a reconstructed view of the world and a minimal set of events that could be written to a capture file. (I don't remember the details). Do you know?

As nodriver mode has no events from syscalls, there is no real "world's view" in that sense.
We always dump:

  • scap_write_machine_info
  • scap_write_iflist
  • scap_write_userlist
  • scap_write_proclist
  • scap_write_fdlist

when opening the scap file. With this change, all of these will be empty (except possibly machine_info).
Note, moreover, that these scap_write_ functions already have a

//
// No machine info on disk if the source is a plugin
//
if(handle->m_mode == SCAP_MODE_PLUGIN)
{
	return SCAP_SUCCESS;
}

I.e., we already avoid dumping the initial world view for plugins; this is what we need to do not only for plugins, but whenever the internal event source (i.e. syscalls) is disabled.
I may rearrange the code with a more explicit check (e.g. adding a helper method on scap, is_system_event_source_enabled(), or something similar).
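For illustration only, here is a rough sketch of how such a helper could generalize the existing plugin-mode check; apart from SCAP_MODE_PLUGIN, handle->m_mode and the proposed is_system_event_source_enabled() name, every identifier below is an assumption rather than actual libscap code:

// Hypothetical sketch, not the actual libscap implementation.
// The internal event source (syscalls) is assumed to be off both in
// plugin mode and in nodriver mode.
static bool is_system_event_source_enabled(scap_t* handle)
{
	return handle->m_mode != SCAP_MODE_PLUGIN &&
	       handle->m_mode != SCAP_MODE_NODRIVER;
}

// Each scap_write_* dumper could then bail out with the same check:
if(!is_system_event_source_enabled(handle))
{
	return SCAP_SUCCESS;
}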

@mstemm
Contributor

mstemm commented Nov 15, 2021

One big difference between nodriver mode and plugin mode is that when in plugin mode, the only events and state are within the plugin. All of libsinsp is effectively disabled.

That's not true for nodriver mode, where the goal is to still obtain process and thread level state of the system.

Let me get more familiar with nodriver mode and then comment more.

@mstemm
Contributor

mstemm commented Nov 16, 2021

I double-checked and nodriver mode depends on reading this info from /proc, so I think that if the goal is to come up with a "light" mode that allows only k8s audit logs and no syscalls, we should come up with another mode/solution, perhaps at the Falco level instead of the libs level, to fix this.

@FedeDP
Contributor Author

FedeDP commented Nov 16, 2021

I double-checked and nodriver mode depends on reading this info from /proc, so I think that if the goal is to come up with a "light" mode that allows only k8s audit logs and no syscalls, we should come up with another mode/solution, perhaps at the Falco level instead of the libs level, to fix this.

I think that when (if?) k8s audit logs become a plugin, the issue will be fixed in any case; it's OK for me to close this PR and wait for a proper plugin implementation.

@FedeDP
Contributor Author

FedeDP commented Nov 22, 2021

Closing because nodriver mode still needs a system view.
The issue itself (falcosecurity/falco#1757) will be fixed once k8s audit log becomes a plugin.

@FedeDP FedeDP closed this Nov 22, 2021
leogr pushed a commit to leogr/libs that referenced this pull request Jan 5, 2023
This reverts commit 35d80de.

It was probably causing some container runtime tests to fail.
leogr pushed a commit to leogr/libs that referenced this pull request Jan 5, 2023
* Revert "Revert "Merge upstream pr 688 (falcosecurity#121)" (falcosecurity#122)"

This reverts commit c8dbbf3.

This adds the fix back. I'll test with an agent PR that
updates/removes the tests.

* Add the ability to "defer" an async lookup

In some cases, the "server" code running run_impl might want to retry its work later. The current version can't do that: once a key is dequeued using dequeue_next_key, it has to call store_value or lose the request.

To make retries easier, add a method defer_lookup that pushes the key (and optional value) back onto the request queue with a configurable delay. After the delay, the key will be pulled again by a call to dequeue_next_key().

Signed-off-by: Mark Stemm <[email protected]>
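To make the mechanism concrete, the following is a minimal, self-contained sketch of a deferred-retry queue along these lines; it is not the libsinsp async_key_value_source implementation, and every type and member name below is an assumption made for illustration:

// Standalone sketch only; NOT the real async_key_value_source. All names
// below (retryable_lookup_queue, deferred_request, etc.) are made up.
#include <chrono>
#include <deque>
#include <mutex>
#include <optional>
#include <string>

struct deferred_request
{
	std::string key;
	std::optional<std::string> value;
	std::chrono::steady_clock::time_point available_after;
};

class retryable_lookup_queue
{
public:
	// defer_lookup: push the key (and optional value) back onto the
	// queue; dequeue_next_key() will return it again once `delay` has
	// elapsed.
	void defer_lookup(const std::string& key,
	                  const std::optional<std::string>& value,
	                  std::chrono::milliseconds delay)
	{
		std::lock_guard<std::mutex> guard(m_mutex);
		m_queue.push_back({key, value,
		                   std::chrono::steady_clock::now() + delay});
	}

	// dequeue_next_key: return the next key whose delay has elapsed,
	// or std::nullopt if nothing is ready yet.
	std::optional<std::string> dequeue_next_key()
	{
		std::lock_guard<std::mutex> guard(m_mutex);
		const auto now = std::chrono::steady_clock::now();
		for(auto it = m_queue.begin(); it != m_queue.end(); ++it)
		{
			if(it->available_after <= now)
			{
				std::string key = it->key;
				m_queue.erase(it);
				return key;
			}
		}
		return std::nullopt;
	}

private:
	std::mutex m_mutex;
	std::deque<deferred_request> m_queue;
};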

* Use defer_lookup for container info retry instead of lookup_delayed

When the container async lookup class wants to retry a lookup, the current version tries to use lookup_delayed to initiate a new request.

It turns out that this doesn't work: if there's already an existing request in m_value_map, it assumes that the "server" doing run_impl will eventually return an answer, and doesn't add a request to the queue.

The solution is to use the newly added defer_lookup instead, which pushes the request back onto the queue with a short delay.

Signed-off-by: Mark Stemm <[email protected]>
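As a hypothetical usage pattern for that retry path, reusing the retryable_lookup_queue sketch above: the do_lookup and store_value callbacks and the 125 ms delay are illustrative assumptions, not values taken from the real cri_async_source:

// Hypothetical "server" loop (cf. run_impl), built on the sketch above.
// do_lookup and store_value stand in for the real CRI query and result
// delivery; both are assumptions for illustration purposes.
#include <functional>

void serve_lookups(
        retryable_lookup_queue& queue,
        const std::function<std::optional<std::string>(const std::string&)>& do_lookup,
        const std::function<void(const std::string&, const std::string&)>& store_value)
{
	while(auto key = queue.dequeue_next_key())
	{
		if(auto info = do_lookup(*key))
		{
			// The runtime answered: hand the result back.
			store_value(*key, *info);
		}
		else
		{
			// Not ready yet: re-enqueue with a short delay instead
			// of dropping the request.
			queue.defer_lookup(*key, std::nullopt,
			                   std::chrono::milliseconds(125));
		}
	}
}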

* Use a separate max_wait_ms instead of re-using s_cri_timeout

Now that timeouts are working, it may take several seconds for subsequent retries to complete. However, s_cri_timeout (typically 1 second) was being used as the max_wait_ms in cri_async_source. That would mean that a lookup would expire before the server side had retried it.

The solution is to use a separate 10-second max_wait_ms, which matches Docker.

Signed-off-by: Mark Stemm <[email protected]>

@poiana poiana mentioned this pull request Feb 10, 2023