
Add args for sendmmsg events #1636

Open · Biagio-Dipalma opened this issue Jan 22, 2024 · 25 comments · May be fixed by #2027
Labels: kind/feature New feature or request

@Biagio-Dipalma

Motivation

Currently, the sendmmsg syscall is supported, but its entry and exit events carry no arguments. Adding parameters would significantly improve visibility into outbound connection events.

Feature

Add parameters to the sendmmsg events.

Biagio-Dipalma added the kind/feature New feature or request label Jan 22, 2024
@Andreagit97 (Member)

Thank you for this! We will try to figure out when to schedule it!

Andreagit97 added this to the TBD milestone Jan 31, 2024
@oheifetz (Contributor) commented Mar 9, 2024

So basically you would like sendmmsg to support an array of arguments, where each entry carries the same information sendmsg does, right?
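
(For reference, this mirrors how the kernel itself defines the interface: sendmmsg(2) batches several sendmsg-style headers into one call, and each mmsghdr wraps a regular msghdr plus a per-message msg_len that the kernel fills in on return.)

```c
/* Declarations as documented in sendmmsg(2); provided by
 * <sys/socket.h> when _GNU_SOURCE is defined. */
struct mmsghdr {
    struct msghdr msg_hdr; /* message header, as used by sendmsg(2) */
    unsigned int  msg_len; /* bytes transmitted for this message,
                              set by the kernel on return */
};

int sendmmsg(int sockfd, struct mmsghdr *msgvec,
             unsigned int vlen, int flags);
```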

@oheifetz (Contributor) commented Mar 9, 2024

I can take it if no one has already started to add this support

@Andreagit97 (Member)

Yes, we would like to have something similar to the sendmsg syscall:

```c
[PPME_SOCKET_SENDMSG_E] = {"sendmsg", EC_IO_WRITE | EC_SYSCALL, EF_USES_FD | EF_WRITES_TO_FD | EF_MODIFIES_STATE, 3, {{"fd", PT_FD, PF_DEC}, {"size", PT_UINT32, PF_DEC}, {"tuple", PT_SOCKTUPLE, PF_NA} } },
[PPME_SOCKET_SENDMSG_X] = {"sendmsg", EC_IO_WRITE | EC_SYSCALL, EF_USES_FD | EF_WRITES_TO_FD | EF_MODIFIES_STATE, 2, {{"res", PT_ERRNO, PF_DEC}, {"data", PT_BYTEBUF, PF_NA} } },
```

but since there is no need to split the information between the enter and exit events, I would put all of it in the exit event, so something like:

```c
[PPME_SOCKET_SENDMMSG_E] = {"sendmmsg", EC_IO_WRITE | EC_SYSCALL, EF_USES_FD, 0},
[PPME_SOCKET_SENDMMSG_X] = {"sendmmsg", EC_IO_WRITE | EC_SYSCALL, EF_USES_FD, 5, {{"res", PT_ERRNO, PF_DEC}, {"fd", PT_FD, PF_DEC}, {"data", PT_BYTEBUF, PF_NA}, {"size", PT_UINT32, PF_DEC}, {"tuple", PT_SOCKTUPLE, PF_NA} } },
```

> I can take it if no one has already started to add this support

Sure! Thank you!

@oheifetz (Contributor)

@Andreagit97 regarding your post above: you think there is no need to push any data to the ring buffer on enter, since the only thing that changes between enter and exit is msg_len (which may be important to the user), and all the other data can be pushed to the ring buffer on exit?

@Andreagit97 (Member)

Yes, exactly. In the exit event we have all the syscall parameters plus the return value, so collecting everything there should be enough; the enter event doesn't bring any additional value.

@poiana (Contributor) commented Jun 16, 2024

Issues go stale after 90d of inactivity.

Mark the issue as fresh with /remove-lifecycle stale.

Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Provide feedback via https://github.com/falcosecurity/community.

/lifecycle stale

@Andreagit97 (Member)

/remove-lifecycle stale

@FedeDP (Contributor) commented Jun 28, 2024

@oheifetz are you actively working on this? Otherwise I'd like to tackle it :)

@Molter73 (Contributor) commented Jul 5, 2024

Hey @FedeDP and @oheifetz, I don't know if there has been any progress on this, but I find myself needing data from both sendmmsg and recvmmsg, so I'll likely have to implement them myself for testing unless a PR is opened by either of you in the near future.

Just letting you know in case you want me to take over the issue.

@FedeDP (Contributor) commented Jul 8, 2024

No problem from me; indeed, thank you Mauro!
Waiting for @oheifetz to confirm his status on this!

@FedeDP (Contributor) commented Jul 22, 2024

Hey @Molter73, since @oheifetz is not answering anymore, I think you can take this one :) Are you willing to?
Thanks!

@Molter73 (Contributor)

Sorry for the delayed response; my priorities have shifted a bit for the next week or so, but I should be able to pick this up afterwards if that's OK.

In the meantime, I've had moderate success with a very naive implementation for the modern probe that basically translates every sendmmsg/recvmmsg into multiple sendmsg/recvmsg events. I know it is not the approach discussed earlier, but it made the userspace implementation really straightforward (almost no changes needed). I would like to hear thoughts on this approach from more knowledgeable people: would this be appropriate, or should I go back to issuing the proper events? What do folks think?
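
(For illustration, the translation idea boils down to a loop like the sketch below; push_sendmsg_exit_event is a hypothetical stand-in for the probe's real event-emission plumbing, which is not shown in this thread.)

```c
#define _GNU_SOURCE
#include <sys/socket.h>

/* Hypothetical helper: emits one PPME_SOCKET_SENDMSG_X-shaped event.
 * The real probe plumbing differs; this only illustrates the idea. */
void push_sendmsg_exit_event(int fd, struct msghdr *msg, unsigned int sent);

/* Sketch: translate one sendmmsg exit into multiple sendmsg-style events. */
static void translate_sendmmsg_exit(int fd, struct mmsghdr *msgvec, int res)
{
    /* res is the number of messages actually sent, per sendmmsg(2). */
    for (int i = 0; i < res; i++) {
        /* Each message gets its own event, hence its own socket tuple
         * and its own snaplen-truncated payload. */
        push_sendmsg_exit_event(fd, &msgvec[i].msg_hdr, msgvec[i].msg_len);
    }
}
```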

@FedeDP (Contributor) commented Jul 24, 2024

No problem for the delay! I would love to get this in Falco 0.39 so we got plenty of time :)

> In the meantime, I've had moderate success with a very naive implementation for the modern probe that basically translates every sendmmsg/recvmmsg into multiple sendmsg/recvmsg events

Mmmh, this means we pay the push cost (plus the event header size) multiple times for a single syscall? I don't think that is good from a perf point of view.
Moreover, given that libs are completely syscall-driven as of now, we should implement the syscall under its own event so that one can filter for it.
At the same time, I don't know how hard it will be to implement these syscalls, therefore... I summon @Andreagit97 to ask for his opinion :)

@Molter73 (Contributor)

> Mmmh, this means we pay the push cost (plus the event header size) multiple times for a single syscall? I don't think that is good from a perf point of view.

This is the same behavior you would get on an older system that doesn't have these syscalls, but it's a fair concern. I thought you would be more worried about the cost of pushing the received/sent data than about the header: the implementation I propose applies snaplen to each individual message rather than to the syscall as a whole, but I also don't know which is the correct/preferred behavior. When running with snaplen=0, we would only be pushing the header, the fd, the socket tuple, and a handful of other things, so I wouldn't assume it'd be a huge deal.

Either way, I'd like to hear from Andrea as well.

@Andreagit97 (Member)

Hey folks! I have to admit this is a tough call...

Let's say that, from the performance point of view, having just one event would be better for two reasons:

  • on the kernel side, we just need to push one event instead of many.
  • in userspace, we need to evaluate the rules on just one event (this is the expensive part).

BTW, what is not completely clear to me is how we would handle this unique event in userspace. If I recall correctly, sendmmsg, in the UDP case, can send many messages to different destinations. If that is the case, we will end up with not only an array of messages but also an array of tuples. So when we call fd.name, which tuple should we return? We would probably need to invent new logic to handle this case.

Please note that I'm not 100% sure the above case is possible; we should verify that sendmmsg can send messages to different destinations.

So, to summarize:
Unique event -> better performance, but difficult to manage on both the userspace and kernel side
Multiple events -> worse performance, but easier to implement and handle on both the userspace and kernel side

If the above point about multiple tuples is true, I would probably go with the multiple-sendmsg-events implementation.

@Biagio-Dipalma from the security point of view, do you think it is important to distinguish a sendmmsg from a sendmsg syscall? Are there attacks based only on the sendmmsg syscall? I don't think so, but better to ask, because with the multiple-events approach we lose the distinction between a sendmmsg and a sendmsg.

@FedeDP (Contributor) commented Jul 31, 2024

> This is the same behavior you would get on an older system that doesn't have these syscalls, but it's a fair concern.

You are right indeed.

> Unique event -> better performance, but difficult to manage on both the userspace and kernel side
> Multiple events -> worse performance, but easier to implement and handle on both the userspace and kernel side

I agree; thanks for the clarification!

> If the above point about multiple tuples is true, I would probably go with the multiple-sendmsg-events implementation.

+1 then, I agree the perf penalty is worth the solution's simplicity.

Thanks both for your inputs!

@Molter73 (Contributor)

> Please note that I'm not 100% sure the above case is possible; we should verify that sendmmsg can send messages to different destinations.

I'll try to test ASAP whether sending messages to multiple destinations in a single syscall is possible. I have a simple C program that I can adapt for it; I just haven't had the time these last few days.

> @Biagio-Dipalma from the security point of view, do you think it is important to distinguish a sendmmsg from a sendmsg syscall? Are there attacks based only on the sendmmsg syscall? I don't think so, but better to ask, because with the multiple-events approach we lose the distinction between a sendmmsg and a sendmsg.

I would also like to know this, but worst-case scenario, I can still send multiple messages so we have distinct socktuples: just use the sendmmsg header, look for every place in userspace where sendmsg is handled, and add the new event next to it. I just don't want to mess with a completely new implementation for a syscall that is basically already implemented and just needs some tweaking. 😅

@Andreagit97 (Member)

It makes sense! Moreover, it is never too late to switch and implement it as a completely new syscall. For example, if we see cases in which a system throws 10 million sendmmsg/s and, by splitting them, we would end up with a throughput of 30 Mevt/s (supposing 3 events for each syscall), maybe we can consider other possible approaches. But at the beginning I would probably start with something simple.

@Molter73 (Contributor) commented Aug 9, 2024

Well, I finally made some time to try sending messages to multiple destinations with sendmmsg, and it totally works, so some level of demultiplexing will be needed in order to have a single socktuple per event. I haven't checked whether recvmmsg is able to receive from multiple sources, but I have to assume it does (I'll verify it before opening a PR).
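
(For reference, a test along these lines can be as small as the sketch below: an unconnected UDP socket with a different msg_name per mmsghdr entry, so a single sendmmsg call targets two destinations. The addresses and ports are arbitrary; this is a reconstruction, not the actual program mentioned above.)

```c
#define _GNU_SOURCE
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdio.h>
#include <string.h>
#include <sys/socket.h>

int main(void)
{
    int fd = socket(AF_INET, SOCK_DGRAM, 0); /* unconnected UDP socket */

    /* Two different destinations (arbitrary loopback ports for the test). */
    struct sockaddr_in dst[2];
    memset(dst, 0, sizeof(dst));
    for (int i = 0; i < 2; i++) {
        dst[i].sin_family = AF_INET;
        dst[i].sin_port = htons(9090 + i);
        inet_pton(AF_INET, "127.0.0.1", &dst[i].sin_addr);
    }

    char payload[2][6] = {"msg-0", "msg-1"};
    struct iovec iov[2];
    struct mmsghdr msgs[2];
    memset(msgs, 0, sizeof(msgs));

    for (int i = 0; i < 2; i++) {
        iov[i].iov_base = payload[i];
        iov[i].iov_len = sizeof(payload[i]);
        msgs[i].msg_hdr.msg_iov = &iov[i];
        msgs[i].msg_hdr.msg_iovlen = 1;
        /* A per-message msg_name is what lets one call hit two destinations. */
        msgs[i].msg_hdr.msg_name = &dst[i];
        msgs[i].msg_hdr.msg_namelen = sizeof(dst[i]);
    }

    int sent = sendmmsg(fd, msgs, 2, 0);
    printf("sendmmsg sent %d messages\n", sent);
    return sent == 2 ? 0 : 1;
}
```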

I'll take a shot at implementing the syscall issuing multiple driver events per call, which would be the simplest. A second iteration could probably send a single event from the driver and create multiple events in userspace, but I would need to figure out what that looks like across libscap and libsinsp.

@Andreagit97 (Member)

Thank you for the research Mauro!

> I'll take a shot at implementing the syscall issuing multiple driver events per call, which would be the simplest. A second iteration could probably send a single event from the driver and create multiple events in userspace, but I would need to figure out what that looks like across libscap and libsinsp.

I agree with your plan!

@Molter73 (Contributor)

This is embarrassing: I managed to send multiple events from the modern probe to userspace without much trouble, but now that I've started trying to implement the same thing on the legacy probe and kernel module, I see it's not as straightforward, mainly because a lot of the logic for adding events to the ring buffers is abstracted away from the fillers. I'm considering jumping straight to generating a single event from the driver and demultiplexing in userspace. I figure I would need to declare the ppm_event_info for sendmmsg_x to look something like this:

```c
[PPME_SOCKET_SENDMMSG_X] = {"sendmmsg", EC_IO_WRITE | EC_SYSCALL, EF_USES_FD | EF_WRITES_TO_FD | EF_MODIFIES_STATE, 4, {{"res", PT_ERRNO, PF_DEC}, {"fd", PT_FD, PF_DEC}, {"msgvec", PT_MMSGHDR_ARRAY, PF_NA}, {"vlen", PT_UINT32, PF_DEC} } },
```

Where PT_MMSGHDR_ARRAY would be a special type holding an array of the data extracted for each message sent, basically (a hypothetical layout is sketched after this list):

  • socket tuple
  • data extracted from the IO vectors
  • size of the data extracted
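
(The struct below is one hypothetical way such an entry could be laid out; the type and field names are illustrative only and do not come from the libs codebase, and the real encoding would presumably be variable-length, like PT_SOCKTUPLE itself.)

```c
#include <stdint.h>

/* Hypothetical layout for one PT_MMSGHDR_ARRAY entry (illustration only). */
struct mmsghdr_array_entry {
    uint16_t tuple_len; /* length of the PT_SOCKTUPLE-encoded socket tuple */
    uint32_t data_len;  /* size of the data extracted from the IO vectors */
    uint8_t  payload[]; /* tuple bytes followed by the snaplen-truncated data */
};
```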

Now the problem I have is that neither sinsp nor scap is really designed to turn a single kernel event into multiple userspace events...

So, I either implement quite a bit of custom logic in userspace for this syscall specifically, make a bunch of nasty changes to the legacy probe and kernel module so they support sending multiple events, do just the modern probe implementation, or... yeah, I'm out of ideas. @FedeDP, @Andreagit97, do you have any thoughts on how I could move this change forward?

@FedeDP (Contributor) commented Aug 26, 2024

Hey Mauro, my 2c:

> make a bunch of nasty changes to the legacy probe and kernel module so they support sending multiple events, do just the modern probe implementation

These seem the more reasonable options; let's go through each of them:

> make a bunch of nasty changes to the legacy probe and kernel module so they support sending multiple events

I would be OK with this; if we can move stuff away from the (bpf_)val_to_ring helpers, it is a win. Indeed, for the old bpf probe we already stopped pushing many data types (the most basic ones, like integer types) through bpf_val_to_ring, since it causes many headaches and verifier issues.
What I mean is: if you are blocked by the fact that val_to_ring is a bit complex, we can work around that by avoiding it entirely.
If that's not the problem, would you mind sharing it a bit more in depth?

> do just the modern probe implementation

While we always strive for feature parity between drivers, I think that if making it work on the kmod and the old bpf probe gets too hard, we might consider making it a modern-ebpf-only feature. Again, this should be a last resort if everything else fails miserably 😆

I'd completely avoid the userspace magic to demultiplex a single event into multiple if possible.

@Andreagit97 (Member)

Yep, I agree with @FedeDP: the ideal solution would be to try to support multiple events in the kernel in all 3 drivers. I also agree with the idea of supporting it just in the modern probe and in the kmod if we see that it is really complex in the legacy ebpf; in the end, at a certain point we should put the legacy probe in maintenance mode.

@Molter73 (Contributor)

Thanks to both of you, I'll do a best-effort implementation on all drivers and open a PR so we can move the discussion to something with more context.

> I also agree with the idea of supporting it just in the modern probe and in the kmod if we see that it is really complex in the legacy ebpf

I can tell you right now that the implementation on the legacy probe will be incomplete due to verifier limitations, but more on that once I open the PR.

Molter73 linked a pull request Aug 27, 2024 that will close this issue