Add sendmmsg support for UDP #1034

davidBar-On · 2020-08-09T16:43:50Z

Version of iperf3 (or development branch, such as master or
3.1-STABLE) to which this pull request applies:
3.10.1 latest master
Issues fixed (if any):
UDP throughput issue #873
Brief description of code changes (suitable for use as a commit message):

Add sendmmsg support for sending UDP messages for enhanced throughput.
sendmmsg is used by setting the -Z option (which is currently used only for TCP), as it is regarded as the UDP's alternative to TCP's zero copy.
The number of packets that are send by each call to sendmmsg is the burst size set by the -b option.

Note:

configure.ac was changed so running bootstrap.sh; configure is required for the changes to take effect. (New defines are HAVE_SENDMMSG, HAVE_RECVMMSG and HAVE-SEND_RECVMMSG.)
recvmmsg is not used because tests showed does not help the throughput and event may hart it. However, the changes for testing recvmmsg are commented out in iperf_udp_recv() and not removed in case further evaluation is desired. If this is not the case, then all changes tp iperf_udp_recv can be removed.

Catchup with laters iper3 changes

davidBar-On · 2021-09-18T11:28:23Z

@bmah888 I have change the PR to use -Z option for using sendmmsg for UDP, instead of adding a new option (I also rewrote the PR description). I also enhanced implementation, so if -Z is not used for UDP, practically there should not be any change to the current iperf3 behavior. Therefore the risk of adding these changes is low.

The UDP throughput enhancement achieved for high throughput interfaces is quite substantial, especially when the UDP messages are not large (witch is usually the case).

database64128 · 2022-04-13T14:31:52Z

@bmah888 Any plan to have this PR reviewed and merged?

database64128 · 2022-04-23T16:54:57Z

src/iperf_udp.c

+	    i = 0;	/* count of messages sent */
+	    r = 0;	/* total bytes sent */
+	    while (i < sp->sendmmsg_buffered_packets_count) {
+	        j = sendmmsg(sp->socket, &sp->msg[i], sp->sendmmsg_buffered_packets_count - i, MSG_DONTWAIT);


Before each sendmmsg(2) call, you should poll for socket write readiness. In my tests, this can significantly improve performance.

Can you add more details:

How do you do the polling?

From your experience, what should be done if the socket is not ready for write? The current design of the code is that the function does not return before all was sent (or there was an error). Do you suggest that in case the socket is not ready for write the function will return successfully, but without sending anything or before all was sent?

Do you understand why the method you suggest improve performance? I am asking since in any case, iperf3 will retry sending.

Thanks

Do you understand why the method you suggest improve performance?

In my case, I was doing raw syscalls in my Go program. The UDP socket opened by the Go runtime is in non-blocking mode. With sendmsg(2), if the sending operation was going to block, sendmsg(2) would return -EAGAIN or -EWOULDBLOCK, which is handled by the Go runtime to poll for socket write readiness with epoll. The calling goroutine can then be parked by the runtime to free the OS thread. (My limited understanding of Go internals might be inaccurate.)

Now with sendmmsg(2), according to the manual, a nonblocking call sends as many messages as possible (up to the limit specified by vlen) and returns immediately. By treating a non-complete return value the same way as -EAGAIN and -EWOULDBLOCK, that is, instead of immediately calling sendmmsg(2) again, I instructed the Go runtime to poll for write readiness before the next sendmmsg(2) call. This change yielded a 10% increase in throughput.

The current design of the code is that the function does not return before all was sent (or there was an error).

I'm not familiar with iperf3's code base. I just read some code, and it seems to me that iperf3 uses sockets in blocking mode for UDP tests. In this case, maybe it's better to simply drop the MSG_DONTWAIT flag, sendmmsg(2) would then only return when all messages have been sent. This saves even more syscall overhead.

@database64128, thanks a lot for the detailed explanation.

I will have to check how easy it is to implement this. To minimize iper3 design changes, the approach I took for sendmmsg is to accumulate packets iperf3 is sending and send them in bursts using sendmmsg. It may be that instead of the for loop, sendmmsg can be called once. In this case all the packets that were not sent can either be moved to the beginning of the buffer or ignored. (The issue with ignoring is that the packets are numbered, so the new packets numbering should start from the last successful packet sent.)

src/iperf_udp.c

davidBar-On added 2 commits May 16, 2020 10:57

temporary changes to undef congestion control

6882134

Merge remote-tracking branch 'upstream/master'

f316fcf

Catchup with laters iper3 changes

davidBar-On mentioned this pull request Aug 9, 2020

UDP throughput issue #873

Open

bmah888 linked an issue Sep 11, 2020 that may be closed by this pull request

UDP throughput issue #873

Open

davidBar-On mentioned this pull request Aug 27, 2021

Zero copy doesn't work with UDP #1193

Closed

davidBar-On added 2 commits September 3, 2021 17:09

Remove PR changes and cathcup with master code

e3cd055

Merge branch 'master' into iss873

6f8918e

davidBar-On force-pushed the iss873 branch 2 times, most recently from 635e879 to 6f8918e Compare September 18, 2021 10:30

Add 'sendmmg' support for UDP by setting '-Z' and using '-b' burt size

8be4586

davidBar-On changed the title ~~Add sendmmsg/recvmmsg support~~ Add sendmmsg support for UDP Sep 18, 2021

davidBar-On mentioned this pull request Mar 3, 2022

Zero copy for sending and receiving #1286

Open

database64128 suggested changes Apr 23, 2022

View reviewed changes

database64128 mentioned this pull request Apr 23, 2022

Discussion: UDP protocol performance optimization shadowsocks/shadowsocks-org#194

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add sendmmsg support for UDP #1034

Add sendmmsg support for UDP #1034

davidBar-On commented Aug 9, 2020 •

edited

Loading

davidBar-On commented Sep 18, 2021

database64128 commented Apr 13, 2022

database64128 Apr 23, 2022

davidBar-On Apr 24, 2022

database64128 Apr 24, 2022

davidBar-On Apr 24, 2022 •

edited

Loading

Add sendmmsg support for UDP #1034

Are you sure you want to change the base?

Add sendmmsg support for UDP #1034

Conversation

davidBar-On commented Aug 9, 2020 • edited Loading

davidBar-On commented Sep 18, 2021

database64128 commented Apr 13, 2022

database64128 Apr 23, 2022

Choose a reason for hiding this comment

davidBar-On Apr 24, 2022

Choose a reason for hiding this comment

database64128 Apr 24, 2022

Choose a reason for hiding this comment

davidBar-On Apr 24, 2022 • edited Loading

Choose a reason for hiding this comment

davidBar-On commented Aug 9, 2020 •

edited

Loading

davidBar-On Apr 24, 2022 •

edited

Loading