Conntrack flush support #32505
Conversation
- Added conntrack support Signed-off-by: Flavio Crisciani <[email protected]>
- adding conntrack flush fix for moby#8795 Signed-off-by: Flavio Crisciani <[email protected]>
@fcrisciani thanks for resolving this long-standing issue, and a good test to catch it as well. LGTM
LGTM
c.Assert(err, check.IsNil)
var flowMatch int
for _, flow := range flows {
    if flow.Forward.Protocol == 17 &&
Can we use a constant for this magic number?
will do
@fcrisciani the windowsRS1 failure seems genuine -
When a container was being destroyed, it was possible for flows to be left behind in conntrack on the host. If a flow is present in the conntrack table, packet processing skips the POSTROUTING chain of iptables and uses the conntrack entry to do the translation. For this reason, long-lived flows created towards a container that is later destroyed can affect new flows coming into the host, creating erroneous conditions where traffic cannot reach new containers. The fix takes care of cleaning these entries up when a container is destroyed. The test in this commit reproduces the condition: a UDP flow is established towards a container that is then destroyed, and the test verifies that the established flow is gone once the container is destroyed. Signed-off-by: Flavio Crisciani <[email protected]>
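For illustration, here is a minimal sketch of what such a cleanup can look like using the conntrack API of the vishvananda/netlink library. The filter constant choice, the helper name `flushConntrackFor`, and the example IP are assumptions for this sketch, not the exact libnetwork code:

```go
package main

import (
	"fmt"
	"net"
	"syscall"

	"github.com/vishvananda/netlink"
)

// flushConntrackFor deletes the conntrack entries that reference the given
// container IP, so stale NAT state cannot bypass the POSTROUTING rules for
// new traffic. This is a sketch, not the actual libnetwork implementation.
func flushConntrackFor(ip net.IP) error {
	filter := &netlink.ConntrackFilter{}
	// Match flows whose NAT (reply) side points at the container address;
	// the filter-type constant is an assumption for this example.
	if err := filter.AddIP(netlink.ConntrackNatAnyIP, ip); err != nil {
		return err
	}
	purged, err := netlink.ConntrackDeleteFilter(netlink.ConntrackTable, syscall.AF_INET, filter)
	if err != nil {
		return err
	}
	fmt.Printf("purged %d conntrack flows for %s\n", purged, ip)
	return nil
}

func main() {
	// Hypothetical container address, used only to exercise the sketch.
	if err := flushConntrackFor(net.ParseIP("192.168.10.2")); err != nil {
		fmt.Println("conntrack flush failed:", err)
	}
}
```

Note that deleting conntrack entries goes through a netfilter netlink socket, so this needs root (CAP_NET_ADMIN) to actually run.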
8b8dfbe to 1c4286b
adding
Thanks! I'll go ahead and merge 👍
// Launch the server, this will remain listening on an exposed port and reply to any request in a ping/pong fashion
cmd := "while true; do echo hello | nc -w 1 -lu 8080; done"
_, _, err := dockerCmdWithError("run", "-d", "--name", "server", "--net", "testbind", "-p", "8080:8080/udp", "appropriate/nc", "sh", "-c", cmd) |
?
@fcrisciani I think @seemethere was referring to the use of a 3rd-party image, in this case one that has not been updated in 2 years. Could you replace it with a simple alpine from the official library? It includes nc.
@vieux @seemethere Using that image is not accidental; it is intended. The nc version in alpine does not support, for example, the -q option. There is also a bug in that nc that makes the command never return, even when the -w option is specified.
Feel free to try switching to alpine and removing the -q option so the command works, but most likely the test will fail with a timeout because the nc command will hang forever.
There is a race condition between the local proxy and the setup of the iptables rules. When there is a lot of UDP traffic, the kernel will create conntrack entries towards the local proxy and will ignore the iptables rules set after that. Related to PR #32505. Fix #8795. Signed-off-by: Vincent Bernat <[email protected]>
- What I did
Fixes #8795
Fixes #31610
Fixes #31414
On external connectivity removal, the bridge driver takes care of erasing all the conntrack flows related to the endpoint that is going down.
- How I did it
Introduced conntrack support in the netlink library; libnetwork then cleans up the flows by passing the IP address of the endpoint that is going down.
- How to verify it
Added a test that creates a server container and a client container and establishes a UDP flow (using UDP for simplicity) between them.
The test verifies the presence of the flow by fetching it from the conntrack table (a sketch of this check follows at the end of this description).
The server container is then destroyed, and the test validates that the previous flow was purged from conntrack.
- Description for the changelog
Conntrack flush on external connectivity removal
- A picture of a cute animal (not mandatory but encouraged)
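As referenced in the "How to verify it" section above, here is a minimal sketch of the conntrack check, assuming the vishvananda/netlink conntrack API and the 192.168.10.1:8080 gateway/port pair used by the test; the `countUDPFlows` helper is an illustration, not the exact test code:

```go
package main

import (
	"fmt"
	"net"
	"syscall"

	"github.com/vishvananda/netlink"
)

// countUDPFlows returns how many conntrack flows target the given destination
// IP and UDP port. Before the container is destroyed the test expects one
// match; after the removal it expects zero.
func countUDPFlows(dst net.IP, port uint16) (int, error) {
	flows, err := netlink.ConntrackTableList(netlink.ConntrackTable, syscall.AF_INET)
	if err != nil {
		return 0, err
	}
	matches := 0
	for _, flow := range flows {
		// syscall.IPPROTO_UDP is 17, the magic number discussed in the review above.
		if flow.Forward.Protocol == syscall.IPPROTO_UDP &&
			flow.Forward.DstIP.Equal(dst) &&
			flow.Forward.DstPort == port {
			matches++
		}
	}
	return matches, nil
}

func main() {
	// 192.168.10.1:8080 matches the gateway address and published port used in the test.
	n, err := countUDPFlows(net.ParseIP("192.168.10.1"), 8080)
	if err != nil {
		fmt.Println("conntrack list failed:", err)
		return
	}
	fmt.Printf("matching UDP flows: %d\n", n)
}
```

Listing the conntrack table also requires CAP_NET_ADMIN, which is why the test only runs against a local Linux daemon.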