Mount /run/xtables.lock to prevent unwanted race conditions. #1186
What type of PR is this?
bug
Which issue(s) this PR fixes:
#989 (comment)
What this PR does / why we need it:
As initially mentioned in #989 (comment), we should mount
/run/xtables.lock
to prevent race conditions where multiple programs manipulate iptables at the same time. The
--wait
flag on iptables commands isn't actually respected unless we mount the iptables lock file into the container. The downside is that this introduces a blocking action where the CNI will wait until the lock is released.
I'm not sure how the Kubernetes manifests under
v1.x
are generated; I can copy the change by hand if that's the case.
If issue # is not available please add repro steps and logs from IPAMD/CNI showing the issue:
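A minimal sketch of what the hostPath mount could look like in the DaemonSet spec (the container and volume names here are illustrative assumptions, not copied from the actual manifest):

```yaml
# Illustrative fragment only; everything except /run/xtables.lock is an assumption.
spec:
  containers:
    - name: cni-plugin
      volumeMounts:
        - name: xtables-lock
          mountPath: /run/xtables.lock
  volumes:
    - name: xtables-lock
      hostPath:
        path: /run/xtables.lock
        # FileOrCreate ensures the lock file exists on the host before mounting,
        # so iptables inside the container and on the host share one lock.
        type: FileOrCreate
```

Sharing the file via hostPath is what lets the kernel-level flock taken by host iptables be visible to iptables inside the container; a file created only inside the container's mount namespace would not serialize with the host.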
Testing:
Running
flock -e /run/xtables.lock sleep 180
on the host before starting the container, you will observe that the CNI waits 3 minutes until the lock is released. In theory this wait could be indefinite, according to the iptables manual. Trying to replicate the exact bug I mentioned in the comment is extremely difficult because you have to be updating iptables at the exact same time as the CNI. We have only observed this bug in our production environments a handful of times so far.
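The blocking behaviour can be demonstrated without root by holding an exclusive flock on a stand-in lock file (a temp file here instead of /run/xtables.lock) and timing how long a second flock waits:

```shell
#!/bin/sh
# Use a temp file as a stand-in for /run/xtables.lock so this runs unprivileged.
LOCK=$(mktemp)

# Hold the lock exclusively for 3 seconds in the background,
# like `flock -e /run/xtables.lock sleep 180` does on the host.
flock -e "$LOCK" sleep 3 &

sleep 1  # give the background holder time to grab the lock

# This blocks until the holder releases the lock, just as
# `iptables -w` blocks on /run/xtables.lock.
start=$(date +%s)
flock -e "$LOCK" true
end=$(date +%s)

echo "waited $((end - start))s for the lock"
rm -f "$LOCK"
```

The second flock returns only after the background sleep exits, which is exactly the wait the CNI experiences while another program holds the iptables lock.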
Automation added to e2e:
n/a
Will this break upgrades and downgrades. Has it been tested?:
Will not break upgrades/downgrades.
Does this require config only upgrades?:
yes?
Does this PR introduce a user-facing change?:
yes
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.