-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
UDP access to a service from another node is broken with hostNetworking #6664
Comments
This sounds kind of like: flannel-io/flannel#1279 Do you see any different behavior if you run the I see that you're running Ubuntu - what version? What infrastructure is this running on? Bare metal, vsphere, ec2, etc? |
also cc @thomasferrandiz @manuelbuil |
@brandond Thanks for the fast response. This is running on Ubuntu 20.04 bare metal. We use Puppet to setup our OS and k3s so it should be consistent across the 4 clusters we have. I'll follow through that flannel issue and report back. |
I have since rebooted that machine again and the problem no longer happens so I can't test ethtool disabling checksum offloading. It seems to be intermittent with some reboots fixing it. I'll close this for now. If it happens again I'll re-open the issue and provide more info. |
@brandond Ended up in the same situation again. Can confirm that running What is the appropriate action from here? |
Your node has a kernel with broken tx checksum offload. You should disable it using that ethtool command during node startup. |
Thanks @brandond. I've applied that across all our machines. |
Environmental Info:
K3s Version:
Node(s) CPU architecture, OS, and Version:
Linux ****** 5.4.0-135-generic #152-Ubuntu SMP Wed Nov 23 20:19:22 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
Cluster Configuration:
Describe the bug:
We run multiple sites/clusters on k3s. We had no issues with v1.23.8+k3s1. But after upgrading to v1.23.14+k3s1 we regularly have issues looking up service names in coredns. It does not happen all the time (rebooting sometimes fixes it) but I have narrowed it down to a combination of all the following:
hostNetworking
Steps To Reproduce:
INSTALL_K3S_VERSION=v1.23.14+k3s1 K3S_TOKEN=**** install.sh --datastore-endpoint=*** --datastore-cafile=*** --datastore-certfile=*** --datastore-keyfile=*** --disable=metrics-server,traefik --node-name=****
Expected behavior:
In this example coredns is running on
proc2
. I expect the following to work but it fails to connect:All of the following examples do work tho.
From the same node that the coredns service is on (proc2):
Using TCP instead of UDP:
Not using hostNetowkr:
Using the cluster IP of coredns:
Additional context / logs:
There is nothing in the coredns or k3s logs that seems relevant.
The text was updated successfully, but these errors were encountered: