-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tunneldigger process management broken (?): can end up with multiple tunneldigger running #148
Comments
Can you maybe reproduce something like this if you try to do multiple tunneldigger restarts at the same time? Something like:
I'm wondering if the tunneldigger-watchdog micron can sometimes result it multiple restarts being run in parallel? Just some weild guesses. |
Hm, when I tried this even just with a loop count of 20, my device just reboots after a bit... nothing it prints via SSH shows any indication why. It's a pretty weak device with very little RAM, so it's probably not good for such tests. It's the only one I have though... |
You should be able to find out if it's an out-of-memory or other crash via /sys/kernel/debug/crashlog after the device rebooted, as long as you don't power cycle it. Or via a serial console, of course. Not sure if that'd help for this issue, but maybe there could be some unexpected hints in there? |
We had this issue on alot of routers and it seems that this occours after a reboot (daily reboots). Patched this function
to
With this change we dont have this issue anymore, but i still cant tell how the routers end up |
tunneldigger has been deprecated in gluon and removed in main branch: freifunk-gluon/gluon#3109 |
The latest lead in a long-standing issue seems to indicate that tunneldigger process management sometimes goes wrong, and we can end up with 2 instances of tunneldigger running (
ps
showing 6 tunneldigger processes, rather than the usual 3). This then leads to those 2 instances interrupting each other all the time, which is essentially a DoS attack on the gateway.I don't know how to reproduce this, and have not actually seen these 6 tunneldigger processes myself (I never managed to get SSH onto an affected device), but this is the best lead so far. So I wonder... how could a Gluon device end up in a situation where tunneldigger runs twice?
The text was updated successfully, but these errors were encountered: