-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: nodes don't respond after a while #1831
Comments
I have marked this bug as critical as it is stopping js-waku nodes to connect to the fleet. |
As specified in status-im/infra-nim-waku#77, one can simply use
do note it was mentioned that the ws port was not the only one affected: status-im/infra-nim-waku#77 (comment) Also, why this was not detected in consul? |
Might be related to status-im/nimbus-eth2#5004 |
Actually that is 99% likely the cause, just bump libp2p and it will be fixed |
Merci @Menduist for the info I was looking at that code change wondering if it was related. I will try with new version of libp2p. |
Btw, this issue is also happening on the |
Checking the A On the other hand, both ports are opened:
So we have an issue with |
The reason why the which in turn, makes the main accept loop to stop in switch.nim I will apply a @Menduist 's suggestion to enhance that. cc - @SionoiS, @fryorcraken, @jm-clius |
Great investigating work! |
Problem
In
wakuv2.prod
all nodes were not responding towss
requests.@fryorcraken informed that the
js-waku
(https://examples.waku.org/light-js/) requests were not properly handled. That issue was reported at status-im/infra-nim-waku#77. While there was initially a firewall issue, in the end, we found that thenwaku
nodes were blocked.For any reason, the node couldn't attend to more requests on port 8000.
@jakubgs discovered the next:
and the next, where we can see that the p2p port seems to be blocked as well:
Impact
The node doesn't attend to any request until it gets restarted.
To reproduce
If you can reproduce the behavior, steps to reproduce it:
dial
many times.CLOSE_WAIT
in the node's container )@fryorcraken - kindly elaborate more on this if I missed any point on how to replicate the issue.
nwaku version/commit hash
v0.18.0-13-g44f9d8
We encountered this issue in all three
wakuv2.prod
nodes.The text was updated successfully, but these errors were encountered: