Worker node fails to recover from a join failure because of transient networking issue #26646
Labels
area/swarm
kind/bug
Bugs are bugs. The cause may or may not be known at triage time so debugging may be needed.
version/master
Description
If there is a transient networking issue when a a
swarm join
command is issued in worker node which persists until the join command times out, then the worker node never recovers from that even if the engine instance is restarted.Steps to reproduce the issue:
Reproduced in a set of dind instances:
Additional information you deem important (e.g. issue happens only occasionally):
Output of
docker version
:Logs
Node ls on first node
Certificates on worker node
/cc @tonistiigi
The text was updated successfully, but these errors were encountered: