-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docker 1.12.2 docker ps hangs #27323
Comments
@saithala Can you try to send |
/cc @tonistiigi |
This looks pretty awful |
I'm experiencing the same issue with 1.12.2-rc2 running on CentOS 7.2.1511, kernel 3.10.0-327.36.1.el7.x86_64 #1 SMP, with Swam active (this node is a manager): The stack from Docker engine is attached.
|
As an update, apparently the remote API is not responding to calls:
|
Sounds like it might be related to #27272 |
Looks like a deadlock on the network controller. This seems like the relevent code holding the controller lock:
While the controller is locked it's stuck trying to acquire the lock on a "serviceBinding" |
Btw, the network controller is deadlocked, making any actions like start/stop hang while the container is locked, which makes commands like |
Bumping to P1 |
It is a AB/BA deadlock between controller and service lock. Pushed a PR to fix it in moby/libnetwork#1507 |
Fixes moby#27323 Signed-off-by: Jana Radhakrishnan <[email protected]>
@mrjana Can you share more details on the likelihood of the deadlock triggering? Does this warrant a 1.12.3? /cc @vieux @thaJeztah |
@aluzzardi It is indeed surprising that nobody hit this problem during 1.12.2-rc phase but the likelihood is entirely dependent on timing of two goroutines exactly to have acquired one of the locks and waiting on the other. In terms of possibility it is more likely to happen when there is more task failures which can start triggering cleanup concurrently when another task of the same service is trying to get started. May be that is why we haven't hit this because we haven't probably had too many task failures in the kind of testing that we have done. |
I created a 1.12.3 milestone for tracking |
Description
After upgrading to the latest version of Docker Engine with Swarm mode (1.12.2) executing command docker ps hangs every now and then. Restarting the docker daemon solves the issue
The text was updated successfully, but these errors were encountered: