Cannot stop container; status gets out of sync with LXC #1906
Comments
I think it would be useful to attach to |
Sounds like it might be related to #1300? |
@nekto0n, @pwaller I faced the same problem. However, I am seeing it with a simple sleep command.
For eg: I run
On another terminal
My system information
|
I just got a similar one: docker stop would not stop the container, it simply hung. When I ran docker inspect, it had a PID logged, but when I looked up the PID it was not there. Eventually I restarted the docker service and stop started working; then I was able to rm the image but got:
|
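For anyone comparing the PID that docker inspect reports against what the kernel knows, here is a minimal sketch of that check in Python. The function name `pid_exists` is made up for illustration; it relies only on the standard behaviour of signal 0, which tests for a process without delivering anything.

```python
import os


def pid_exists(pid: int) -> bool:
    """Return True if a process with this PID currently exists."""
    if pid <= 0:
        # PID 0 and negative values address process groups, not one process.
        raise ValueError("pid must be a positive integer")
    try:
        os.kill(pid, 0)  # signal 0: existence/permission check only
    except ProcessLookupError:
        return False  # ESRCH: no such process
    except PermissionError:
        return True  # EPERM: process exists but belongs to another user
    return True
```

If this returns False for the PID Docker has recorded, the daemon's view of the container is stale, which matches the symptom described above.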
Tentatively scheduling for 0.8. I think this will be affected by the execution drivers work, so @crosbymichael and @creack I'll assign it to one of you. |
We had similar problems with non stoppable containers/hanging processes and subsequently locked files on the following configuration (RedHat EL 6.4) and a Docker container with a CMD issuing the Tomcat start command
Docker version:
Changing the CMD to Is there any advice on what the CMD instructions should start? |
Similar issue on an Ubuntu 13.04 box, where we have a few containers running
That left about 300 MB of free memory. Docker is the latest version, v0.8.1
|
Some problems described at http://phusion.github.io/baseimage-docker/ might be relevant. Whether the solution proposed by their baseimage is a good one is something everyone should decide for themselves :) To minimize surprises and avoid too many issues, I would propose adding some details to the Docker documentation. |
Is this bug still present now that Docker uses straight libcontainer by default? |
Does anyone have a good way to reproduce this? |
@crosbymichael pretty sure bootstrapping and starting Discourse with device mapper is broken https://github.com/discourse/discourse_docker , follow the guide with DM picked (edit out the line in ./launcher that does the pre-req) |
Any news on this issue? We are experiencing the same on At some point docker ps is not up to date, stopping/starting containers doesn't work anymore. Restarting the docker daemon at that point gives a lot of broken containers due to the volume mounting issues (umount'ing doesn't work) |
@mrdfuse That kernel is outdated and it needs to be updated. Please keep your systems up to date using system provided packages. @ashahab-altiscale Can you look into this, please? |
@unclejack Looking at this. |
I checked with the company that does our infrastructure. There are 2 support programs from Red Hat Enterprise: either you install the DVD version and get support on that, or you choose the daily version and only get support if you constantly update all your packages. I'm not saying I expect the Docker devs to keep supporting older kernels/packages, I'm only trying to explain that you can't expect everyone to always use the latest and greatest. Docker is fairly new and, as I understand it, depends on kernel features and bugfixes in later packages. As such I think Docker is not (yet) fit for us. Again, I'm not blaming anyone; I understand you choose to only support later kernels/packages. |
@mrdfuse
|
@mrdfuse I remember the problems you're facing in your environment. However, RHEL6 should be kept up to date. The 2.6.32 kernel is actually receiving fixes and backports from newer kernels. Kernel 2.6.32 as originally released on kernel.org, with only the features it had then, isn't supported by Docker in any way. Red Hat maintains that kernel to provide their customers with a stable system for containers and many other things, and that's why I was recommending an update in this case. Kernels such as 3.10, 3.12, 3.13 (on Ubuntu 14.04) and 3.14 might be better, but updates are always recommended. Installing updates is more important for kernel 2.6.32 because it didn't have some of the features and fixes when it was released. Since you're already paying that company for support, please tell them about issues like this one and the other one you've reported in that environment. They should test and upgrade to new kernels when you run into such bugs, just like they do when they need to update for security fixes. You're going to miss out on the newest fixes otherwise. I've seen bugs go away after installing the system updates on Ubuntu and CentOS. From what I recall, it was always kernel related and that's to be expected - the kernel is being worked on all the time and fixes are pulled in all the time. There's also no way to fix certain kernel bugs through Docker or work around them. I actually know some problems related to devicemapper were fixed through kernel updates on RHEL6 (some affected all systems). If you have an easy way to reproduce this on your systems, please provide the exact steps and the output so we can reproduce and investigate. |
In the few months I have been running Docker I only ran into this issue twice, so I highly doubt I can simply reproduce this :( About the kernel, I thought I read here that 2.6.32-431 is the minimum version? We'll be updating to RHEL6.6 anyways in the near future, so it doesn't matter that much anymore. |
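A rough way to compare a running kernel against the 2.6.32-431 figure mentioned above is sketched below. Treat that figure as the commenter's unconfirmed recollection, not a documented requirement. The names `kernel_tuple`, `meets_minimum` and `ASSUMED_MINIMUM` are invented for this example, and the comparison ignores vendor backports, so it is only indicative.

```python
import platform
import re

# Assumption taken from the comment above, not from official documentation.
ASSUMED_MINIMUM = (2, 6, 32, 431)


def kernel_tuple(release: str) -> tuple:
    """Turn a release string like '2.6.32-431.el6.x86_64' into (2, 6, 32, 431)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)(?:-(\d+))?", release)
    if match is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    # A missing build number (no '-NNN' suffix) is treated as 0.
    return tuple(int(g) if g is not None else 0 for g in match.groups())


def meets_minimum(release: str, minimum: tuple = ASSUMED_MINIMUM) -> bool:
    """Compare the parsed release against the assumed minimum, field by field."""
    return kernel_tuple(release) >= minimum


if __name__ == "__main__":
    current = platform.release()
    print(current, "meets assumed minimum:", meets_minimum(current))
```

The build number after the dash is only meaningful between kernels from the same vendor series, so a passing result does not guarantee that a particular fix is present.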
Can you try with the latest version of Docker and LXC 1.0.7?
Closing as stale. Please ping me with details to reproduce on the latest version and I will reopen.
Seems I can reproduce it consistently with
Container, which image is based on From SSH of docker-machine VM
What should I do to get some useful debug information? |
@sadovnikov are you using the LXC driver, or the native driver? |
I'm not very familiar with these technologies yet. The command I use to create docker-machine VM is
How do I know whether I'm using the LXC driver?
@sadovnikov in that case, you're using the default (native) driver, so your issue is probably unrelated to the issue discussed here |
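On the question of how to tell which driver is in use: Docker releases from this period printed an "Execution Driver" line in the output of docker info. The sketch below assumes that line is present (later releases may not print it) and uses helper names invented for illustration.

```python
import subprocess
from typing import Optional


def parse_execution_driver(info_text: str) -> Optional[str]:
    """Return the value of the 'Execution Driver' line, or None if absent."""
    for line in info_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "execution driver":
            return value.strip()
    return None


def current_execution_driver() -> Optional[str]:
    """Run `docker info` and extract the execution driver, if it is reported."""
    result = subprocess.run(
        ["docker", "info"], capture_output=True, text=True, check=True
    )
    return parse_execution_driver(result.stdout)
```

A value starting with "native" would indicate the default libcontainer driver, while one starting with "lxc" would indicate the LXC driver this issue is about.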
I came across an interesting behavior where a Docker container's status got out of sync with an LXC container. docker ps showed my container (ae23c705afdb) with a status of running. However, whenever I try running docker kill or docker stop, the command hangs indefinitely. docker rm says that it cannot remove a container that is running. docker inspect ae23c705afdb confirmed the running state and showed a PID of 11780. ps aux | grep 11780 showed a process that was running (lxc-start -n ae23c705afdbbcfcd723c7bb17fbdbc7c8632da41e5e8c38bbf714a701b5b536 -f /var/lib/docker/containers/...), but its state is shown as D, or "uninterruptible sleep". This process thus does not respond to any interrupts. lxc-list does not show the corresponding LXC container running, so lxc-kill xxx does not work.
I ended up just having to do a reboot. Any thoughts on this?
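The D state seen in ps can also be read directly from /proc. This is a small sketch, assuming a Linux host; `parse_state` and `process_state` are hypothetical helper names. The state is the first field after the command name in /proc/<pid>/stat.

```python
from pathlib import Path


def parse_state(stat_line: str) -> str:
    """Extract the one-letter process state from a /proc/<pid>/stat line."""
    # The command name sits in parentheses and may itself contain spaces or
    # ')' characters, so split on the LAST ')' before reading the state.
    after_comm = stat_line.rsplit(")", 1)[1]
    return after_comm.split()[0]


def process_state(pid: int) -> str:
    """Read the state of a live process; 'D' means uninterruptible sleep."""
    return parse_state(Path(f"/proc/{pid}/stat").read_text())
```

A process stuck in D is waiting inside the kernel and will not act on signals, including SIGKILL, until that wait ends, which is consistent with docker kill and docker stop hanging here.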