-
Notifications
You must be signed in to change notification settings - Fork 59
Unable to start a container in CentOS 7 with Docker #950
Comments
Hi @syed - thanks for reporting this issue. A few questions:
A few tests to try:
|
Hi @jodh-intel,
I enabled logging as per your directions. The I also started the proxy in debug mode and here are the logs from
|
Hi @syed - thanks for the debug info. The runtime log indicates a problem talking to the proxy. From
Please could you run the following so we can get more detail from the proxy:
Then in another terminal:
Finally, control-c the terminal running |
Interesting. I am pretty sure the proxy is getting the message. It just doesn't respond with a success. Here are the logs
The corresponding logs in the
From what I can understand, the |
After the runtime does a hello, the proxy is supposed to receive a message from hyperstart, the agent running inside the VM. That message is sent as soon as the VM has booted and hyperstart is started. Unfortunately it seems to me that qemu isn't properly booting. Could you confirm |
You can confirm your qemu has the correct support like this:
It's a long-shot, but did you capture the output of Also, are you running on bare metal, or in a virtualised environment? If virtualised, how much memory is available? Another question: what do you get as output from these commands:
|
Please could you paste the output of:
... and attach files:
Also, do you see any errors that might be related in:
|
@dlespiau @jodh-intel Unfortunately I don't have the output of the installer script. I am running on a baremetal host with 2 sockets of Xeon L5520 giving me 16 cores and 24GB of RAM. I checked for
Here is the output for the vhost module info:
The
I've attached the |
Hi @syed - that all looks fine and your config files seem to be unmodified. Could you try running this (assuming you only have a single qemu process running):
|
@jodh-intel I do have a qemu process running.
Strangely, the stack looks empty
|
@syed are you using a custom linux kernel in the host? |
@devimc AFAIK, no. I'm using the one which is shipped by default
|
@syed - Could you try disabling
Then, capture the debug proxy output again:
And try creating a container once more. You should see more output from the proxy this time. |
@jodh-intel ... I don't see any output on the proxy
Here is the qemu process for the container
|
@syed I ran into the same problem. I noticed that my /etc/systemd/system/docker.service.d/clr-containers.conf file had two "/" in the path to the cc-oci-runtime binary. When I removed the extra / the cor runtime functioned properly for me. It should be "cor=/usr/local/bin/cc-oci-runtime" Can you check your conf file to see if the path to the cor binary is correct? |
@eadamsintel I removed the extra
I restarted Docker, I even rebooted the host after this but the Docker daemon just hangs if I start a container. |
@jodh-intel is there something that I can use to check if the qemu is working correctly? Can I start a simple VM from command line using the clear container kernel? |
At this point, I have
When I run this, I get no output at all. I also see that the qemu process is taking 100% CPU (all other qemu processes started from |
The
|
I repeated the installation steps on Ubuntu 16.04 and I still ran into this issue. |
I ran your QEMU command and it ran at 100% CPU for me as well. The MD5 of my qemu was bb7fb35aaa35990f686053629a24fcb9 for reference and it came from running the rhel script. I set the / back in my script and restarted docker and had the containerd issue come backup. When I removed it again and restarted Docker I continued to have the containerd issue. I killed the cc-oci-runtime process, restarted docker again, and enabled the cc-proxy and continued to have issues. I rebooted and that fixed it up. There might be some caching of that configuration file or I might have had something else going on that is different from you. My issue was with not being able to connect to containerd and qemu never spawned a process when trying to do a docker run. |
@eadamsintel @jodh-intel it looks like
I am attaching the repeating pattern that I found with strace. From what I can tell, it looks like qemu opens the |
Hi @syed - Can you clarify your comments on Ubuntu - are you saying you've followed the doc below and installed Ubuntu + Clear Containers on the same Xeon system you used for CentOS and you see (exactly?) the same behaviour with qemu spinning at 100% CPU? If so, this sounds like a hardware issue. Do you have access to another physical system to try and rule that out? @anthonyzxu - do you have any thoughts on this? |
@jodh-intel I the setup on a VM running inside a KVM host and it is working there. I don't have another physical box at the moment but I will try to free one up and test it on that. If it works, it looks like there is some incompatibility with the processor and Qemu. |
Hi @syed - unfortunately, its looking like your Xeon isn't new enough. To prove that, can you run the following:
|
@jodh-intel You are absolutely right. Looks like the CPU doesn't have the aes flag
|
Hi @syed - I'm sorry to hear that, but atleast we eventually got to the bottom of the problem :) We'll be updating the check-config script [1] for the missing Thanks for your patience and hope you are able to source another system to allow you to have the proper Clear Containers experience. [1] - https://download.clearlinux.org/current/clear-linux-check-config.sh |
Thanks @jodh-intel and everyone for promptly answering my queries. It's always brings me happiness to see the community willing to help someone who's starting out. Nice to see a healthy project 👍 I'll go ahead and close this issue for now. Thanks again. |
Hi @syed - would you mind checking a couple of things that might give us a more precise answer as to why your system can't run Clear Containers?:
Also, just to let you know that the script below has now been updated to perform the extra check (and hopefully save other users from having the bad experience you've had): |
@jodh-intel Absolutely. Here is the output:
|
Hi @syed - Thanks very much - this is useful information. The actual limitation of your system is the |
The check script below has now been updated to check for Example run on a system that supports Clear Containers:
|
For what it's worth, the new runtime (still in alpha) now has a built-in command called |
Hi— Sorry to barge into this closed issue. I followed all the debugging steps that @jodh-intel provided, which were superb, and everything looks like it should work. But I still get 100% CPU usage from qemu-lite. I tried both the old and new checking scripts¹, and both look fine. I’m attaching a bunch of other output files. Many thanks for your help in advance.
(¹) I only compiled the cc-check.go and utils.go files, as I would prefer Attached files: |
Hi again— I didn’t know 3.0 was so closed to being released! I’ve now installed the new runtime, and it works perfectly. Thank you! |
Hi,
I've basically followed the instructions mentioned at https://github.com/01org/cc-oci-runtime/blob/master/documentation/Installing-Clear-Containers-on-Centos-7.md. However, when try to start a container using
It just hangs. I ran the
linux-check-config.sh
script to see if there are any problems, but I don't find anyI can see a
qemu-lite
process in theps -e
However, I don't see anything on the docker side. I tried to start thecc-proxy
in debug mode and saw that it receives a hello but it never responds with a success.What would be the best way to debug this? Here are more details about my environment
The text was updated successfully, but these errors were encountered: