-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dangling file in /var/lib/cni/networks/podman prevents container starting podman 3.0 #9465
Comments
I assume you have a static IP for your container defined. |
No i do not have static ip configured for the container.
my versions are as follows of some other components as well:
Can this be caused by container being launched by one user via sudo and then stopped and re-launched by root's crontab? |
Can you check if |
Funny you should ask. The reason i went with the stop and start to begin with is that restart seems to be so fast that it doesnt allow time for the bind for the port to release. That's the issue i had from the beginning that led me to using stop && wait && start.
|
Alright, that's very strange. |
this is a secondary issue and should be split off. |
@mheon does cleanup remove files in |
or how about when we detect a restart has occurred? |
As of Podman 3.0, we ship a systemd-tmpfiles file instructing systemd to wipe the CNI reservations directory on reboot. |
I doubt this is actually a CNI reservation issue given the way this is being described, though. If @skitoxe Can you check if there are any orphan |
@mheon could selinux be interupting things? |
Possible? Checking the audit log could be helpful. I'm still not certain what's actually going on here, though. We're getting port in use errors from Conmon, which usually indicate that Conmon is hanging around for a while and blocking the ports - but we fixed |
Can you please paste your cni config file for the podman_default network. |
You mean this file right? /etc/cni/net.d
|
i didnt appear to have, granted i could have been to slow to catch them if they stick around too long but still very short, depends on the speed the restart allocates the port again, in comparison to how fast i get the process grep. I did however get this error while stopping this time:
|
Hm. Any chance the firewall was reloaded since the container was created? That could be related, though I'm not 100% sure how yet. |
Yes that's the correct file and it also looks correct to me.
Do you really have 40 cores on your system? This could be race which is more likely to happen with so many cpu cores. |
couple of observations here. the network config provided ^^^ was created by docker-compose (based on the labels). So the first question I have is if that is true? And then /me wonders if you should be using a network that you create with podman network create? |
Yes it has been reloaded.
10 cores and 20 threads per CPU. 2x xeon e5 2640v4.
The containers were created by Docker-compose but with no additional network configuration in the .yml file apart from dns configuration passed to the container. |
A friendly reminder that this issue had no activity for 30 days. |
A friendly reminder that this issue had no activity for 30 days. |
Since I was never answered, I am going to assume no and close. Reopen if I am mistaken. |
Is this a BUG REPORT or FEATURE REQUEST? (leave only one on its own line)
/kind bug
Description
When doing an
podman stop <container> && sleep 5 && podman start <container>
or just a normal stop and start i on some occasions get an orphan in /var/lib/cni/networks/podman/.In exactly the same manner as described in issue #3759, which is now closed. But is see further comments indicating this is still an issue and i can see that its still a problem in 3.0. I have to manually delete the file to get the container to start again. Error message recived is:
Steps to reproduce the issue:
Describe the results you received:
No starting container and the follwing error when trying to start:
Describe the results you expected:
Container starts
Additional information you deem important (e.g. issue happens only occasionally):
Seems to be more common when run by a crontab. But that might be my imagination.
Output of
podman version
:Output of
podman info --debug
:Package info (e.g. output of
rpm -q podman
orapt list podman
):Have you tested with the latest version of Podman and have you checked the Podman Troubleshooting Guide?
Yes
Additional environment details (AWS, VirtualBox, physical, etc.):
Bare metal server dual xeon cpus and 192GB RAM.
I also have this in my /usr/lib/tmpfiles.d/podman.conf
The text was updated successfully, but these errors were encountered: