systemd[1]: Failed to send queued message: No buffer space available #1744
Comments
Maybe related: rkt/rkt#3486 |
It sounds similar but I don't think rkt is involved in that. How is fleet being run? |
@lucab What do you mean? It's the default fleet from stable CoreOS with etcd2. |
This might be related to #1742. Can you look at the available system memory? |
@crawford The system had 128 GB of memory, and not all of it was in use. |
Heh, yeah probably didn't chew through all of that. |
and fleet does not consume much
|
@crawford It happened again on another node, with exactly the same behaviour. |
@f0 We have still never been able to reproduce this failure. The upcoming Alpha should have systemd 233, so it will be interesting to see if you still run into this. |
@f0 Are you still seeing this issue? |
Closing due to inactivity. |
I came across this error (not on CoreOS). The following command resolved the issue, which appears to be a bug in systemd:
|
the problem is still relevant!
|
The underlying issue here is the same as upstream bug systemd/systemd#4068. Synchronous dbus operations require some buffering on the systemd side, and under high workload those buffers may saturate. Buffer sizes were bumped in v232, which should alleviate this issue in most cases, but buffers can still fill up in extreme situations. As this is starting to attract unrelated non-CoreOS follow-ups, I'm going to lock this conversation. Further specific bugs can be discussed in new dedicated tickets. |
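To illustrate the saturation described in the comment above, here is a toy model of a fixed-size message buffer whose producer outpaces its consumer. It is only a sketch: the function name `simulate` and all of its parameters are hypothetical, and it does not reproduce systemd's or dbus's actual buffering code. A failed non-blocking enqueue stands in for the "No buffer space available" error.

```python
import queue


def simulate(capacity, sent_per_tick, drained_per_tick, ticks):
    """Return how many sends failed because the bounded buffer was full.

    Toy model only: names and parameters are illustrative assumptions,
    not taken from systemd or dbus.
    """
    buf = queue.Queue(maxsize=capacity)
    failed = 0
    for _ in range(ticks):
        # Producer: try to enqueue each message without blocking.
        for _ in range(sent_per_tick):
            try:
                buf.put_nowait("msg")
            except queue.Full:
                # Stand-in for "No buffer space available".
                failed += 1
        # Consumer: remove at most drained_per_tick messages this tick.
        for _ in range(drained_per_tick):
            try:
                buf.get_nowait()
            except queue.Empty:
                break
    return failed
```

With a balanced load no send fails, and a larger `capacity` only delays the first failure when the producer is persistently faster than the consumer. That matches the comment's point that bigger buffers help in most cases but can still fill up under extreme load.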
Today I found that one of our servers (bare metal) no longer gets new units via fleet.
After some debugging, I see that fleet does not sync its state with etcd.
Directly after restarting the fleet daemon, I see this error in the logs:
Also, all systemd commands are slow or run into a timeout (e.g. systemctl status...).
I tried to restart the server via reboot; this hangs, and in the HW console I see a loop of umount commands. Only a hard reset from the HW console works.
Here is the journal from this:
(this is actually truncated, there are 10x such messages in the log)
After the reboot, the problem is gone.
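For anyone trying to catch the slow or timing-out systemd commands described above before a node becomes unusable, a small timing wrapper may help. This is a sketch under assumptions: `timed_run` is a hypothetical helper name, and the timeout value is arbitrary. It only measures how long a command takes and whether it finished within the limit.

```python
import subprocess
import time


def timed_run(cmd, timeout_s):
    """Run cmd and return (elapsed_seconds, status).

    status is "exit <code>", "timeout", or "not found".
    Hypothetical helper for illustration; not part of fleet or systemd.
    """
    start = time.monotonic()
    try:
        proc = subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,  # the child is killed if this limit is exceeded
        )
        status = f"exit {proc.returncode}"
    except subprocess.TimeoutExpired:
        status = "timeout"
    except FileNotFoundError:
        status = "not found"
    return time.monotonic() - start, status
```

For example, `timed_run(["systemctl", "status", "--no-pager"], 10.0)` returns the elapsed time and a status string. A result of `"timeout"`, or an elapsed time that keeps growing between runs, would be consistent with the behaviour reported here.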
CoreOS Version