-
-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stuck forever in down
command (despite shutdown.timeout_seconds
)
#258
Comments
down
commanddown
command (despite shutdown.timeout_seconds
)
Hi @mprimi, Thank you for such a detailed analysis, this is very helpful. version: "0.5"
log_level: info
log_length: 300
processes:
pc_log:
command: "tail -f -n100 process-compose-${USER}.log"
working_dir: "/tmp"
shutdown:
timeout_seconds: 5
sigterm_resistant:
command: "trap '' SIGTERM && sleep 20"
shutdown:
timeout_seconds: 5
sleeper:
command: "sleep 60"
replicas: 5
shutdown:
timeout_seconds: 5 Started it about 20 times and stopped it with the for i in {1..20} ; do ./bin/process-compose up -f issues/issue_258/process-compose.yaml -t=false & sleep 1 && ./bin/process-compose down ; done Are there any additional details we might have missed? |
Thank you for looking into this. I also have been trying to repro with a variant of Nevertheless our CI pipes (1.27 on Linux via I'll keep trying to see if i can learn more from it and report back. Here's a snip of the configuration I'm using:
|
This is very helpful. |
We're seeing a similar issue with Devbox using our process-compose.yaml for Postgresql:
It looks like the progresql process here gets stuck in Trying different versions, it looks like this bug was introduced sometime after 1.24.2 -- If I run the process-compose file above with 1.24.2, I get a |
Hi @Lagoja, Issue 1Process Compose skipped termination of a Issue 2But why did >ps -aef | grep 80415
eugene 80415 80400 0 14:25 pts/1 00:00:00 [pg_ctl] <defunct> # pg_ctl is a zombie Possible Fixes
Personally, I prefer the 1st option, but I will be happy to get feedback from the users. CC: |
@F1bonacc1, thank you for investigating this. While it seems likely the problems are related, the issue I originally reported is not exactly what you identified:
Since I can reproduce, please let me know of any additional info I can capture. The configuration I posted above is still current. Overview:
While in this state,
The Looking with
Here's the tail of process-compose log:
Notice processes are dying and getting restarted (as expected). Very interesting that the one last restarted ( Seems likely that this process was not "ready" when the I tried to peek a the a stack trace (via SIGQUIT), but could not find where |
Confirming issue is fixed in v1.40.0. Thank you @F1bonacc1 |
@F1bonacc1 thanks for the ping and sorry I missed that earlier! Glad to see you already merged a fix |
This is a follow-up to #251 where i was seeing call to
list
getting stuck. Since then I have added ashutdown.timeout_seconds
parameter to ensure timely shutdown.However I am now seeing
down
getting stuck forever, e.g..:In this configuration, i am running 6 processes.
up
command.I expected
shutdown.timeout_seconds
(set to 5 seconds) to kick in and KILL the culprit.However the
down
process has been stuck where it is for the last hour.Unfortunately not much to go on here...
Current repro (maybe you can repro this yourself):
shutdown.timeout_seconds
for all of themdown
getting stuckThis is reproducing for me with v1.27 on Linux.
Happy to capture any more data you think is interesting, or to try other configuration settings.
The text was updated successfully, but these errors were encountered: