-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reports of updater hanging without observable activity #668
Comments
As part of investigating this issue, one immediate improvement we could make to the updater is to ensure that all output (not just the launcher's own log file) is written to disk for easier debugging. |
I have observed this issue locally while testing #666 , the updater will try to run a full apply run, but hangs at 35% after applying dom0 state. The reboot did resolve. I did observe that a task failed while initially applying dom0 state, prior to running the full provisioning run:
Did you also observe this error? If so, we should:
|
Some initial observations:
Either Before modifying the script, I'd like to understand why it's (sometimes?) attempting to access the wrong DBUS session. I will add some debugging lines and see if I can reproduce the error you observed:
Please let me know if you have other pointers or debugging suggestions. :) |
So far no dice on a repro. Here's what I did:
Saw a successful |
I still haven't been able to reproduce the state reported by @emkll in #668 (comment) but #684 should make it impossible to get into that specific failure mode and should have no negative side effects. Next up: more graceful error handling for the updater itself. |
We've received several reports of the preflight updater hanging and not recovering. I've personally seen it happen with the following STR:
prod
sdw-admin --apply
Expected behavior
Updater will trigger full migration due to postinst logic added in 6cf625c, but will eventually run to completion
Actual behavior
Updater gets stuck at 35% with no observable activity. The issue is resolved on reboot, which is expected because
/tmp
will be empty and the migration will not be applied.The text was updated successfully, but these errors were encountered: