
bpo-6721: Sanitize logging locks while forking #4071

Merged — gpshead merged 7 commits into python:master from the logging_locks_at_fork branch on Sep 14, 2018

Conversation

@gpshead (Member) commented Oct 21, 2017

This part is simple: acquire and release the logging lock and each logging handler's lock around fork.
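
For illustration only, here is a rough sketch of that idea in terms of the public os.register_at_fork() API. The helper names are made up; the real patch works inside the logging module itself and covers the module-level lock as well as every handler's lock, not just the root logger's handlers.

```python
import logging
import os

def _iter_handlers():
    # Illustrative stand-in: the real patch tracks every Handler the
    # logging module knows about, not only those on the root logger.
    return list(logging.getLogger().handlers)

def _acquire_logging_locks():
    for handler in _iter_handlers():
        handler.acquire()        # blocks until no thread is mid-emit

def _release_logging_locks():
    for handler in _iter_handlers():
        handler.release()

if hasattr(os, 'register_at_fork'):   # POSIX only; Python 3.7+
    os.register_at_fork(before=_acquire_logging_locks,
                        after_in_parent=_release_logging_locks,
                        after_in_child=_release_logging_locks)
```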

A unittest is still needed (relatively trivial to create).

The logging.handlers module has a QueueHandler and a QueueListener, which use the queue module... and that module is full of locks. I'm not addressing those, as I don't see an immediately obvious correct thing to do with such a queue. A QueueListener thread won't be running in the forked child anyway, so the right thing for anyone using one is likely to remove the QueueHandler from their logging config entirely, unless they go on to re-establish their complicated queue-based logging handler setup in the child. Suggestion: do nothing. Let people using QueueHandler deal with their own problems as their application sees fit.
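
One way an application using QueueHandler could handle this on its own, in line with the "do nothing in logging" suggestion. The handler choice and names below are illustrative, not part of this patch:

```python
import logging
import logging.handlers
import os
import queue

log_queue = queue.Queue()
root = logging.getLogger()
queue_handler = logging.handlers.QueueHandler(log_queue)
root.addHandler(queue_handler)
listener = logging.handlers.QueueListener(log_queue, logging.StreamHandler())
listener.start()

def _drop_queue_handler_in_child():
    # The QueueListener thread does not survive fork(), so the child
    # swaps the queue-based handler for something self-contained.
    root.removeHandler(queue_handler)
    root.addHandler(logging.StreamHandler())

if hasattr(os, 'register_at_fork'):
    os.register_at_fork(after_in_child=_drop_queue_handler_in_child)
```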

https://bugs.python.org/issue6721

@@ -793,6 +798,9 @@ def createLock(self):
Acquire a thread lock for serializing access to the underlying I/O.
"""
self.lock = threading.RLock()
os.register_at_fork(before=self.acquire,
A Member reviewed this diff and commented:

This may register an arbitrary number of callbacks across an application's lifetime, and it also makes the handlers eternal by keeping a reference to all of them. Instead, we could have a global WeakSet of handlers and a single set of callbacks that iterate over the live handlers.

@gpshead (Member, Author) replied:

Another option would be `before=lambda acquire=weakref.WeakMethod(self.acquire): acquire()()`... I'm used to applications that don't create tons of logging.Handler instances and throw them away all over the place. :)

I think I'll go with the WeakSet approach, to avoid an ever-growing list of callables in the odd case of an application that discards Handlers...
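
A rough sketch of the shape that WeakSet approach could take. The names here are illustrative; see the merged diff for the actual implementation:

```python
import os
import weakref

# One global registry instead of one os.register_at_fork() entry per
# handler; the WeakSet lets discarded Handler instances be collected.
_at_fork_handlers = weakref.WeakSet()

def _register_handler_for_fork(handler):
    _at_fork_handlers.add(handler)     # e.g. called from Handler.createLock()

def _acquire_handler_locks():
    for handler in list(_at_fork_handlers):
        handler.acquire()

def _release_handler_locks():
    # A real implementation also has to cope with handlers that disappear
    # between the before and after calls.
    for handler in list(_at_fork_handlers):
        handler.release()

if hasattr(os, 'register_at_fork'):
    # Registered exactly once, at logging import time.
    os.register_at_fork(before=_acquire_handler_locks,
                        after_in_parent=_release_handler_locks,
                        after_in_child=_release_handler_locks)
```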

@ambv (Contributor) commented Aug 31, 2018

@gpshead Please come back to this patch during the sprint!

@gpshead gpshead self-assigned this Sep 12, 2018
@gpshead gpshead force-pushed the logging_locks_at_fork branch 2 times, most recently from 8d51698 to caa3e6f on September 13, 2018 08:41
@gpshead gpshead force-pushed the logging_locks_at_fork branch from caa3e6f to a0c2cfb on September 13, 2018 17:03
@gpshead gpshead requested a review from ambv September 13, 2018 17:10
The test now fails on the previous code and is fixed by the logging changes. It uses a thread to hold the locks during the fork in a synchronized manner.

Due to mixing fork and a thread, there is potential for deadlock in the child process. Buildbots and time will tell whether this actually manifests in this test or not. :/
@gpshead (Member, Author) commented Sep 13, 2018

A regression test that actually catches the bug, and that is fixed by this change, has been added.

Due to mixing fork and a thread, there is potential for deadlock in the child process. Buildbots and time will tell whether this actually manifests in this test or not. :/
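
A sketch of roughly what such a test looks like. The real test lives in Lib/test/test_logging.py and is more careful, so treat the names and timing below as illustrative:

```python
import logging
import os
import sys
import threading

handler = logging.StreamHandler(sys.stderr)
logger = logging.getLogger('fork-test')
logger.addHandler(handler)
logger.setLevel(logging.INFO)

locks_held = threading.Event()
fork_done = threading.Event()

def lock_holder():
    handler.acquire()            # pretend another thread is mid-emit
    try:
        locks_held.set()
        # With the fix, the forking thread blocks in the at-fork "before"
        # callback trying to take this lock, so wait with a timeout and
        # then let go; without the fix, fork() happens while we hold it.
        fork_done.wait(timeout=0.5)
    finally:
        handler.release()

t = threading.Thread(target=lock_holder)
t.start()
locks_held.wait()

pid = os.fork()
if pid == 0:
    # Before the fix this deadlocks: the child inherits a locked handler
    # lock that no thread in the child will ever release.
    logger.info('child did not deadlock')
    os._exit(0)
else:
    fork_done.set()
    t.join()
    os.waitpid(pid, 0)   # hangs forever if the child deadlocked
```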

@gpshead gpshead merged commit 1900384 into python:master Sep 14, 2018
@miss-islington (Contributor) commented:
Thanks @gpshead for the PR 🌮🎉.. I'm working now to backport this PR to: 3.7.
🐍🍒⛏🤖

@gpshead gpshead deleted the logging_locks_at_fork branch September 14, 2018 05:08
miss-islington pushed a commit to miss-islington/cpython that referenced this pull request Sep 14, 2018
bpo-6721: When os.fork() is called while another thread holds a logging lock, the child process may deadlock when it tries to log. This fixes that by acquiring all logging locks before fork and releasing them afterwards.

A regression test that fails before this change is included.

Within the new unittest itself: there is a small _potential_ for deadlock, due to the mixing of fork and a thread in the child process, if the parent's thread happened to hold a non-reentrant library-call lock (malloc?) when the os.fork() happens. Buildbots and time will tell whether this actually manifests in this test or not. :/ A functionality test that avoids that would be a challenge.

An alternate test that doesn't try to produce the deadlock itself, but just checks that the release and acquire calls are made, would be the next best alternative if so.
(cherry picked from commit 1900384)

Co-authored-by: Gregory P. Smith <[email protected]>
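
The commit message above mentions an alternate, non-deadlocking test that only checks the acquire and release calls. A hypothetical sketch of that idea, assuming (as this change implements) that the at-fork hooks go through Handler.acquire() and Handler.release(); it is not part of the PR:

```python
import logging
import os
import unittest


class RecordingHandler(logging.Handler):
    """Records acquire/release calls so a test can assert on them."""

    def __init__(self):
        self.calls = []
        super().__init__()

    def acquire(self):
        self.calls.append('acquire')
        super().acquire()

    def release(self):
        self.calls.append('release')
        super().release()


class ForkLockCallsTest(unittest.TestCase):
    @unittest.skipUnless(hasattr(os, 'fork'), 'requires os.fork()')
    def test_acquire_release_called_around_fork(self):
        handler = RecordingHandler()
        handler.calls.clear()          # ignore anything from setup
        pid = os.fork()
        if pid == 0:
            os._exit(0)                # child does nothing
        os.waitpid(pid, 0)
        # The "before" hook should have acquired the handler's lock and
        # the "after_in_parent" hook released it.
        self.assertEqual(handler.calls, ['acquire', 'release'])


if __name__ == '__main__':
    unittest.main()
```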
@bedevere-bot commented:
GH-9291 is a backport of this pull request to the 3.7 branch.

gpshead pushed a commit that referenced this pull request Oct 7, 2018
bpo-6721: When os.fork() is called while another thread holds a logging lock, the child process may deadlock when it tries to log. This fixes that by acquiring all logging locks before fork and releasing them afterwards.

A regression test that fails before this change is included.

Within the new unittest itself: there is a small _potential_ for deadlock, due to the mixing of fork and a thread in the child process, if the parent's thread happened to hold a non-reentrant library-call lock (malloc?) when the os.fork() happens. Buildbots and time will tell whether this actually manifests in this test or not. :/ A functionality test that avoids that would be a challenge.

An alternate test that doesn't try to produce the deadlock itself, but just checks that the release and acquire calls are made, would be the next best alternative if so.
(cherry picked from commit 1900384)

Co-authored-by: Gregory P. Smith <[email protected]> [Google]
@hroncok (Contributor) commented Nov 6, 2018

Just a heads up: we have reverted this commit in Fedora's Python package (version 3.7.1) because it rendered our graphical installers unbootable. This may have merely exposed a bug in the installer, or it may have introduced a regression. We are currently investigating the problem with Fedora QA and the installer developers, and we will open a BPO issue if we find a simple reproducer.

Details: https://bugzilla.redhat.com/show_bug.cgi?id=1644936

Labels: sprint, type-bug (an unexpected behavior, bug, or error)