gh-109653: Speedup import of threading module #114509

danielhollas · 2024-01-23T23:09:36Z

Delaying the import of functools speeds up the import of threading by ~50% (2ms -> 1ms) in my testing.

Since the functools module is only used in the internal _register_atexit function, which is called by concurrent.futures, this seems like a worthwhile win for users of the threading module who do not use asyncio.

Part of #109653

CC @AlexWaygood

Issue: Improve import time of various stdlib modules #109653

Delayed import of functools leads to 50% speedup of import time.

bedevere-app · 2024-01-23T23:09:44Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

danielhollas · 2024-01-24T00:34:54Z

To be precise, compiling Python with ./configure --enable-optimizations and measuring with python -Ximporttime -c "import threading", I get 2.44ms on main and 1.17ms on this PR.
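A measurement like this can be scripted. The sketch below is one possible approach, assuming the `-X importtime` report format of current CPython releases (lines of the form `import time: self | cumulative | name` written to stderr); the helper names `parse_importtime` and `measure_import` are hypothetical.

```python
import subprocess
import sys


def parse_importtime(stderr_text, module):
    """Return the cumulative import time of `module` in microseconds, or None."""
    for line in stderr_text.splitlines():
        if not line.startswith("import time:"):
            continue
        fields = line[len("import time:"):].split("|")
        if len(fields) != 3:
            continue
        cumulative, name = fields[1].strip(), fields[2].strip()
        # The header line has non-numeric fields and is skipped here.
        if name == module and cumulative.isdigit():
            return int(cumulative)
    return None


def measure_import(module):
    """Run a fresh interpreter with -X importtime and parse its report."""
    result = subprocess.run(
        [sys.executable, "-X", "importtime", "-c", f"import {module}"],
        capture_output=True,
        text=True,
        check=True,
    )
    # -X importtime writes its report to stderr, not stdout.
    return parse_importtime(result.stderr, module)
```

Calling `measure_import("threading")` returns the cumulative time in microseconds for one run. Single runs are noisy, so repeating the call and taking the minimum or median gives a steadier number.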

ajoino · 2024-01-24T12:33:02Z

Just a thought, is it even necessary to use functools.partial here? Could this not be replaced with lambda: f(*args, **kwargs), that would avoid importing functools at all. Is there something I'm missing here?
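As a small illustration of the suggestion, the sketch below shows that a `functools.partial` object and a zero-argument lambda can stand in for each other when the bound arguments never change. The function `greet` and its arguments are made up for the example.

```python
import functools


def greet(greeting, name, punctuation="!"):
    return f"{greeting}, {name}{punctuation}"


args = ("Hello", "world")
kwargs = {"punctuation": "?"}

with_partial = functools.partial(greet, *args, **kwargs)
with_lambda = lambda: greet(*args, **kwargs)

# Both are zero-argument callables that produce the same result.
assert with_partial() == with_lambda() == "Hello, world?"
```

One difference to keep in mind: `partial` captures the argument values when it is created, while the lambda looks up `args` and `kwargs` when it is called. Inside a function whose parameters are never rebound, the two behave the same.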

AlexWaygood · 2024-01-24T12:39:28Z

> Just a thought, is it even necessary to use functools.partial here? Could this not be replaced with lambda: f(*args, **kwargs), that would avoid importing functools at all. Is there something I'm missing here?

This was my thought as well on first seeing the patch. functools.partial can be faster than a lambda function, but here I doubt it makes a significant difference. (If we wanted to check whether using a lambda here slowed things down, we'd need to do a benchmark using concurrent.futures, since the concurrent.futures module is the only public API that makes use of this private API. It might be possible to write such a benchmark, but it also might be difficult -- not sure.)
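A rough way to compare raw call overhead is a `timeit` micro-benchmark like the one below. This is not the concurrent.futures benchmark described above, only a sketch that times calling each wrapper directly; the `callback` function is a placeholder, and results vary by machine and Python version.

```python
import functools
import timeit


def callback(a, b, key=None):
    return None


via_partial = functools.partial(callback, 1, 2, key="x")
via_lambda = lambda: callback(1, 2, key="x")

n = 100_000
# timeit.timeit accepts a zero-argument callable as the statement to time.
t_partial = timeit.timeit(via_partial, number=n)
t_lambda = timeit.timeit(via_lambda, number=n)

print(f"partial: {t_partial / n * 1e9:.0f} ns per call")
print(f"lambda:  {t_lambda / n * 1e9:.0f} ns per call")
```

Any difference measured this way is per call, so it only matters for code that invokes the wrapper many times.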

danielhollas · 2024-01-24T13:37:03Z

> functools.partial can be faster than a lambda function, but here I doubt it makes a significant difference. (If we wanted to check whether using a lambda here slowed things down, we'd need to do a benchmark using concurrent.futures, since the concurrent.futures module is the only public API that makes use of this private API. It might be possible to write such a benchmark, but it also might be difficult -- not sure.)

Looking at the code, threading._register_atexit() is only ever called during the import of concurrent.futures, so I would assume any performance difference here would be marginal?

AlexWaygood · 2024-01-24T13:44:33Z

> Looking at the code, threading._register_atexit() is only ever called during the import of concurrent.futures, so I would assume any performance difference here would be marginal?

Oh, great point 😄

In that case, let's just go with a lambda here -- it seems simpler :)

Lib/threading.py

Co-authored-by: Alex Waygood <[email protected]>

AlexWaygood

LGTM, thanks! I'd love to check with a core dev more familiar with subinterpreters before merging, though (since this feature was specifically added to help with subinterpreter support).

@ericsnowcurrently, there's no reason why switching to a lambda rather than functools.partial could be problematic for subinterpreter support, is there?

danielhollas · 2024-01-24T15:14:12Z

@AlexWaygood thanks!

> @ericsnowcurrently, there's no reason why switching to a lambda rather than functools.partial could be problematic for subinterpreter support, is there?

Just a note: if this were a problem, we could still get away with it by doing neither. The function is (at least currently) called without any extra *args or **kwargs, so we could make _register_atexit less general and simply append the callback function directly to the _threading_atexits list.

AlexWaygood · 2024-01-31T09:28:13Z

I can't see a way in which this would cause problems — I'll go ahead and merge, since it's been a few days :)

Thanks @danielhollas!

ericsnowcurrently · 2024-02-01T21:11:29Z

> @ericsnowcurrently, there's no reason why switching to a lambda rather than functools.partial could be problematic for subinterpreter support, is there?

I'm not aware of any such reason.

Lib/threading.py

Avoiding an import of functools leads to 50% speedup of import time. Co-authored-by: Alex Waygood <[email protected]>

Speed up import of threading module

2a73912

Delayed import of functools leads to 50% speedup of import time.

bedevere-app bot added the awaiting review label Jan 23, 2024

bedevere-app bot mentioned this pull request Jan 23, 2024

Improve import time of various stdlib modules #109653

Closed

📜🤖 Added by blurb_it.

b70b7e5

Eclips4 added the performance Performance or resource usage label Jan 24, 2024

AlexWaygood reviewed Jan 24, 2024

Let's go with lambda instead of functools.partial

e76cec2

Co-authored-by: Alex Waygood <[email protected]>

AlexWaygood approved these changes Jan 24, 2024

bedevere-app bot added awaiting merge and removed awaiting review labels Jan 24, 2024

AlexWaygood requested a review from ericsnowcurrently January 24, 2024 14:18

danielhollas mentioned this pull request Jan 24, 2024

gh-109653: Improve import time of logging by lazy loading traceback #112995

Closed

danielhollas changed the title ~~gh-109653: Speedup import of threading module~~ gh-109653: Speedup import of threading module Jan 24, 2024

AlexWaygood merged commit 5e390a0 into python:main Jan 31, 2024
31 checks passed

bedevere-app bot removed the awaiting merge label Jan 31, 2024

danielhollas deleted the import-threading-speedup branch January 31, 2024 10:59

ericsnowcurrently reviewed Feb 1, 2024

aisk pushed a commit to aisk/cpython that referenced this pull request Feb 11, 2024

pythongh-109653: Speedup import of threading module (python#114509)

e7225f8

Avoiding an import of functools leads to 50% speedup of import time. Co-authored-by: Alex Waygood <[email protected]>
