
gh-105766: Add Locking Around Custom Allocators #105619

Conversation

ericsnowcurrently (Member) commented Jun 10, 2023

The "mem" and "object" allocators are documented as dependent on the GIL. However, after per-interpreter GIL landed we weren't enforcing that. This fixes that by using wrapping all allocations/frees with a dedicated runtime-global lock, but only if necessary. Note that we actually use the main interpreter's GIL as that lock, but only for interpreter's that have their own GIL. Doing so closely matches the original behavior.

ericsnowcurrently (Member, Author) commented:

@vstinner, any objections? I tried to do this in a way that would not penalize the main interpreter or interpreters that share the GIL. Likewise, allocators that only wrap the current allocator should not be affected.

markshannon (Member) commented:

The rule was (is?) that you need to hold the GIL to call PyObject_Malloc, etc. So why do we now need any locks (apart from the GIL)?

Presumably the presence of per-interpreter GILs means that allocators need to use per-interpreter state rather than process-wide state, but I don't see how this PR enforces that.

ericsnowcurrently (Member, Author) commented:

The allocators are process-global, so a per-interpreter GIL wouldn't guard against races between interpreters. Requiring that they be per-interpreter would mean we'd have to change the allocator API.
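
For context, the public allocator-hook API is process-wide; there is no per-interpreter variant. Roughly, from the documented C API:

```c
/* The documented allocator hook (see the CPython "Memory Management"
 * C API docs).  Installing one replaces the allocator for the entire
 * process, across all interpreters. */
typedef struct {
    void *ctx;                                       /* user context */
    void *(*malloc)(void *ctx, size_t size);
    void *(*calloc)(void *ctx, size_t nelem, size_t elsize);
    void *(*realloc)(void *ctx, void *ptr, size_t new_size);
    void (*free)(void *ctx, void *ptr);
} PyMemAllocatorEx;

/* e.g. PyMem_SetAllocator(PYMEM_DOMAIN_OBJ, &my_allocator); */
```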

ericsnowcurrently marked this pull request as ready for review on June 12, 2023 15:50
markshannon (Member) commented:

If the allocators are process-wide, there would need to be a lock on every call to ob_malloc, which would be disastrous for performance. So that doesn't sound right.

ericsnowcurrently (Member, Author) commented:

Yeah, we lock around every allocation when a custom, non-wrapper allocator is used in a subinterpreter that has its own GIL. I don't see what else we can do, aside from doing nothing (allowing races).

ericsnowcurrently (Member, Author) commented:

Maybe we just ask custom allocators to make sure they are thread-safe?
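
For illustration, "thread-safe" on the embedder's side could look like this (a hypothetical example, not part of this PR):

```c
#include <stdlib.h>
#include <pthread.h>

/* Hypothetical custom allocator that protects its own state with a
 * mutex, so it is safe even when called concurrently from
 * interpreters that do not share a GIL. */
static pthread_mutex_t my_alloc_lock = PTHREAD_MUTEX_INITIALIZER;

static void *
my_malloc(void *ctx, size_t size)
{
    pthread_mutex_lock(&my_alloc_lock);
    void *ptr = malloc(size);  /* stand-in for allocator-specific logic */
    pthread_mutex_unlock(&my_alloc_lock);
    return ptr;
}
```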

ericsnowcurrently changed the title from "gh-100227: Add Locking Around Custom Allocators" to "gh-105766: Add Locking Around Custom Allocators" on Jun 14, 2023
gpshead (Member) left a review comment:

Overall, yes, I think something like this PR is needed for the current state of affairs. Reusing the main runtime's GIL as the allocator lock is also what I'd assumed would be the natural first implementation.

```diff
@@ -603,6 +604,9 @@ init_interp_create_gil(PyThreadState *tstate, int own_gil)
     if (_PyStatus_EXCEPTION(status)) {
         return status;
     }
+    HEAD_LOCK(runtime);
+    runtime->allocators.num_gils++;
```
gpshead (Member):

Check for overflow: if the value is already INT_MAX pre-increment, we need to bail, even if that means a SystemError.
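
Something along these lines, perhaps (a sketch; the exact error path and message are illustrative, not from the PR):

```c
/* INT_MAX requires <limits.h>. */
HEAD_LOCK(runtime);
if (runtime->allocators.num_gils == INT_MAX) {
    HEAD_UNLOCK(runtime);
    /* Illustrative error path; the real status/message may differ. */
    return _PyStatus_ERR("too many per-interpreter GILs");
}
runtime->allocators.num_gils++;
```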

```diff
@@ -1730,6 +1734,11 @@ finalize_interp_delete(PyInterpreterState *interp)
     /* Cleanup auto-thread-state */
     _PyGILState_Fini(interp);

+    _PyRuntimeState *runtime = interp->runtime;
+    HEAD_LOCK(runtime);
+    runtime->allocators.num_gils--;
```
gpshead (Member):

Add an `assert(runtime->allocators.num_gils > 0);` before this.

```diff
@@ -30,6 +30,11 @@ struct _pymem_allocators {
         debug_alloc_api_t obj;
     } debug;
     PyObjectArenaAllocator obj_arena;
+    int num_gils;
```
gpshead (Member):

I suggest `unsigned int`.

```c
_PyMem_MallocLocked(void *ctx, size_t size)
{
    PyMemAllocatorEx *wrapped = (PyMemAllocatorEx *)ctx;
    if (_PyRuntime.allocators.num_gils > 1) {
```
gpshead (Member):

You lock updates (writes) to this value. Is it safe to read without atomic access or a lock? (Requiring an atomic read here would presumably be a performance hit.)

Rather than always checking, could the logic that does num_gils++ with the (main runtime) lock held also do:

```c
...
num_gils++;
if (num_gils == 2 && !has_locking_wrapper(...)) {
    maybe_add_locking_wrapper(...);
}
```

such that the wrapped-pointer switch happens upon first creation of an additional GIL, while the main runtime's GIL (the only GIL prior to this code running) is still held.

I'm not sure it is worth doing the opposite during finalization. Once wrapped due to per-subinterpreter GILs, just stay wrapped. At least the wrappers won't have this conditional anymore.

I suspect this thought is either overoptimization... or actually necessary to avoid locking/atomic access to num_gils.
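
Spelled out, the suggestion might look like this (a hypothetical sketch; `install_locking_wrapper()` stands in for whatever swaps the allocator function pointers):

```c
/* Called while the main runtime's GIL is still held, so plain
 * reads/writes of num_gils are safe here. */
static void
on_new_gil_created(_PyRuntimeState *runtime)
{
    runtime->allocators.num_gils++;
    if (runtime->allocators.num_gils == 2) {
        /* First additional GIL: swap in locked wrappers once.  After
         * this the wrappers never need to test num_gils per call, and
         * we never swap back during finalization. */
        install_locking_wrapper(&runtime->allocators);  /* hypothetical */
    }
}
```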

ericsnowcurrently (Member, Author) replied:

That's a great idea. I'll try it out.

bedevere-bot commented:

When you're done making the requested changes, leave the comment: I have made the requested changes; please review again.

gpshead (Member) commented Jul 27, 2023

ericsnowcurrently (Member, Author) commented:

I'm closing this. Instead, we'll specify that allocators must be thread-safe (at least when isolated subinterpreters are in play).
