Add `PyWeakref_IsDead()` to test if a weak reference is dead #48

colesbury · 2024-11-14T19:31:27Z

EDIT: Added -1 error return case per @vstinner's suggestion.

EDIT 2: @vstinner added a vote.

I propose adding a dedicated C API function to check if a weak reference is dead:

// Returns 1 if the pointed to object is dead, 0 if it's alive, and -1 with an error set if `ref` is not a weak reference.
int PyWeakref_IsDead(PyObject *ref);

Motivation

Prior to Python 3.13, you could check if a weak reference is dead via PyWeakref_GetObject(ref) == Py_None, but that function is now deprecated. You might try writing an "is dead" check using PyWeakref_GetRef. For example:

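int is_dead(PyObject *ref) {
    PyObject *tmp;
    // PyWeakref_GetRef() returns -1 on error, 0 if dead (tmp == NULL), 1 if alive.
    if (PyWeakref_GetRef(ref, &tmp) < 0) {
        return -1;
    }
    else if (tmp == NULL) {
        return 1;
    }
    // tmp is a strong reference; releasing it may run a destructor.
    Py_DECREF(tmp);
    return 0;
}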

In addition to not being ergonomic, the problem with this code is that the Py_DECREF(tmp) may introduce a side effect from calling a destructor, at least in the free-threading build, where some other thread may concurrently drop the last reference. Our internal _PyWeakref_IS_DEAD implementation avoids this problem, but it's not possible to reimplement that code using our existing public APIs.

This can be a problem when you need to check if a weak reference is dead within a lock, such as when cleaning up dictionaries or lists of weak references -- you don't want to execute arbitrary code via a destructor while holding the lock.
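
The hazard described above can be illustrated with a toy model in plain C. Nothing here is CPython code: `Obj`, `WeakRef`, `obj_decref`, `weakref_get_ref`, `weakref_is_dead`, and `is_dead_via_get_ref` are hypothetical names made up for this sketch, and the concurrent thread is simulated by an explicit call rather than by real threading.

```c
#include <assert.h>
#include <stddef.h>

/* Toy model only: none of these names are CPython APIs. */
typedef struct WeakRef WeakRef;

typedef struct Obj {
    int refcnt;      /* number of strong references */
    int *dtor_ran;   /* set to 1 by the "destructor"; stands in for arbitrary code */
    WeakRef *wr;     /* the one weak reference to clear when the object dies */
} Obj;

struct WeakRef {
    Obj *target;     /* NULL once the referent is dead */
};

/* Drop a strong reference; the last drop clears the weakref and runs the destructor. */
static void obj_decref(Obj *o) {
    assert(o->refcnt > 0);
    if (--o->refcnt == 0) {
        if (o->wr != NULL) {
            o->wr->target = NULL;
        }
        *o->dtor_ran = 1;
    }
}

/* Mirrors the PyWeakref_GetRef contract: 1 and a strong reference if alive,
   0 and NULL if dead. */
static int weakref_get_ref(WeakRef *wr, Obj **out) {
    if (wr->target == NULL) {
        *out = NULL;
        return 0;
    }
    wr->target->refcnt++;
    *out = wr->target;
    return 1;
}

/* Mirrors the proposed PyWeakref_IsDead: never takes a strong reference. */
static int weakref_is_dead(const WeakRef *wr) {
    return wr->target == NULL;
}

/* Liveness check built on get_ref. `other_owner` simulates a concurrent thread
   dropping its strong reference between our get_ref and our decref. */
static int is_dead_via_get_ref(WeakRef *wr, Obj *other_owner) {
    Obj *tmp;
    if (weakref_get_ref(wr, &tmp) == 0) {
        return 1;
    }
    if (other_owner != NULL) {
        obj_decref(other_owner);
    }
    obj_decref(tmp);  /* may now be the last reference: the destructor runs here */
    return 0;
}
```

In this model, `weakref_is_dead` never touches the reference count, so it cannot trigger the destructor. `is_dead_via_get_ref` reports the object as alive and then runs the destructor itself when the other owner lets go in between, which is the side effect you don't want while holding a lock.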

I've run into this in two C API extensions this week that are not currently thread-safe with free threading:

CFFI uses a dictionary that maps string keys to unowned references. I'm working on updating it to use PyWeakReference, but the "is dead" clean-up checks are more difficult due to the above issues.
Pandas cleans up a list of weak references. (The code is not currently thread-safe, and probably needs a lock.)

Vote

vstinner · 2024-11-15T09:58:52Z

Your rationale makes sense so adding int PyWeakref_IsDead(PyObject *ref) LGTM.

I would just add an error case: raise an exception (TypeError) and return -1 if the argument is not a weak reference object.

encukou · 2024-11-15T10:48:20Z

An alternative is allowing int PyWeakref_GetRef(ref, NULL).
IMO, we should generally default to allowing NULL for PyObject ** output arguments, so that we can skip refcounting when the user doesn't need that result. It's very common that functions like this have other interesting effects, and it seems suboptimal to keep adding a separate “check” function some releases after the “get” function.
(Note that the output argument case is very different from accepting NULL for a PyObject * argument -- that is something to avoid, since it can easily come from a failed API call.)

Unfortunately, allowing NULL now would mean that code tested on 3.14 will fail on 3.13. Even with that, I'd personally still slightly prefer PyWeakref_GetRef(ref, NULL).
(If we go that way, in 3.13.1+ it should fail with exception if it gets a NULL.)

colesbury · 2024-11-15T14:50:27Z

I've updated the issue to specify a -1 return value if the argument is not a weak reference object.

serhiy-storchaka · 2024-11-29T16:44:33Z

On one hand, PyWeakref_IsDead may be significantly more efficient. On the other hand, how often do you need to know that the reference was alive without getting its value? Note that the result can become obsolete right after obtaining it in a GIL-less build.

colesbury · 2024-11-29T16:51:28Z

@serhiy-storchaka - you need it any time you want to implement a collection of weakrefs (things like WeakValueDictionary) and things like that are pretty common. As I wrote above, I ran into this twice in a single week separately in pandas and cffi. You can find a number of other examples if you search GitHub.

It's not just a matter of efficiency. As I wrote in the issue:

In addition to not being ergonomic, the problem with this code is that the Py_DECREF(tmp) may introduce a side effect from calling a destructor, at least in the free-threading build, where some other thread may concurrently drop the last reference. Our internal _PyWeakref_IS_DEAD implementation avoids this problem, but it's not possible to reimplement that code using our existing public APIs.

This can be a problem when you need to check if a weak reference is dead within a lock, such as when cleaning up dictionaries or lists of weak references -- you don't want to execute arbitrary code via a destructor while holding the lock.

serhiy-storchaka · 2024-11-29T16:58:11Z

On the third hand, what is dead will remain dead forever. For the purpose of weakref collections this should be enough. I support this proposal.

vstinner · 2024-12-02T15:59:38Z

I added a vote in the first message: please vote :-)

serhiy-storchaka · 2024-12-02T16:25:52Z

Can we guarantee that it does not fail if the argument is a weakref?

vstinner · 2024-12-02T17:19:34Z

Can we guarantee that it does not fail if the argument is a weakref?

I think that it's a reasonable assumption, yes.

encukou mentioned this issue Nov 29, 2024

Return value conventions capi-workgroup/api-evolution#13
