bpo-41842: Add a unregister function in _codecs module #22360

Merged: 15 commits, Sep 28, 2020
11 changes: 7 additions & 4 deletions Doc/library/codecs.rst
@@ -163,11 +163,14 @@ function:
   :class:`CodecInfo` object. In case a search function cannot find
   a given encoding, it should return ``None``.

   .. note::

      Search function registration is not currently reversible,
      which may cause problems in some cases, such as unit testing or
      module reloading.

.. function:: unregister(search_function)

   Unregister a codec search function and clear the registry's cache.
   If the search function is not registered, do nothing.

   .. versionadded:: 3.10


While the builtin :func:`open` and the associated :mod:`io` module are the
recommended approach for working with encoded text files, this module
5 changes: 5 additions & 0 deletions Doc/whatsnew/3.10.rst
@@ -109,6 +109,11 @@ base64
Add :func:`base64.b32hexencode` and :func:`base64.b32hexdecode` to support the
Base32 Encoding with Extended Hex Alphabet.

codecs
------

Add :func:`codecs.unregister` to unregister a codec search function.

curses
------

10 changes: 10 additions & 0 deletions Include/codecs.h
@@ -27,6 +27,16 @@ PyAPI_FUNC(int) PyCodec_Register(
PyObject *search_function
);

/* Unregister a codec search function.

Unregister a codec search function and clear the registry's cache.
If the search function is not registered, do nothing.
*/

PyAPI_FUNC(int) PyCodec_Unregister(
PyObject *search_function
);

/* Codec registry lookup API.

Looks up the given encoding and returns a CodecInfo object with
7 changes: 7 additions & 0 deletions Lib/test/test_codecs.py
@@ -1641,6 +1641,13 @@ def test_register(self):
        self.assertRaises(TypeError, codecs.register)
        self.assertRaises(TypeError, codecs.register, 42)

    def test_unregister(self):
        search_function = mock.Mock(return_value=(1, 2, 3, 4))
        codecs.register(search_function)
        codecs.unregister(search_function)
        self.assertRaises(LookupError, codecs.lookup, "test")
        search_function.assert_not_called()

    def test_lookup(self):
        self.assertRaises(TypeError, codecs.lookup)
        self.assertRaises(LookupError, codecs.lookup, "__spam__")
@@ -0,0 +1 @@
Add :func:`codecs.unregister` to unregister a codec search function.
22 changes: 22 additions & 0 deletions Modules/_codecsmodule.c
@@ -68,6 +68,27 @@ _codecs_register(PyObject *module, PyObject *search_function)
    Py_RETURN_NONE;
}

/*[clinic input]
_codecs.unregister
    search_function: object
    /

Unregister a codec search function and clear the registry's cache.

If the search function is not registered, do nothing.
[clinic start generated code]*/

static PyObject *
_codecs_unregister(PyObject *module, PyObject *search_function)
/*[clinic end generated code: output=1f0edee9cf246399 input=dd7c004c652d345e]*/
{
    if (PyCodec_Unregister(search_function) < 0) {
        return NULL;
    }

    Py_RETURN_NONE;
}

/*[clinic input]
_codecs.lookup
    encoding: str
@@ -992,6 +1013,7 @@ _codecs_lookup_error_impl(PyObject *module, const char *name)

static PyMethodDef _codecs_functions[] = {
    _CODECS_REGISTER_METHODDEF
    _CODECS_UNREGISTER_METHODDEF
    _CODECS_LOOKUP_METHODDEF
    _CODECS_ENCODE_METHODDEF
    _CODECS_DECODE_METHODDEF
13 changes: 12 additions & 1 deletion Modules/clinic/_codecsmodule.c.h

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

20 changes: 20 additions & 0 deletions Python/codecs.c
@@ -50,6 +50,26 @@ int PyCodec_Register(PyObject *search_function)
    return -1;
}

int
PyCodec_Unregister(PyObject *search_function)
{
    PyInterpreterState *interp = PyInterpreterState_Get();
    /* Do nothing if codec_search_path is not created yet or was cleared. */
    if (interp->codec_search_path == NULL) {
        return 0;
    }

    Py_ssize_t n = PyList_Size(interp->codec_search_path);
    for (Py_ssize_t i = 0; i < n; i++) {
        PyObject *item = PyList_GetItem(interp->codec_search_path, i);
        if (item == search_function) {
            PyDict_Clear(interp->codec_search_cache);
            return PyList_SetSlice(interp->codec_search_path, i, i+1, NULL);
        }
    }
    return 0;
}

extern int _Py_normalize_encoding(const char *, char *, size_t);

/* Convert a string to a normalized Python string(decoded from UTF-8): all characters are
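The control flow of `PyCodec_Unregister` in this hunk can be modelled in pure Python. This is an illustrative sketch, not the interpreter's implementation: `model_unregister`, `search_path`, and `search_cache` are hypothetical stand-ins for the C function and the interpreter's `codec_search_path` list and `codec_search_cache` dict.

```python
def model_unregister(search_path, search_cache, search_function):
    """Pure-Python model of the PyCodec_Unregister logic in the diff."""
    # Nothing to do if the search path does not exist.
    if search_path is None:
        return
    for i, item in enumerate(search_path):
        # The C code compares pointers, i.e. identity rather than equality.
        if item is search_function:
            search_cache.clear()        # mirrors PyDict_Clear
            del search_path[i:i + 1]    # mirrors PyList_SetSlice(..., NULL)
            return                      # only the first match is removed


def first(name):
    return None


def second(name):
    return None


path = [first, second, first]
cache = {"some-encoding": object()}

model_unregister(path, cache, first)
after_first_call = list(path)
cache_after_first_call = dict(cache)

# An unregistered function leaves both structures untouched.
cache["other"] = object()
model_unregister(path, cache, lambda name: None)
after_noop_call = list(path)
```

Two properties follow from this structure: the match is by identity, so an equal but distinct callable is not removed, and the cache is cleared only when a matching function is actually found.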