Add optional dependency management #537

mauicv · 2022-06-21T13:47:15Z

TODO:

feature branch for tensorflow and pytorch optional dependencies
~~feature branch for numba optional dependencies~~
feature branch for prophet optional dependencies
~~Image branch (see notion)~~
docs branch
Update Licenses

What is this PR?

This PR addresses optional dependency import functionality for Alibi Detect. The core behaviour this adds is to throw errors informing the user if they haven't installed the necessary optional dependency for the functionality they wish to use. These errors are thrown at use time rather than at import.

There are three behaviours in general:

Optional dependency in a public object. In this case, we might have an object that can optionally use an optional dependency.
- In this case, if the object is used in a way that doesn't use this dependency no error should be thrown.
- If The object is used such that the optional dependency is used then the error should be thrown.
Conditionally Optional dependency in an object. As an example, the value backend should be one of 'tensorflow' or 'torch' but not neither.
- An error should be thrown if the user requests a backend that isn't installed.
Objects that are completely dependent on an optional dependency
- If the user imports and uses these then an error should be thrown

This PR addresses behaviours 1 and 3 above.

Note:

In order to manually test alibi-detect in each of the different environments developers can use:

make repl tox-env=<env>

and this will give them a REPL with the <env> optional dependencies installed.

For example: make repl tox-env=torch will give them the torch optional dependency environment REPL.

The main changes are:

The MissingDependency and optional_import functionality,
The refactoring of the codebase
The tests.

1. MissingDependency

The MissingDependency metaclass is used to allow imports that have missing dependencies. If these imports are used later they will throw an error.

Implementation

The basic pattern is to replace constructs that require uninstalled dependencies with something like:

class MissingDependency:
    def __init__(self, missing_dependency: str, object_name: str):
        self.object_name = object_name
        self.missing_dependency = missing_dependency

    def __getattr__(self, key):
        raise ImportError()

    def __call__(self, *args, **kwargs):
        raise ImportError()

We also use an import function which should be used throughout by developers when importing constructs dependent on optional dependencies in order to ensure consistent behaviour. This looks roughly like:

def import_optional(module_name: str):
    try:
        return import_module(module_name)
    except (ImportError, ModuleNotFoundError) as err:
        return MissingDependency(
            missing_dependency=err.name, 
            object_name=module_name)

The above is called with:

UsefulClass = import_optional('alibi_detect.utils.useful_class')

The above raises an error if the user attempts to access an attribute or call the object.

The error message informs the user that the UsefulClass object is missing optional dependencies.
It tells the user how to resolve using pip install alibi-detect[optional-dependency].
And finally it links to the original error thrown.

2. Refactoring:

The idea here is wherever the is an object that is dependent on an optional install the relevant functionality should be fenced off into a single file or collection of files. Access to that object should be via the __init__.py file. Within the __init__.py we will implement the MissingDependency pattern mentioned above.

Note on Type checking issue:

The import_optional function returns an object instance rather than an object. This will cause typechecking to fail
if not all optional dependencies are installed. Because of this we also need to 1. Conditionally import the true object
dependent on TYPE_CHECKING and 2. Use forward referencing within typing constructs such as Union. We use forward
referencing because in a user environment the optional dependency may not be installed in which case it'll be replaced
with an instance of the MissingDependency class. This will throw an error when passed to Union. See CONTRIBUTING.md note for more details and example.

3. Tests:

The tests import all the named objects from the public API of alibi and test that they throw the correct errors if the relevant optional dependencies are not installed. If these tests fail, it is likely that:

The optional dependency relation hasn't been added to the test script. In this case, this test assumes that your
functionality should work for the default alibi-detect install. If this is not the case you should add the exported object name to the dependency_map in the relevant test.
The relevant export in the public API hasn't been protected with the MissingDependency class. In this case, see the docs string for the utils.missing_dependency m1odule.

Notes:

The tests will be skipped in the normal test suite. To run correctly use tox. This should be as simple as running tox from the terminal. The environments will take a long time to build the first time but afterwards, this should be a quick process.
If you need to configure a new optional dependency you will need to update the setup.cfg file and add a testenv environment as well as to the extra-requires.txt file
Backend functionality may be unique to specific explainers/functions and so there may be multiple such modules that need to be tested separately.
We assume all public functionality is exposed in modules via the __init__.py files.
We assume all imports are top-level, if an import is nested within an object or function call this functionality will avoid being caught by the tests.

See also Further Notes

Merging 0.10.0rc1 back to master so that we can include the optional dependency work (#537) in the final 0.10.0 release.

Repeating #550 (which was reverted) to merge without squashing: "Merging 0.10.0rc1 back to master so that we can include the optional dependency work (#537) in the final 0.10.0 release."

review-notebook-app · 2022-07-07T13:58:54Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

* Add BackendValidator class * Protect ad, od and cd and other API objects from tensorflow and torch optional dependency import errors * Update import statements in notebooks

README.md

alibi_detect/utils/missing_optional_dependency.py

alibi_detect/utils/frameworks.py

* Add pydantic validator to check model wrt to the selected backend * protect `saving.tensorflow` imports with import_optional * Make error message more descriptive for preprocessing function missing from registry * Update README.md * Make `Detector` and `ConfigurableDetector` protocols for typing config-driven save and load functionality Co-authored-by: Ashley Scillitoe <[email protected]>

* Make prophet an optional dependency * Add check for missing prophet in save_detector_legacy

* Add pip install instructions for optional dependencies to docs * Update README.md

ascillitoe · 2022-07-22T15:52:38Z

alibi_detect/tests/test_dep_management.py

+    check_correct_dependencies(tensorflow_utils, dependency_map, opt_dep)
+
+
+def test_pytorch_utils_dependencies(opt_dep):


The other test functions are all named test_torch_... rather than test_pytorch_.... Change for consistency?

ascillitoe · 2022-07-22T15:55:02Z

alibi_detect/utils/missing_optional_dependency.py

@@ -0,0 +1,114 @@
+"""Functionality for optional importing
+This module provides a way to import optional dependencies. In the case that the user imports some functionality from
+alibi that is not usable due to missing optional dependencies this code is used to allow the import but replace it


Change alibi to alibi_detect? Maybe worth a quick grep to check if any other places where comments have been copied from the alibi implementation?

ascillitoe · 2022-07-22T15:56:57Z

Commenting here as can't comment on empty file. For alibi_detect/utils/tests/__init__.py, is it worth adding a comment to the file explaining why we need an empty __init__.py? Assuming its for duplicate conftest.py's...

ascillitoe

Few final comments. LGTM once resolved!

Before merging we should remove [WIP] from the PR name, and decide on whether to squash before merging. @jklaise @mauicv

* Add changelog for optional dependency work

bluepark-sk · 2022-09-05T04:18:31Z

Hi. Do you have any plan to add saving/loading functionality for PyTorch model/backend?
If you have, when will it be developed?

ascillitoe · 2022-09-05T08:06:24Z

Hi. Do you have any plan to add saving/loading functionality for PyTorch model/backend? If you have, when will it be developed?

Hi @bluepark-sk, we sure do. Time permitting, I will be beginning work on PyTorch save/load this week (for drift detectors only)!

bluepark-sk · 2022-09-06T00:58:57Z

@ascillitoe Thank you! I'll be waiting~~

mauicv mentioned this pull request Jun 21, 2022

Add saving and loading functionality for PyTorch backend #210

Closed

ascillitoe mentioned this pull request Jun 28, 2022

Revisit conda recipe #443

Open

ascillitoe self-requested a review June 30, 2022 14:05

ascillitoe added this to the v0.10.0 milestone Jul 6, 2022

ascillitoe mentioned this pull request Jul 7, 2022

Merging 0.10.0rc1 back to master #550

Merged

ascillitoe added a commit that referenced this pull request Jul 7, 2022

Merging 0.10.0rc1 back to master (#550)

d05ec7d

Merging 0.10.0rc1 back to master so that we can include the optional dependency work (#537) in the final 0.10.0 release.

ascillitoe mentioned this pull request Jul 7, 2022

Merging 0.10.0rc1 back to master #552

Merged

mauicv added 10 commits July 7, 2022 15:25

Add optional dependency functionality and tests

d8d6c6b

Add tox envs and test stubs

2915ccc

Update CONTRIBUTING.md

702c11e

Make numba a cor-dep, will be reverted in later PR

317806a

Add correct error messages for multiple optional dependency objects

0b8f869

Set license checks for just default dependencies

a56e6ad

Merge tensorflow and tensorflow-prob into single bucket

fc7618b

Fix flake8 errors and remove redundant test

1196c3b

Minor fix

642de25

Feature Optional backends functionality (#538)

a7c883d

* Add BackendValidator class * Protect ad, od and cd and other API objects from tensorflow and torch optional dependency import errors * Update import statements in notebooks

mauicv force-pushed the feature/dep-management branch from 5047c71 to a7c883d Compare July 8, 2022 08:36

remove numba optional dep

99d697a

This was referenced Jul 14, 2022

Add KeOps MMD detector #548

Merged

Add support for linear-time mmd estimator. #475

Open