From 5ad6ebb78d14c776381648be14193ae70ca85245 Mon Sep 17 00:00:00 2001 From: Jan Vesely Date: Fri, 15 Jan 2021 17:15:44 -0500 Subject: [PATCH] documentation: Add a short overview of PsyNeuLink compiltion Signed-off-by: Jan Vesely --- docs/source/Compilation.rst | 39 +++++++++++++++++++++++++++++++++++++ docs/source/Core.rst | 1 + docs/source/Services.rst | 1 + 3 files changed, 41 insertions(+) create mode 100644 docs/source/Compilation.rst diff --git a/docs/source/Compilation.rst b/docs/source/Compilation.rst new file mode 100644 index 00000000000..a763be57f49 --- /dev/null +++ b/docs/source/Compilation.rst @@ -0,0 +1,39 @@ +Compilation +=========== + +PsyNeulink includes a runtime compiler to improve performance of executed models. +This section describes the overview of the compiler design and its use. +The performance improvements varies, but it has been observed to be between one and three orders of magnitude depending on the model. + + +Overview +-------- + +The PsyNeuLink runtime compiler works in several steps when invoked via `run` or `execute`: +*Compilation*: + #. The model is initialized. This is step is identical to non-compiled execution. + #. Data structures (input/output/parameters) are flattened and converted to LLVM IR form. + #. LLVM IR code is generated to match to semantics of individual components and the used scheduling rules. + #. Host CPU compatible binary code is generated + #. The resulting function is saved as `ctypes` function and the parameter types are converted to `ctypes` binary structures. + +*Execution*: + #. parameter structures are populated with the data from `Composition` based on the provided `execution_id`. These structures are preserved between invocations so executions with the same `execution_id` will reuse the same binary structures. + #. `ctype` function from step 5. is executed + #. Results are extracted from the binary structures and converted to Python format. + + +Use +--- + +Compiled form of a model can be invoked by passing one of the following values to the `bin_execute` parameter of `Composition.run`, or `Composition.exec`: + + * `False` or `Python`: Normal python execution + * `LLVM`: Compile and execute individual nodes. The scheduling loop still runs in Python. If any of the nodes fails to compile, an error is raised. *NOTE:* Schedules that require access to node data will not work correctly. + * `LLVMExec`: Execution of `Composition.exec` is replaced by a compiled equivalent. If the `Composition` fails to compile, an error is raised. + * `LLVMRun`: Execution of `Composition.run` is replaced by a compiled equivalent. If the `Composition` fails to compiler, an error is raised. + * `True`: This option attempts all three above mentioned granularities, and gracefully falls back to lower granularity. Warnings are raised in place of errors. This is the recommended way to invoke compiled execution as the final fallback is the Python baseline. + +Note that data other than `Composition.run` outputs are not synchronized between Python and compiled execution. + + It is possible to invoke compiled version of `FUnction` s and `Mechanism` s. This functionality is provided for testing purposes only, because of the lack of data synchronization it is not recommended for general use. diff --git a/docs/source/Core.rst b/docs/source/Core.rst index b0828df0b17..56dd660972c 100644 --- a/docs/source/Core.rst +++ b/docs/source/Core.rst @@ -17,3 +17,4 @@ Core - `Registry` - `Preferences` - `json` + - `Compilation` diff --git a/docs/source/Services.rst b/docs/source/Services.rst index 142d725026b..29e3aead2b1 100644 --- a/docs/source/Services.rst +++ b/docs/source/Services.rst @@ -9,3 +9,4 @@ Services Registry Preferences json + Compilation