Skip to content

Latest commit

 

History

History
37 lines (25 loc) · 1.33 KB

hybrid_engine.md

File metadata and controls

37 lines (25 loc) · 1.33 KB

Hybrid Engine for ONNX Runtime and DLR

Introduction

ONNX Runtime and DLR are a DL library with limited support for NDArray operations. Currently, it only covers the basic NDArray creation methods. To better support the necessary preprocessing and postprocessing, you can use one of the other Engine along with it to run in a hybrid mode.

The minimum dependencies for the ONNX Runtime Engine are:

gradle

runtimeOnly "ai.djl.onnxruntime:onnxruntime-engine:0.10.0"
runtimeOnly "ai.djl.dlr:dlr-engine:0.10.0"

To use it along with Apache MXNet for additional API support, add the following two dependencies:

runtimeOnly "ai.djl.mxnet:mxnet-engine:0.10.0"
runtimeOnly "ai.djl.mxnet:mxnet-native-auto:1.7.0-backport"

You can also use PyTorch or TensorFlow Engine as the supplemental engine by adding their corresponding dependencies.

How Hybrid works

Internally, DJL will find two or more engines available. When you start using the ONNX Runtime Engine, the engine will search for additional Engines. If it finds any, all NDArrays will be created in the supplemental engine. After that, the NDArrays will only be converted to ONNX Runtime NDArrays right before inference.

If a second Engine is not found, the ONNX Runtime Engine will use its own NDManager and NDArray class to support limited NDArray creation methods.