Use ONNX Runtime (ORT) to run model inference on an Android device from Java. The measured inference time is 17 ms.
You can run this demo with the following steps:
- Export your model to the ONNX format following the official PyTorch tutorial.
- [optional] Optimize your ONNX model to the ORT format with the following command:
  `python -m onnxruntime.tools.convert_onnx_models_to_ort /path/to/onnx/model --optimization_style Runtime`
- Place your exported model file in `app/src/main/res/raw/`.
- Reference your exported model file in `MainActivity.java` (a session-creation sketch follows this list).
- Build the app and run it on your device. You can see the inference time in the `Run` panel.
- [optional] You can use Android NNAPI to accelerate inference on the device's hardware accelerators (GPU/NPU). You can enable it in `MainActivity.java` (see the NNAPI sketch after this list).
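For reference, here is a minimal sketch of how a model placed in `app/src/main/res/raw/` can be read and turned into an `OrtSession` from Java. The class name `OrtSessionFactory` and the resource id in the usage line are hypothetical placeholders, not names taken from this repo's `MainActivity.java`.

```java
import ai.onnxruntime.OrtEnvironment;
import ai.onnxruntime.OrtException;
import ai.onnxruntime.OrtSession;

import android.content.Context;

import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Sketch only: load the exported model from res/raw and create an ORT session.
public final class OrtSessionFactory {
    private OrtSessionFactory() {}

    /** Reads the model placed in app/src/main/res/raw/ into a byte array. */
    public static byte[] readModel(Context context, int rawResId) throws IOException {
        try (InputStream is = context.getResources().openRawResource(rawResId);
             ByteArrayOutputStream bos = new ByteArrayOutputStream()) {
            byte[] buffer = new byte[8192];
            int read;
            while ((read = is.read(buffer)) != -1) {
                bos.write(buffer, 0, read);
            }
            return bos.toByteArray();
        }
    }

    /** Creates an OrtSession directly from the model bytes (no file path needed). */
    public static OrtSession createSession(Context context, int rawResId)
            throws IOException, OrtException {
        OrtEnvironment env = OrtEnvironment.getEnvironment();
        byte[] model = readModel(context, rawResId);
        return env.createSession(model, new OrtSession.SessionOptions());
    }
}
```

Usage would look like `OrtSession session = OrtSessionFactory.createSession(this, R.raw.your_model);`, where `R.raw.your_model` stands for whatever file name you placed under `res/raw/`.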
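For the optional NNAPI step, the sketch below shows one way to enable the NNAPI execution provider through `OrtSession.SessionOptions.addNnapi()` before the session is created. The class name is again a hypothetical placeholder; `addNnapi()` throws an `OrtException` if the provider is not available in the package.

```java
import ai.onnxruntime.OrtEnvironment;
import ai.onnxruntime.OrtException;
import ai.onnxruntime.OrtSession;

// Sketch only: create a session with the NNAPI execution provider enabled.
public final class NnapiSessionFactory {
    private NnapiSessionFactory() {}

    public static OrtSession createNnapiSession(byte[] modelBytes) throws OrtException {
        OrtEnvironment env = OrtEnvironment.getEnvironment();
        OrtSession.SessionOptions options = new OrtSession.SessionOptions();
        options.addNnapi(); // route supported ops to NNAPI; unsupported ops fall back to CPU
        return env.createSession(modelBytes, options);
    }
}
```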
Example output:
```
D/MainActivity: ONNXRuntime available provider: CPU
D/MainActivity: ONNXRuntime available provider: NNAPI
D/ORTAnalyzer: InputName: data, JavaType: FLOAT, ONNXType: ONNX_TENSOR_ELEMENT_DATA_TYPE_FLOAT, Shape: [1, 3, 224, 224]
D/ORTAnalyzer: OutputName: mobilenetv20_output_flatten0_reshape0, JavaType: FLOAT, ONNXType: ONNX_TENSOR_ELEMENT_DATA_TYPE_FLOAT, Shape: [1, 1000]
D/ORTAnalyzer: Warmup time cost: 14 ms
D/ORTAnalyzer: Warmup time cost: 12 ms
D/ORTAnalyzer: Warmup time cost: 12 ms
D/ORTAnalyzer: Warmup time cost: 11 ms
D/ORTAnalyzer: Warmup time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 12 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 12 ms
D/ORTAnalyzer: Time cost: 13 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 11 ms
D/ORTAnalyzer: Time cost: 10 ms
D/ORTAnalyzer: Time cost: 10 ms
D/ORTAnalyzer: Time cost: 10 ms
D/ORTAnalyzer: Time cost: 9 ms
D/ORTAnalyzer: Time cost: 10 ms
D/ORTAnalyzer: Time cost: 10 ms
D/ORTAnalyzer: Average time cost: 11 ms
```
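The warmup and per-run numbers above are logged by the demo's `ORTAnalyzer`. The following is only a rough sketch of how such measurements could be taken, assuming an input named `data` with shape `[1, 3, 224, 224]` as shown in the log; the class `InferenceTimer` is a hypothetical name, not the demo's actual code.

```java
import ai.onnxruntime.OnnxTensor;
import ai.onnxruntime.OrtEnvironment;
import ai.onnxruntime.OrtException;
import ai.onnxruntime.OrtSession;

import android.util.Log;

import java.nio.FloatBuffer;
import java.util.Collections;

// Sketch only: time a fixed number of warmup and measured inference runs.
public final class InferenceTimer {
    private static final String TAG = "ORTAnalyzer";

    private InferenceTimer() {}

    public static void benchmark(OrtSession session, int warmupRuns, int timedRuns)
            throws OrtException {
        OrtEnvironment env = OrtEnvironment.getEnvironment();
        long[] shape = {1, 3, 224, 224};
        // Zero-filled input is enough for latency measurement.
        FloatBuffer input = FloatBuffer.allocate(1 * 3 * 224 * 224);

        try (OnnxTensor tensor = OnnxTensor.createTensor(env, input, shape)) {
            for (int i = 0; i < warmupRuns; i++) {
                long start = System.currentTimeMillis();
                session.run(Collections.singletonMap("data", tensor)).close();
                Log.d(TAG, "Warmup time cost: " + (System.currentTimeMillis() - start) + " ms");
            }
            long total = 0;
            for (int i = 0; i < timedRuns; i++) {
                long start = System.currentTimeMillis();
                session.run(Collections.singletonMap("data", tensor)).close();
                long cost = System.currentTimeMillis() - start;
                total += cost;
                Log.d(TAG, "Time cost: " + cost + " ms");
            }
            Log.d(TAG, "Average time cost: " + (total / timedRuns) + " ms");
        }
    }
}
```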
The libonnxruntime.so library from com.microsoft.onnxruntime:onnxruntime-android:1.11.0 used by the Java version is the same as the one used by the C++ version, yet the two submodules still can't be merged together. The C++ version uses the prebuilt libonnxruntime.so library copied directly from the onnxruntime-android (1.11.0) AAR file. The only difference from the Java version is that it can only load a model from a file path instead of a resource id; see the copy helper sketched below. What's more, because the Java version pays a JNI overhead, the latency of the C++ version will be slightly lower than that of the Java version.
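Since the C++ version accepts only a file path, a model bundled under `res/raw/` would first need to be copied out to the filesystem. A hypothetical Java helper for that copy might look like this (the class and method names are illustrative and not part of this repo):

```java
import android.content.Context;

import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

// Sketch only: materialize a raw resource as a real file so a file-path-only
// API (such as the C++ version) can open it.
public final class ModelFileExporter {
    private ModelFileExporter() {}

    /** Copies a raw resource into the app's files dir and returns its absolute path. */
    public static String exportToFile(Context context, int rawResId, String fileName)
            throws IOException {
        File outFile = new File(context.getFilesDir(), fileName);
        try (InputStream is = context.getResources().openRawResource(rawResId);
             OutputStream os = new FileOutputStream(outFile)) {
            byte[] buffer = new byte[8192];
            int read;
            while ((read = is.read(buffer)) != -1) {
                os.write(buffer, 0, read);
            }
        }
        return outFile.getAbsolutePath(); // pass this path to the file-path API
    }
}
```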