Add Module-Based Model Runtime Interface for AOT (support C++ runtime) #9697

areusch · 2021-12-10T01:25:15Z

This PR implements Module-based Model Runtime for AOT RFC. It adds support for targeting the C++ runtime with the Ahead-of-Time Executor for workloads which are purely CPU-based (e.g. the Device API cannot be used).

cc @manupa-arm @kparzysz-quic @csullivan @mehrdadh @adstraw for early feedback while I iterate to prepare to merge this.

mehrdadh

Just few nit suggestions.

mehrdadh · 2021-12-10T20:33:15Z

CMakeLists.txt

@@ -390,6 +391,13 @@ if(USE_PROFILER)
  list(APPEND RUNTIME_SRCS ${RUNTIME_VM_PROFILER_SRCS})
 endif(USE_PROFILER)

+if(USE_AOT_EXECUTOR)


I suggest to add USE_AOT_EXECUTOR to cmake/config.cmake with default value ON

mehrdadh · 2021-12-10T21:35:29Z

tests/python/relay/aot/test_cpp_aot.py

+
+
+def test_conv2d():
+    RELAY_MODEL = textwrap.dedent(


I suggest to use relay directly to build the relay model. Here is an example:

tvm/tests/micro/zephyr/test_zephyr.py

Line 394 in 01599d1

y = relay.nn.conv2d(

mehrdadh · 2021-12-10T21:50:16Z

tests/python/relay/aot/test_cpp_aot.py

+    output_list = generate_ref_data(ir_mod, inputs, params)
+
+    with tvm.transform.PassContext(opt_level=3, config={"tir.disable_vectorize": True}):
+        mod = tvm.relay.build(ir_mod, params=params, target="c -executor=aot -link-params")


change this to use executor and runtime args?

* These were autogenerated in the original PR, but checking them in as plain code until we can revisit the auto-generator approach.

multiple-models case.

masahi · 2022-02-26T07:53:03Z

#10283

areusch force-pushed the aot-executor-no-codegen branch 2 times, most recently from 66cc5a8 to e879841 Compare December 13, 2021 04:50

mehrdadh reviewed Dec 14, 2021

View reviewed changes

areusch and others added 27 commits January 4, 2022 16:24

Move ShapeToJSON to utils.

0bb6fca

Return new Metadata from graph-level codegen.

5ccf3b0

Stack-allocate DLTensor instances when necessary.

55ca67c

Rename MetadataModule to ConstLoaderModule.

027078d

Add new Metadata classes and base implementation.

f338305

* These were autogenerated in the original PR, but checking them in as plain code until we can revisit the auto-generator approach.

Add runtime AOT executor module.

86cc6ac

Add AOT code-generation.

7bba41b

Remove old Metadata

acb56c8

compilation fixes in codegen?

d7d0518

replace MetadataModuleCreate

a01947a

Add a runtime Module to mux between .text Metadata and live Metadata.

1950d8b

Move launch_param to namespace

2e7123d

Add test of c++ AOT.

55dfb23

Fix c++ lint and formatting.

bd25708

DNS lint hacks, idk what's up here...

b714c29

Fix python formatting

f1fbed1

git-clang-format

d940826

fix span.h

96544e9

Move kTvmExecutor consts to runtime and fix improper references.

06e3b93

git-clang-format

c250f16

black format

ff1bd79

git-clang-format

7735395

Fix incongruity between kTvmRuntimeCrt constant

1f84fdf

fix segfault with devices

2732dc0

fix packed/c interface api restriction

0bedaad

Only emit __tvm_module_ctx when using C++ runtime; breaks

f5a268f

multiple-models case.

fixup! Return new Metadata from graph-level codegen.

574da50

areusch added 2 commits January 5, 2022 12:05

fixup! Stack-allocate DLTensor instances when necessary.

b16f605

fixup! Stack-allocate DLTensor instances when necessary.

94bcb1c

areusch force-pushed the aot-executor-no-codegen branch from 94a72c3 to 94bcb1c Compare January 5, 2022 20:06

areusch added 5 commits January 6, 2022 15:55

fix aot executor codegen for C

47bde10

random logging code

b976a2e

LLVM serializer

b8cb34e

switch to encoding using MetadataTypeIndex

d42e94d

Implement LLVM serializer.

5a52859

areusch mentioned this pull request Jan 26, 2022

[Tracking Issue] Support RPC-based execution with AoT Executor #10076

Closed

checkpoint

3deabb2

github-actions bot requested review from kparzysz-quic and manupak January 26, 2022 21:41

areusch added 6 commits February 4, 2022 15:58

emit cpacked_lowered in llvm codegen

fb9bcdd

emit tir::lookup_param node in llvm codegen

0b62680

expand test to cover llvm

3d28fd1

cpptests for Metadata class

5f0a02e

checkpoint

bb604e7

latest llvm for masa

cd1de42

masahi closed this Feb 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Module-Based Model Runtime Interface for AOT (support C++ runtime) #9697

Add Module-Based Model Runtime Interface for AOT (support C++ runtime) #9697

areusch commented Dec 10, 2021

mehrdadh left a comment

mehrdadh Dec 10, 2021

mehrdadh Dec 10, 2021

mehrdadh Dec 10, 2021

masahi commented Feb 26, 2022

Add Module-Based Model Runtime Interface for AOT (support C++ runtime) #9697

Add Module-Based Model Runtime Interface for AOT (support C++ runtime) #9697

Conversation

areusch commented Dec 10, 2021

mehrdadh left a comment

Choose a reason for hiding this comment

mehrdadh Dec 10, 2021

Choose a reason for hiding this comment

mehrdadh Dec 10, 2021

Choose a reason for hiding this comment

mehrdadh Dec 10, 2021

Choose a reason for hiding this comment

masahi commented Feb 26, 2022