[WIP] Unify Rust lib #4

nhynes · 2019-02-03T02:09:31Z

No description provided.

* a preliminary version is done?
* we no longer need the redundant hybrid/api.py
* support assert stmt
* cast supported
* intrin -> runtime; util is mainly in charge of compilation time
* assert statement
* fix python lint
* fix cpp lint
* on the way to module
* rollback .cc
* fix typo, no direct expose then
* @vinx13 ceil is added i guess?
* wip...
* temp commit
* fix import
* i preliminary version is done?
* on the way to build hybrid module
* nearly fixed...
* dumped python are equiv as original python
* on the way to bootstrap
* cpu bootstrap done
* bootstrap!
* fix lint
* fix doc
* resolve some review concerns
* support load/save
* fix lint
* thanks to xqdan fixed my typo
* fix build, make dump non-optional
* add vthread
* jesus why i added this

* First pass on ADTs
* Add doc string for tag field
* Visit constructors in TypeVisitor for TypeData
* Add to description of type call
* Add type call to type solving and unification
* Make type mutator for typecall consistent with others (only create new node if there's a change)
* Ensure kindchecking can handle type calls and typedata
* Fix bad nesting in module constructor
* Correctly construct call in typecall test
* Add call override for ordinary vars (do we want this?)
* Remove generalization hack from type inference because it was breaking ADT constructors
* Check that there are no free type vars in exprs after inferring type
* Free var checks need module because of ADT constructors
* Typecall test can't have unbound type var, make it global
* Uncomment tmap test and remove comments about failing to infer ret type; those work now
* Put in dummy visits for ADTs in graph runtime codegen to placate pylint
* Fix Relay type infer test module constructor
* Mark override for TypeCallNode in type solver
* Ensure free vars check treats pattern vars as bound
* Run interpreter in more ADT test cases
* Refactor kind check to return the kind, like typechecking
* Fix invalid typecall in test
* Add kind check to type inference, do not use nulls in func_type_annotation()!
* Redundant whitespace
* Make TypeData a separate kind
* Make ADT handles a separate kind too, document calling convention better
* Remove nats and tree from prelude, move to test, document prelude
* Restore and document nat and tree to prelude, add more tree tests
* Add alpha equality tests for match cases, fix variable binding bug
* Add more kind check tests for ADTs
* Add more tests for finding free or bound vars in match exprs
* Add unification tests for type call
* Update main() for alpha equality tests
* Add simple type inference test cases for match exprs and ADT constructors
* Add more ADT interpreter tests
* Allow incomplete types when typechecking match cases
* Type inference for pattern vars should use the type annotation if it's there
* Two more specific test cases for ADT matching
* Add option ADT to prelude
* Fix broken reference to kind enum
* Fix rebase snags
* Do not attach checked types to constructors
* More docstrings for module fields
* Use proper wrapper for indexing into module type data
* checked_type for constructors is not populated
* Expand type call docstring
* Rename PatternConstructor con field
* Use error reporter for pattern constructor case
* Condense error reporting in kind check, use error reporter
* Expand docstrings and rename ADT fields
* Rename 'option' ADT to 'optional' for consistency with Python
* Add various list iterators and utility functions to prelude
* Add smoke tests for new iterators in prelude
* Add concat to prelude
* Add smoke test for concat
* Correct docstrings in prelude
* Ensure that type defs are written in module initialization
* Various requested renamings
* Correct rebase snags
* Add kind check tests for ref types
* Update the main() for kind checking tests

…e#2897 (apache#2950) Many OpenCL platforms do not yet support OpenCL 2.0, so we use the 1.2 APIs, some of which are now deprecated. To silence the resulting deprecation warnings (which -Werror elevates to errors), we explicitly disable the 1.2 deprecation warnings. Once TVM requires OpenCL 2.0 as its minimum version, this commit can be reverted.
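The mechanism described in this commit message can be sketched without OpenCL headers. The snippet below is only an illustration under stated assumptions: every `DEMO_`/`demo_` name and `open_queue_for` are hypothetical stand-ins, not TVM or OpenCL identifiers. In the Khronos headers, the equivalent opt-in is, as far as I know, defining `CL_USE_DEPRECATED_OPENCL_1_2_APIS` before including `<CL/cl.h>`. Whether the commit does exactly that is not stated in the message above.

```c
#include <assert.h>

/* Opt in to the legacy API BEFORE the header section below is processed.
 * Assumption: with real OpenCL headers the matching switch would be
 * CL_USE_DEPRECATED_OPENCL_1_2_APIS, defined ahead of <CL/cl.h>. */
#define DEMO_USE_DEPRECATED_1_2_APIS

/* ---- stand-in for a vendor header (all names hypothetical) ---- */
#if defined(DEMO_USE_DEPRECATED_1_2_APIS) || !defined(__GNUC__)
#define DEMO_DEPRECATED_1_2 /* opted in: no deprecation attribute attached */
#else
#define DEMO_DEPRECATED_1_2 __attribute__((deprecated))
#endif

DEMO_DEPRECATED_1_2 int demo_create_queue(int device_id);
/* ---- end of stand-in header ---- */

/* Toy implementation of the "legacy" entry point. */
int demo_create_queue(int device_id) { return device_id + 1; }

/* Caller of the legacy entry point. Without the opt-in above, GCC/Clang
 * would emit a deprecation warning here, which -Werror turns into an error. */
int open_queue_for(int device_id) {
    assert(device_id >= 0);
    return demo_create_queue(device_id);
}
```

Removing the `#define DEMO_USE_DEPRECATED_1_2_APIS` line and compiling with `-Werror` should make the call in `open_queue_for` fail to build, which mirrors the "this commit can be reverted" note once the deprecated calls are gone.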

* relay op strategy fix lint bitpack strategy bitserial_dense (apache#6) * update strategy * address comments fix a few topi test Dense strategy (apache#5) * dense * add biforst; remove comments * address comment Refactor x86 conv2d_NCHWc (#4) * Refactor x86 conv2d * Add x86 depthwise_conv2d_NCHWc * Add back topi x86 conv2d_nchw * Merge x86 conv2d_nchw and conv2d_NCHWc * Minor fix for x86 conv2d fix more strategy Add x86 conv2d_NCHWc_int8 strategy (apache#8) * Add x86 conv2d_NCHWc_int8 strategy * Remove contrib_conv2d_nchwc_int8 * Fix generic conv2d_NCHWc for int8 * Fix topi arm_cpu conv2d_NCHWc_int8 update x86 conv2d enable specify relay ops to be tuned for autotvm add cuda conv2d strategy add conv2d strategy for rocm add conv2d strategy for hls add conv2d strategy for arm cpu add conv2d strategy for mali add conv2d strategy for bifrost add conv2d strategy for intel graphics clean up and fix lint remove template keys from autotvm remove 2 in the func name address comments fix * fix bugs * lint * address comments * add name to op implement * Modify topi tests (apache#9) * Add pooling, reorg, softmax and vision * Add lrn * fix topi test * fix more topi test * lint * address comments * x * fix more tests & bugs * Modify more tests (apache#10) * Modify tests for bitserial_conv2d, bitserial_dense, bitserial_conv2d_rasp and bnn * Minor fix * More minor fix * fix more test * try to update vta using strategy * fix cpptest * x * fix rebase err * Fix two tests (apache#11) * change autotvm log format * lint * minor fix * try fix vta test * fix rebase err * tweak * tmp hack for vta pass * fix tutorial * fix * fix more tutorials * fix vta tutorial * minor * address comments * fix * address comments * fix cpptest * fix docs * change data structure name and api * address comments * lint * fix rebase err * updates * fix winograd test * fix doc * rebase * upgrade tophub version number * fix bug * re-enable vta tsim test after tophub is upgraded * fix vta test to use the correct 
args so the config can be found in tophub Co-authored-by: Yao Wang <[email protected]>

…generating (apache#5962) * Code migration Start (#1) * Init commit: Code migration Start * Add loop_state.cc/h * Add ComputeDAG basic test * Split transform_step out & Update more UTs (#3) * Split transform_step out * Update GetProducers & GetConsumers * Update UTs * Add UT for CacheReadWrite & Some bug fix * Add search_task, measure and serialization (#4) * Add FollowSplit & FollowFusedSplit tests * Update dag.InferBound & its UT * Add search_task, measure and serialization * Update Serialization UT * Add MetaTileRewritePolicy (apache#5) * Add feature * Add cost_model, meta_tile_rewrite_policy * Add MetaTileRewritePolicy basic UT * Basic Python API for State (apache#6) * Add Basic Python API for State * Add UTs for State * Add Python API: Measure & Task (apache#7) * Update the return value of state operation * Add task * Copy measure.py & utils.py * Fix LocalBuilder * Fix LocalRunner * Add ansor.auto_schedule() API; First AutoSchedule working version(apache#8) * Add basic Python support for ansor.auto_schedule * Update AutoSchedule API * Bug fix for get the attach point of a fused iter * Update UT after infer bug fix * Bug fix & Add python serialization API (apache#10) * Delete C++ UT hack since Python is ready * Add ndarray.non_empty * Update Serialization python API * Improve code style, python wrapper and test cases (apache#11) * Update c++ code style and unit test * Update python State wrapper and test cases * fix unit tests * Add RPCRunner & OpenCL/CUDA test (apache#12) * Add RPCRunner & OpenCL search test * Add CUDA search test * Add RPCRunner test * rebase to upstream/master * Add Ansor basic tutorial (apache#13) * Add basic tutorial * migrate feature extraction (apache#14) * Add XGBModel & RPCRunnerWarpper (apache#15) * Add XGBModel & RPCRunnerWarpper * Revert "Add Parallel Granularity Mutation" * Migrate workload_registry.py (apache#16) * add workload registry * update * update * add task scheduler (apache#17) * Add conv2d cuda tutorial with workload 
registry (apache#18) * add tune_test.py (the old tune_wkl.py) (apache#19) * add tune_test.py (the old tune_wkl.py) * update * fix measure * fix for gpu * Code refine for tune_test.py & Add a pre load callback (apache#20) * Bug fix for tutorials * Add PreLoadMeasuredStates * Add search_callback support for task tuner * Code refine for tune_test.py * Update * Update * Update * Update * Bug fix * Add python custom sketch rule (apache#21) * Add custom sketch rule * Bug fix * Ansor Relay Integration (without layout rewrite) (apache#22) * relay integration * Add tune_op_subgraph.py & Some code clean for tune_network.py (apache#23) * Add single op tune scripts * Add tune subgraph support * Merge all op & all subgraph to one file * Rename file * add explicit_unroll_max_extent (apache#25) * Add Index simplification & API update (apache#26) * Add vectorized cooperative_fetching test * Update math simplify for vectorized CF * File rename * Update tune_network * API update * Update PreLoadMeasuredStates & Some bug fix (apache#27) * Add a threading wrapper to fix the test bug * Set default TVM_USE_AUTO_SCHEDULER to false * Update PreLoadMeasuredStates callback * Add tensorize step for loop_state (apache#31) * Add tensorize step * State python api update (apache#33) * Start to update api * Add compute_dag to state * API update * kernel layout rewrite (apache#28) * kernel layout rewrite * remove some hacks * add defuse_ops pass and move kernel_layout_rewrite pass after fuse_ops pass * set TVM_RELAY_DISABLE_BUILD_CACHE for task extraction and prepare_layout_rewrite * [cache flush] port cache flush to ansor (apache#32) * Improve relay integration (apache#34) * tmp checkpoint * Improve relay integration * Improve relay integration * Fix xgb error & Simplify dispatcher (apache#35) * Rename "MetaTileRewritePolicy" to "SketchPolicy". (apache#36) * Rename "MetaTileRewritePolicy" to "SketchPolicy". 
* Add a new class for auto_unroll_max_step, storage_offset in StageNode * fix tune_op_subgraph.py * rebase * Migrate all node::make to noderef's construct function (apache#37) * Start to move xxxnode::make to noderef() * Update * Update * Finish transform_step * Finish comute dag & auto schedule * Update * Update * Update * Update * Update * Code refine * Code refine * Code refine * Update * Update * Some lint fix & Recover the double constructor of tvm::PrimExpr (apache#39) * lint fix * clang-format-fix * pylint fix * Update * Recover the double constructor of tvm::PrimExpr * Fix pylint * pylint fix * pylint fix * Add MutateComputeLocation and MutateParallel in evolutionary search (apache#40) * Add MutateComputeLocation and MutateParallel in evolutionary search * fix lint * Improve loop state python API (stage_tensors -> stage_ops) (apache#41) * improve loop state python API (stage_tensors -> stage_ops) * fix * ComputeDAG bug fix & Add Custom TensorCore Matmul Example (apache#42) * Bug Fix * Sample example of Custom TensorCore Matmul * Rever Commits, Start to build minimum Ansor system * Code clean for minimum Ansor system * Bug fix & Delete AccessAnalyzer * Delete attachmap & Code clean * Doc update Update statenode::stages from vector to Array * Headfile update & Python doc update * clang-format fix * pylint fix * Update * Doc update * Update * Bug fix after code merge to the new master * clang-format fix * Update * Update * Update std::vector to Array; Update verbosity setting; Some commemts addressed * std::vector->Array & std::string->String * Add init_state to ComputeDAG * Update * Update some unordered_map to Map * clang-format fix * Comments addressed Delete ReplayAndInferBound Delete ReplaySteps & InferBoundCommon * Lint fix * Update * Update * Update * Update * Update * Update * Update * Update * Update * Rename ansor namespace to auto_schedule * Update * Rename ThreadPool to ParallelFor * Add parallel_for * Remove ThreadPool * Update 
python/tvm/auto_schedule/auto_schedule.py * trigger CI Co-authored-by: Lianmin Zheng <[email protected]> Co-authored-by: Minmin Sun (孙敏敏) <[email protected]> Co-authored-by: Zhao Wu <[email protected]>

abergeron and others added 20 commits February 9, 2019 09:55

Conda packages with cuda support (apache#2577)

919bea8

[IR] Update HalideIR (apache#2582)

89deaa6

[TUTORIAL] Fix downloaded file path (apache#2590)

77718f8

A couple of fixes for GEN (apache#2593)

ec3a425

Tighten buffer bound for TensorComputeOp by improving EvalSet on ranges (apache#2565)

7e6cba4

Fix typo (apache#2595)

ebcad89

thanks @icemelon9, this is merged

[AUTOTVM][RELAY][DOCS] relay ports of tune_nnvm_* autotvm tutorials (apache#2594)

129eb64

[TEST] Remove script that references previously removed content. (apache#2596)

7e2a9fc

[RUNTIME] Enable NDArray type extension (apache#2598)

6b0157b

[TOPI][CUDA] Add faster-rcnn proposal op (apache#2420)

d20646c

* [TOPI][CUDA] Add faster-rcnn proposal op
* Fix doc
* Add global barrier
* Use vthread in argsort
* Update sort and nms ir
* Fix lint
* Update sort ir in ssd nms

[TVM][Bugfix] fix storage_rewrite bug when input is big (apache#2580)

326fff5

* fix storage_rewrite bug when input is big
* cast when necessary
* simplification
* simplification
* int64->uint32
* revert uint32->int64

[DOCS] update titles to reflect tutorial content (nnvm vs. relay) (apache#2597)

895ef97

* update titles to reflect tutorial content (nnvm vs. relay)
* move things around
* fix typo

[Relay] Reference (apache#2489)

d05fed2

* move fix test fix lint fix test add more code fix lint better type infer ability
* fix build
* address comment

Version 0.5 (apache#2604)

18c36ab

* Version 0.5
* update version.py
* update news
* update news
* update news

fix get layout in to_relay (apache#2610)

c716542

[Quantize] Skip for same input-output domain scale. (apache#2611)

66cd036

[RELAY][DOCS] Port from_mxnet tutorial to relay (apache#2608)

231bac2

* check in
* update build and run

[RELAY][TOPI] alter_op_layout for x86 (apache#2602)

cdf8dff

* alter_op_layout for x86
* cleanup
* cleanup
* fix lint
* fix lint
* fix lint
* fix lint
* change support level
* change other support levels

nhynes force-pushed the rust-next branch 2 times, most recently from 4877614 to 2f70ae7 Compare February 18, 2019 01:35

Laurawly and others added 8 commits February 18, 2019 18:45

[Bugfix] Nms_ir data_race solved (apache#2600)

fef7282

* nms data race solved
* tst_topi_vision reference results are gonna be updated in PR apache#2353
* proposal nms_ir updated

Fix the FInplaceIdentity (apache#2572)

8518c7d

[Relay][Docs] Documentation for Algebraic Data Types (apache#2575)

d7390d1

Fix issue mutating if expressions (apache#2601)

f6be4d6

[EXPR] Expression-template based pattern matching. (apache#2589)

255c187

[TVM][LANG] Add eager simplification for operations with FloatImm (apache#2615)

c59a78e

* Add eager simplification for FloatImm
* fix
* fix lint
* Fix gcc warning
* fix
* Add test case

remove batch_norm_inference (apache#2626)

80f8e98

[RELAY][OP] ROI Align (apache#2618)

8b1d07f

vinx13 and others added 15 commits April 1, 2019 15:57

[Relay, Quantization] Quantize all fields of concatenate (apache#2913)

ee95f6c

Fix makedirs() condition in contrib. (apache#2942)

fbd1c16

[Relay][OP] Gather_nd exposed to relay (apache#2945)

ae21edd

* gather_nd added
* gather_nd test added
* more test added
* fix lint
* fix build error
* fix lint
* comments addressed

Add missing #!/bin/bash directive. (apache#2951)

a42ad8e

[Bugfix] Bilinear resize bug fix from PR apache#2777 (apache#2857)

1dab4dc

* error fixed
* rename
* solve conflicts with master
* more test added
* fix error
* remove test
* comment addressed

[Relay][OP] Fix bias_add default axis (apache#2829)

71abe36

* Fix bias add default axis
* update
* Fix canonicalize ops for bias_add

[Rust] Unify types between bindings and pure Rust impl (apache#2616)

4968279

[Relay][Frontend] Support TF Gather (apache#2935)

38151ab

* [Relay][Frontend] Support TF Gather
* fix comments

[Relay][Frontend] Support tf.where (apache#2936)

eb82e7b

* [Relay][Frontend] Support tf.where
* fix comments

[Relay][Frontend] Adding ADD operator to tflite frontend for compiling the MobileNetV2 (apache#2919)

e68874d

[RUST] Remove empty ty.rs (apache#2958)

da1fd7a

fix undefined reference to dlopen, etc (apache#2957)

cefe07e

[TOPI] bitserial_conv2d move to autotvm template and updates (apache#2819)

1735187

Removed std::unary_function because it is deprecated and removed in newer c++ (https://en.cppreference.com/w/cpp/utility/functional/unary_function) (apache#2962)

3441b95

nhynes force-pushed the rust-next branch 2 times, most recently from 11feb29 to 0257ddc Compare April 5, 2019 04:45

TVMPODValue macro

b741271

nhynes force-pushed the rust-next branch from 0257ddc to b741271 Compare April 5, 2019 04:47

nhynes added 2 commits April 5, 2019 05:04

Update runtime to use new packedfunc

ab4f7dd

Update frontend

e7dc0f6

nhynes force-pushed the rust-next branch 3 times, most recently from 801da1f to 6698f72 Compare April 5, 2019 08:03

Update tests

0a9ccab

nhynes force-pushed the rust-next branch from 6698f72 to 0a9ccab Compare April 5, 2019 08:28

to_tvm_value

d371edd

nhynes force-pushed the rust-next branch from 784081e to d371edd Compare April 5, 2019 23:22
