[WIP] Switch to using ProfiledTensorType #68
base: master
Conversation
```cpp
auto csizes = ptt->sizes().concrete_sizes();
TORCH_INTERNAL_ASSERT(csizes.has_value());
```
should we handle the case when this isn't true?
I think we shouldn't just create a TVMCompGroup in this case. Even though we can compile at runtime, if we get a workload where shapes change all the time it will be pretty wasteful. I'm looking into changing the Fuser to not fuse things that don't have ProfiledTensorTypes.
if it's alternating between 20 batch-size shapes but run 1M times, it's probably still worth compiling each shape
Hopefully, this will be handled via bailouts: we specialize for shape_set1; then if we see another frequent set, we will specialize for that one as well, and so on.
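For illustration, a minimal sketch of the guarded check this thread is asking for, rather than the hard assert; the helper name `canLowerToTVM` is hypothetical and not from the PR:

```cpp
// Hedged sketch: refuse to build a TVMCompGroup when the profiler never
// recorded concrete shapes, instead of asserting. `canLowerToTVM` is a
// hypothetical helper; ProfiledTensorType comes from torch::jit.
bool canLowerToTVM(const ProfiledTensorType* ptt) {
  auto csizes = ptt->sizes().concrete_sizes();
  if (!csizes.has_value()) {
    // No concrete shapes were profiled for this value; compiling a
    // shape-specialized TVM kernel here would be guesswork, so let the
    // JIT interpreter run the subgraph instead.
    return false;
  }
  return true;
}
```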
```cpp
kv.first->inferTypeFrom(kv.second.toTensor());
// TODO: convince Fuser to NOT create TVMCompilationGroups
// if ANY of subgraph inputs weren't profiled
TORCH_INTERNAL_ASSERT(kv.first->type()->cast<ProfiledTensorType>());
```
I don't think we need this, as it is done on line 111
```cpp
}
// bail-out mechanism: try to convert to Relay; if converting the graph
// fails for any reason (e.g. an op difference), then depending on the
// user's preference either throw or fall back to the JIT interpreter
// for execution
cache_ = TVMObject {};
```
at this point, if we run a graph twice with different sizes, will we still get TVM-compiled code each time?
Is the logic for that moved up into the profiled executor?
yup, we will bail out, profile again, and generate a graph with TVMCompGroups for the different shapes.
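Roughly, the bail-out path under discussion could look like the sketch below; `convertToRelay` and `compileRelayModule` are hypothetical stand-ins for the actual conversion entry points, and conversion failures are assumed to throw:

```cpp
// Sketch of the bail-out flow, under the assumptions above.
try {
  auto relay_mod = convertToRelay(subgraph);  // may throw on unsupported ops
  cache_ = compileRelayModule(relay_mod);     // cache_: an optional<TVMObject>
} catch (const std::exception& e) {
  std::cerr << "Relay conversion failed: " << e.what() << "\n";
  // Record the failure so a later call with the same inputs does not
  // pay for another conversion attempt that is known to fail.
  cache_ = TVMObject{};
  (*cache_).invalid = true;
}
```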
```cpp
<< e.what() << "\n";
}

if ((*cache_).invalid)
```
can this block be merged back into the block on line 260?
```cpp
cache_[spec].set_input = run_mod.GetFunction("set_input_zero_copy", false);
cache_[spec].kernel = run_mod.GetFunction("run", false);
cache_[spec].get_output = run_mod.GetFunction("get_output", false);
(*cache_).set_input = run_mod.GetFunction("set_input_zero_copy", false);
```
-> ?
arrgh, thanks! I forgot to switch it
```cpp
CompleteArgumentSpec spec{false, ArrayRef<IValue>(inputs)};

if (cache_.find(spec) == cache_.end()) {
if (!cache_ || (cache_ && (*cache_).invalid)) {
```
if cache_ is an optional, can we get rid of the invalid attribute somehow? it's pretty confusing
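One hypothetical way to drop the separate invalid flag is to fold the "compilation failed" state into the cached value itself, e.g. with a variant. This is a sketch, not the PR's code; `CompilationFailed` is an invented marker type:

```cpp
#include <optional>
#include <variant>

struct TVMObject { /* compiled TVM functions ... */ };
struct CompilationFailed {};  // invented marker: conversion already failed once

// Three states without a boolean flag:
//   nullopt            -> compilation never attempted
//   TVMObject          -> compiled and runnable
//   CompilationFailed  -> attempted and failed; go straight to the JIT
std::optional<std::variant<TVMObject, CompilationFailed>> cache_;

bool should_try_compile() { return !cache_.has_value(); }
bool can_run_tvm() {
  return cache_.has_value() && std::holds_alternative<TVMObject>(*cache_);
}
```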
```cpp
@@ -35,6 +36,7 @@ PYBIND11_MODULE(_torch_tvm, m) {
RegisterOperators op({Operator(
    tvm_sym,
    [](const Node* node) {
      GRAPH_DUMP("A graph passed to TVMCompiler\n", node->g(attr::Subgraph));
```
can we move this to a different diff?
sure
```cpp
{
  sizes.push_back(HalideIR::Expr(static_cast<int32_t>(size)));
}
} else if (optional_ivalue.has_value()) {
```
curious: in which case will we have a value type that is not a ProfiledTensorType, since we've all switched to the profiled graph executor?
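For context, a hedged sketch of the dispatch the snippet above implies: tensor inputs arrive as ProfiledTensorType with concrete sizes, while anything else would have to arrive as a constant IValue. Variable names (`value`, `optional_ivalue`, `sizes`) follow the snippet; the surrounding function is assumed:

```cpp
// value: a torch::jit::Value*; optional_ivalue: an optional<IValue>;
// sizes: a std::vector<HalideIR::Expr> -- as in the snippet above.
if (auto ptt = value->type()->cast<ProfiledTensorType>()) {
  auto csizes = ptt->sizes().concrete_sizes();
  TORCH_INTERNAL_ASSERT(csizes.has_value());
  for (const auto& size : *csizes) {
    sizes.push_back(HalideIR::Expr(static_cast<int32_t>(size)));
  }
} else if (optional_ivalue.has_value()) {
  // Non-tensor input (e.g. a scalar constant): lower the IValue directly
  // rather than reading shapes off a tensor type.
}
```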
```cpp
} catch (const std::exception& e) {
  (*cache_).invalid = true;
```
When we fall back to the JIT, it means some operators were not converted successfully due to operator semantic mismatches or other behaviors. Re-running the conversion every time for the same inputs will always fail, so I am not sure that flag is necessary.
exactly! we don't want to re-run if we already know it's going to fail! Re-running compilations might be pretty expensive
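In other words, the flag acts as a negative cache. A minimal sketch of the early-out it enables; `runWithJITInterpreter` is a hypothetical fallback helper, not the PR's API:

```cpp
// Skip recompilation entirely when a previous attempt already failed.
if (cache_ && (*cache_).invalid) {
  return runWithJITInterpreter(inputs);  // hypothetical fallback path
}
```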