
mlir: Func call reverse diff #2127

Merged
merged 7 commits into EnzymeAD:main on Oct 23, 2024
Conversation

Pangoraw (Contributor) commented on Oct 21, 2024:

Prompted by EnzymeAD/Reactant.jl#171 (comment).

We can now run enzyme-batch -> enzyme-wrap without inlining.

// ./bazel-bin/enzymexlamlir-opt test/lit_tests/batchtests/autodiff.mlir --enzyme-batch --enzyme-wrap="infn=main retTys=enzyme_active argTys=enzyme_active mode=ReverseModeCombined" --canonicalize --remove-unnecessary-enzyme-ops --arith-raise --cse --canonicalize --arith-raise --enzyme-simplify-math --mlir-print-ir-before=enzyme-batch
// -----// IR Dump Before BatchPass (enzyme-batch) //----- //
module {
  func.func @f(%arg0: tensor<f32>) -> tensor<f32> {
    %0 = stablehlo.multiply %arg0, %arg0 : tensor<f32>
    return %0 : tensor<f32>
  }
  func.func @main(%arg0: tensor<10xf32>) -> tensor<10xf32> {
    %0 = enzyme.batch @f(%arg0) {batch_shape = array<i64: 10>} : (tensor<10xf32>) -> tensor<10xf32>
    return %0 : tensor<10xf32>
  }
}

Running that pipeline produces the following output:

module {
  func.func @f(%arg0: tensor<f32>) -> tensor<f32> {
    %0 = stablehlo.multiply %arg0, %arg0 : tensor<f32>
    return %0 : tensor<f32>
  }
  func.func private @batched_f(%arg0: tensor<10xf32>) -> tensor<10xf32> {
    %0 = stablehlo.multiply %arg0, %arg0 : tensor<10xf32>
    return %0 : tensor<10xf32>
  }
  func.func @main(%arg0: tensor<10xf32>, %arg1: tensor<10xf32>) -> tensor<10xf32> {
    %cst = arith.constant dense<0.000000e+00> : tensor<10xf32>
    %0 = call @batched_f(%arg0) : (tensor<10xf32>) -> tensor<10xf32>
    %1 = stablehlo.add %arg1, %cst : tensor<10xf32>
    %2 = call @diffebatched_f(%arg0, %1) : (tensor<10xf32>, tensor<10xf32>) -> tensor<10xf32>
    %3 = stablehlo.add %2, %cst : tensor<10xf32>
    return %3 : tensor<10xf32>
  }
  func.func private @diffebatched_f(%arg0: tensor<10xf32>, %arg1: tensor<10xf32>) -> tensor<10xf32> {
    %cst = arith.constant dense<0.000000e+00> : tensor<10xf32>
    %0 = stablehlo.add %arg1, %cst : tensor<10xf32>
    %1 = stablehlo.multiply %0, %arg0 : tensor<10xf32>
    %2 = stablehlo.add %1, %cst : tensor<10xf32>
    %3 = stablehlo.add %2, %1 : tensor<10xf32>
    return %3 : tensor<10xf32>
  }
}

Note that @batched_f is differentiated into a separate @diffebatched_f, which the rewritten @main calls directly; no inlining is required.

std::vector<bool> volatile_args(narg, true);  // arguments may change between forward and reverse, so cache them
std::vector<bool> returnShadow(narg, false);  // do not return shadow values
std::vector<bool> returnPrimal(nret, false);  // do not return the primal results

Member:

We should probably default to always returning the primal, no?

Contributor (author):

I don't really understand how the primal would be used here.

Member:

Yeah, we would only need the primal for fused forward-and-reverse passes or for handling mutation; we can defer that for later here.

bool freeMemory = true; // generated code frees its own tape allocations
size_t width = 1;       // scalar mode, no vectorized derivatives

auto revFn = gutils->Logic.CreateReverseDiff(

Member:

Long term, we probably need to do the same augmented forward pass and separate reverse pass that Enzyme's LLVM mode does at the moment.

If we assume everything in the function is read-only and we only return active (not duplicated) results, this is fine.

Contributor (author):

Ah right, the caching at the primal call site is not enough if the arguments are mutable. I could not construct an example input with memrefs (something about invertPointerM), so I added an error if any result or argument of the initial call is mutable.
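
A minimal sketch of what such a guard could look like (the helper name and the memref-based mutability test are assumptions here, not the PR's exact code):

#include "mlir/Dialect/Func/IR/FuncOps.h"
#include "mlir/IR/BuiltinTypes.h"
#include "llvm/ADT/STLExtras.h"
using namespace mlir;

// Reject calls whose operands or results have mutable (memref-like) types,
// since the combined reverse pass here assumes value semantics.
static LogicalResult checkNoMutableOperandsOrResults(func::CallOp callOp) {
  auto isMutable = [](Type t) { return isa<BaseMemRefType>(t); };
  if (llvm::any_of(callOp->getOperandTypes(), isMutable) ||
      llvm::any_of(callOp->getResultTypes(), isMutable))
    return callOp.emitError("combined reverse mode does not yet support "
                            "calls with mutable (memref) arguments or results");
  return success();
}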

wsmoses (Member) left a comment:

LGTM, but with some misc fix-ups; also, the current implementation should throw errors when its assumptions are not met (we have an isReadOnly, for example).

@@ -29,9 +29,139 @@ namespace {
#include "Implementations/FuncDerivatives.inc"
} // namespace

static std::optional<func::FuncOp> getContainingFunction(Operation *orig) {
  Operation *parent;

Member:

Can this be a function interface instead?
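
For reference, a hedged sketch of that suggestion, assuming the helper only needs the nearest enclosing function (the call-site name comparison would then go through the symbol interface):

#include <optional>
#include "mlir/Interfaces/FunctionInterfaces.h"
using namespace mlir;

// Walk up to the nearest enclosing function without hard-coding func::FuncOp.
static std::optional<FunctionOpInterface> getContainingFunction(Operation *orig) {
  if (auto fn = orig->getParentOfType<FunctionOpInterface>())
    return fn;
  return std::nullopt;
}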


auto parent = getContainingFunction(orig);
if (parent.has_value() &&
    callOp.getCallee() == parent.value().getNameAttr()) {

Member:

Ah, I see this use; it is super fragile. For example, it won't work if a calls b and b calls a.

I think this is okay to merge as-is, but adding proper recursion support shouldn't be bad. We already have it for forward-mode Enzyme-MLIR, which basically just requires a cache in EnzymeLogic recording whether we've already made this derivative function.
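
A hypothetical sketch of such a cache (all names here are illustrative, not Enzyme's actual API): memoize generated derivatives keyed on the function and its activity signature, and register the new derivative before differentiating its body, so a recursive a -> b -> a chain finds the stub and emits a call to it instead of recursing into derivative generation forever.

#include <map>
#include <string>
#include <tuple>
#include <vector>
#include "mlir/Interfaces/FunctionInterfaces.h"

// Illustrative cache key: the callee plus a simplified stand-in for its
// per-argument activity information.
struct ReverseCacheKey {
  std::string fnName;
  std::vector<bool> argActivity;
  bool operator<(const ReverseCacheKey &o) const {
    return std::tie(fnName, argActivity) < std::tie(o.fnName, o.argActivity);
  }
};

// Lookup happens before generating a derivative; insertion happens before
// differentiating the body, so recursive requests hit the cache.
static std::map<ReverseCacheKey, mlir::FunctionOpInterface> cachedReverseFuncs;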

Pangoraw (Contributor, author) commented on Oct 23, 2024:

Yeah, any cycle in the call graph will not be caught here.

Member:

Yeah, EnzymeLogic should already do this for forward mode (and LLVM does, too).

Contributor (author):

I will look at implementing something like ForwardCachedFunctions for reverse in a follow-up.

wsmoses merged commit 11e9b9d into EnzymeAD:main on Oct 23, 2024.
12 of 27 checks passed
jedbrown added a commit that referenced this pull request Nov 1, 2024
* main: (49 commits)
  Fix iv of constant (#2141)
  Update benchmarks (#2035)
  Implement tgamma derivative (#2140)
  tgamma error improvement (#2139)
  Improve cache index error message (#2138)
  Fixes warnings and adds missing header guards (#2124)
  mlir: cache and reuse reverse funcs (#2133)
  mlir: implement forward mode for func.call (#2134)
  mlir: Func call reverse diff (#2127)
  Update build_tarballs.jl
  Fix combined temp cache for reverse (#2131)
  Improve runtime activity err message (#2132)
  Fix undef value storage (#2129)
  Adapt to const tblgen (#2128)
  Add gcloaded TT (#2125)
  Fix blas decl updater indexing (#2123)
  Add header files to ClangEnzyme target (#2062)
  Improve unknown function error messages (#2120)
  Fix handle sync (#2122)
  Support more Julia 1.11 functions (#2121)
  ...