
Mark all TTIR and TTNN ops as pure #1481

Merged
@LPanosTT merged 1 commit into main from lpanos/mark_ops_pure on Dec 4, 2024
Conversation

@LPanosTT (Contributor) commented Dec 3, 2024

  • Added MLIR's Pure trait to all TTIR and TTNN ops, since they are pure SSA ops.
  • MLIR's built-in RemoveDeadValues pass can now effectively eliminate dead code from TTIR and TTNN modules (see the sketch below).
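
As a rough illustration of the effect (a hypothetical module; remove-dead-values is the upstream MLIR pass name, and it is assumed here that the pass is exposed through ttmlir-opt):

// Input: the result %1 of ttir.transpose is never used.
module {
  func.func @forward(%arg0: tensor<64x128xbf16>) -> tensor<64x128xbf16> {
    %0 = tensor.empty() : tensor<128x64xbf16>
    %1 = "ttir.transpose"(%arg0, %0) <{dim0 = 0 : si32, dim1 = 1 : si32}> : (tensor<64x128xbf16>, tensor<128x64xbf16>) -> tensor<128x64xbf16>
    return %arg0 : tensor<64x128xbf16>
  }
}

// ttmlir-opt --remove-dead-values input.mlir
// With the ops marked Pure, the dead transpose (and then the dead tensor.empty) can be erased:
module {
  func.func @forward(%arg0: tensor<64x128xbf16>) -> tensor<64x128xbf16> {
    return %arg0 : tensor<64x128xbf16>
  }
}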

@svuckovicTT (Contributor) commented:
Can you update TTNN_EmptyOp in TTNNOps.td?

// Note: NoMemoryEffect is used to indicate that operation can be removed if it is not used.
// Removal of this operation is done by the dead code elimination pass (RemoveDeadValuesPass).
def TTNN_EmptyOp : TTNN_Op<"empty", [NoMemoryEffect]> {

->

def TTNN_EmptyOp : TTNN_Op<"empty"> {

@LPanosTT force-pushed the lpanos/mark_ops_pure branch from 704c7dc to 4b52f8e on December 4, 2024 15:04
@LPanosTT (Contributor, Author) commented Dec 4, 2024

@sasha, I've removed the comment and redundant NoMemoryEffect trait 👍

@LPanosTT enabled auto-merge (squash) December 4, 2024 15:20
@LPanosTT merged commit 7f6046e into main on Dec 4, 2024
21 checks passed
@azecevicTT (Contributor) left a comment

Almost all of our ops are Pure, but not all. In my opinion the best solution would be to introduce a new class just below TTIR_Op, something like TTIR_PureOp, which all the other op classes would extend; ops that aren't pure would then be defined directly on TTIR_Op. Analogously for the TTNN dialect.
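
A minimal sketch of what that split might look like (TTIR_PureOp is the suggested name; the exact shape here is illustrative, not the actual definition):

// Base class: no implicit traits; non-pure ops (e.g. alloc/dealloc) would use this directly.
class TTIR_Op<string mnemonic, list<Trait> traits = []> :
    Op<TTIR_Dialect, mnemonic, traits>;

// Pure ops would go through this class, which appends Pure for them.
class TTIR_PureOp<string mnemonic, list<Trait> traits = []> :
    TTIR_Op<mnemonic, !listconcat(traits, [Pure])>;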

@@ -38,6 +39,6 @@ def TTIR_Dialect : Dialect {
 //===----------------------------------------------------------------------===//

 class TTIR_Op<string mnemonic, list<Trait> traits = []> :
-    Op<TTIR_Dialect, mnemonic, traits>;
+    Op<TTIR_Dialect, mnemonic, !listconcat(traits, [Pure])>;
Contributor commented:

Should the Pure trait be added at the top level? I'm thinking about alloc and dealloc. Every dealloc would trivially be removed, since it doesn't produce a result.

Contributor replied:

Good point, we should probably adopt the MemAlloc and MemFree memory effects for those, respectively.
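
A sketch of that direction (op names and trait spellings here are assumptions to be checked against the dialect and MLIR's SideEffectInterfaces.td; it also presumes the TTIR_PureOp split above so that TTIR_Op itself does not force Pure):

// Allocation/deallocation ops stay on the base class and declare their memory effects explicitly,
// so dead-value removal cannot silently drop a dealloc just because it has no results.
def TTIR_AllocOp : TTIR_Op<"alloc", [MemoryEffects<[MemAlloc]>]> {
  // ...
}

def TTIR_DeallocOp : TTIR_Op<"dealloc", [MemoryEffects<[MemFree]>]> {
  // ...
}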

@@ -45,6 +46,6 @@ def TTNN_Dialect : Dialect {
 //===----------------------------------------------------------------------===//

 class TTNN_Op<string mnemonic, list<Trait> traits = []> :
-    Op<TTNN_Dialect, mnemonic, !listconcat(traits, [TTNN_OpModelInterface, TTNN_WorkaroundInterface])>;
+    Op<TTNN_Dialect, mnemonic, !listconcat(traits, [Pure, TTNN_OpModelInterface, TTNN_WorkaroundInterface])>;
Contributor commented:

Same as above.

@sdjordjevicTT (Contributor) commented:
@LPanosTT, could you please address @azecevicTT's comments? I know the comment came after the merge, but it is still a valid concern, and we should address it sooner rather than later.

@LPanosTT (Contributor, Author) commented Dec 9, 2024

@azecevicTT, apologies for not responding earlier, but I agree with both you and Nick. We may as well be more precise in modelling memory effects if we're going to model them at all.

@azecevicTT (Contributor) commented:
@nsmithtt @LPanosTT I think we have an even bigger problem because of DPS. Take this example:

module {
  func.func @forward(%arg0: tensor<64x128xbf16>) -> tensor<128x64xbf16> {
    %0 = tensor.empty() : tensor<128x64xbf16>
    %1 = "ttir.transpose"(%arg0, %0) <{dim0 = 0 : si32, dim1 = 1 : si32, operand_constraints = [#any_device, #any_device]}> : (tensor<64x128xbf16>, tensor<128x64xbf16>) -> tensor<128x64xbf16>
    return %0 : tensor<128x64xbf16>
  }
}

Semantically, %1 should be an alias of %0 if I understand the idea behind DPS correctly, so return %0 should return %arg0 transposed. But because we are treating ttir.transpose as Pure and %1 is an unused SSA value, it will be removed, so return %0 will return the result of tensor.empty, which is incorrect.
We need some kind of alias analysis before we can remove values with no uses.

@sdjordjevicTT (Contributor) commented:
(Quoting @azecevicTT's DPS example and concern above.)

I believe we need to revert this and consider the proper solution. DPS ops are not considered pure by default.

@nsmithtt (Contributor) commented Dec 10, 2024

(Quoting @azecevicTT's DPS example and concern above.)

This is an ill-formed program if the intention is to do what you stated. DPS uses SSA form for exactly this reason: if you don't use the result of the op, it should rightfully be erased.

The correct way to express this semantically in ttir (i.e. DPS) is:

return %1 : tensor<128x64xbf16>
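
In full, the well-formed version of the example (the same module as above, changing only the returned value) would be:

module {
  func.func @forward(%arg0: tensor<64x128xbf16>) -> tensor<128x64xbf16> {
    %0 = tensor.empty() : tensor<128x64xbf16>
    %1 = "ttir.transpose"(%arg0, %0) <{dim0 = 0 : si32, dim1 = 1 : si32, operand_constraints = [#any_device, #any_device]}> : (tensor<64x128xbf16>, tensor<128x64xbf16>) -> tensor<128x64xbf16>
    return %1 : tensor<128x64xbf16>
  }
}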

See this section of the bufferization doc https://mlir.llvm.org/docs/Bufferization/#destination-passing-style.

Relevant paragraph:

DPS exists in itself independently of bufferization and is tied to SSA semantics: many ops are “updating” a part of their input SSA variables. For example the LLVM instruction insertelement is inserting an element inside a vector. Since SSA values are immutable, the operation returns a copy of the input vector with the element inserted.

So it follows that, in your example, returning %0 returns the tensor before it was "copied" into the result. SSA form is a very nice convenience for generically applying various graph passes, which is why DPS still tries to stay in SSA form, despite looking a little funky.

@azecevicTT (Contributor) commented:
@nsmithtt Thanks for the clarification, it makes more sense now. There are even some examples of DPS + Pure in https://mlir.llvm.org/docs/Dialects/TensorOps. We still need to address the cases of alloc and dealloc (and maybe some other ops), but the majority of ops should still stay Pure.
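
For instance (an illustrative snippet, not copied from those docs), upstream tensor.insert_slice is a DPS op that is nevertheless marked Pure: it "updates" its destination operand by returning the updated tensor as a fresh SSA result.

// Inserts %src into the top-left corner of %dest; the result is the updated copy.
%updated = tensor.insert_slice %src into %dest[0, 0] [64, 128] [1, 1]
    : tensor<64x128xbf16> into tensor<128x128xbf16>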
