
[TTIR -> TTNN] Fix to_layout op conversion to support chain of ops #880

Closed

svuckovicTT opened this issue Oct 9, 2024 · 1 comment
svuckovicTT commented Oct 9, 2024

Test that fails with: Assertion `utils::isOnHost(inputTensor) && "Calling ttnn::to_device on a device tensor"' failed.

func.func @softmax(%arg0: tensor<224x64xbf16>) -> tensor<224x64xbf16> {
  // CHECK: %[[C:.*]] = "ttnn.empty"[[C:.*]]
  %0 = tensor.empty() : tensor<224x64xbf16>
  // CHECK: %[[C:.*]] = "ttnn.softmax"[[C:.*]]
  // Check for positive dimension attribute
  %1 = "ttir.softmax"(%arg0, %0) <{dimension = 1 : si32, operand_constraints = [#l1_block_sharded_tile, #l1_block_sharded]}> : (tensor<224x64xbf16>, tensor<224x64xbf16>) -> tensor<224x64xbf16>
  // CHECK: %[[C:.*]] = "ttnn.empty"[[C:.*]]
  %2 = tensor.empty() : tensor<224x64xbf16>
  // CHECK: %[[C:.*]] = "ttnn.softmax"[[C:.*]]
  // Check for negative dimension attribute
  %3 = "ttir.softmax"(%1, %2) <{dimension = -1 : si32, operand_constraints = [#l1_block_sharded_tile, #l1_block_sharded]}> : (tensor<224x64xbf16>, tensor<224x64xbf16>) -> tensor<224x64xbf16>
  return %3 : tensor<224x64xbf16>
}
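
For reference, the failure above is standard glibc assert output. Reconstructed from the message, it corresponds to a runtime check of roughly this shape (utils::isOnHost is named in the message itself; the surrounding code is an assumption, not the actual runtime source):

  assert(utils::isOnHost(inputTensor) &&
         "Calling ttnn::to_device on a device tensor");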

The resulting TTNN IR looks like this:

  func.func @softmax(%arg0: tensor<224x64xbf16, #layout11>) -> tensor<224x64xbf16, #layout11> {
    %0 = "ttnn.get_device"() <{mesh_shape = #ttnn<mesh_shape 1x1>}> : () -> !tt.device<#device>
    %1 = "ttnn.to_layout"(%arg0, %0) <{layout = #ttnn.layout<tile>}> : (tensor<224x64xbf16, #layout11>, !tt.device<#device>) -> tensor<224x64xbf16, #layout12>
    %2 = "ttnn.to_device"(%1, %0) <{memory_config = #ttnn.memory_config<<block_sharded>, <l1>>}> : (tensor<224x64xbf16, #layout12>, !tt.device<#device>) -> tensor<224x64xbf16, #layout12>
    %3 = "ttnn.empty"(%0) <{dtype = #tt.supportedDataTypes<bf16>, layout = #ttnn.layout<row_major>, memory_config = #ttnn.memory_config<<block_sharded>, <l1>>, shape = #ttnn.shape<224x64>}> : (!tt.device<#device>) -> tensor<224x64xbf16, #layout13>
    %4 = "ttnn.softmax"(%2, %3) <{dimension = 1 : si32}> : (tensor<224x64xbf16, #layout12>, tensor<224x64xbf16, #layout13>) -> tensor<224x64xbf16, #layout13>
    %5 = "ttnn.to_layout"(%4, %0) <{layout = #ttnn.layout<tile>}> : (tensor<224x64xbf16, #layout13>, !tt.device<#device>) -> tensor<224x64xbf16, #layout12>
    %6 = "ttnn.to_device"(%5, %0) <{memory_config = #ttnn.memory_config<<block_sharded>, <l1>>}> : (tensor<224x64xbf16, #layout12>, !tt.device<#device>) -> tensor<224x64xbf16, #layout12>
    %7 = "ttnn.empty"(%0) <{dtype = #tt.supportedDataTypes<bf16>, layout = #ttnn.layout<row_major>, memory_config = #ttnn.memory_config<<block_sharded>, <l1>>, shape = #ttnn.shape<224x64>}> : (!tt.device<#device>) -> tensor<224x64xbf16, #layout13>
    %8 = "ttnn.softmax"(%6, %7) <{dimension = -1 : si32}> : (tensor<224x64xbf16, #layout12>, tensor<224x64xbf16, #layout13>) -> tensor<224x64xbf16, #layout13>
    %9 = "ttnn.to_memory_config"(%8, %0) : (tensor<224x64xbf16, #layout13>, !tt.device<#device>) -> tensor<224x64xbf16, #layout11>
    return %9 : tensor<224x64xbf16, #layout11>
  }

%6 = "ttnn.to_device" is the problematic call. to_device should be called only on tensors that aren't already on device.

svuckovicTT (Contributor, Author) commented:

Solved in #840.
