[Optimizer] Greedy solution for join nodes in L1 Interleaved policy #1162

fbajraktariTT · 2024-11-05T15:59:10Z

This PR introduces new MemoryLayoutAnalysis policy as an alternative to the DFSharding policy with the goal of greedily solving join nodes. This means that if an op is a join node (node with multiple operands, e.g. Add, Matmul,...) we should consider placing multiple op's operands in the L1 memory and limit ourselves to only one operand being placed in L1 memory as is it the case with DFSharding. For all the combination of operands, we pick the one that maximizes L1 usage (by maximising L1 usage we try to have as few DRAM spills as possible) and minimizes required L1 usage (minimal amount of L1 memory neccesseary in order to reach desired memory configuration). Furthermore, it should be emphasized that this may not lead to globally optimal solution because we pick the optimal config at moment of analysing the op.

nobradovictt · 2024-11-06T09:23:01Z

PR title does not correspond to changes made.

lib/Scheduler/Scheduler.cpp

odjuricicTT

As discussed offline, because it is a big change, let's make a few architectural adjustments first and then review the rest.

lib/Dialect/TTNN/Analysis/L1InterleavedPolicy.cpp

env/CMakeLists.txt

include/ttmlir/Scheduler/PrecedenceScheduler.h

include/ttmlir/Dialect/TTNN/Analysis/L1InterleavedPolicy.h

include/ttmlir/Scheduler/Scheduler.h

lib/Dialect/TTNN/Analysis/MemoryLayoutAnalysis.cpp

lib/Dialect/TTNN/IR/TTNNOpsAttrs.cpp

lib/Dialect/TTNN/Analysis/L1InterleavedPolicy.cpp

odjuricicTT

Left a few comments, mostly about readability. As @nobradovic mentioned, the PR title should try to explain what the change is in a way so that people not familiar with the work can uderstand. Maybe something like: "[Optimizer] Solve join nodes in L1 interleaved policy"

nobradovictt · 2024-11-27T13:19:28Z

lib/Dialect/TTNN/Analysis/L1InterleavedPolicy.cpp

+    }
+    l1ChainConfigs->back().build();
+    l1ChainConfigs->back().resolve();
+    std::unordered_set<Edge> memReconfigEdges;


Do we see a scenario in future where in L1Interleaved policy we will need reconfig edge?

It depends on the op and there may be a scenario where we need reconfigEdges. For example, if we greedily decide that the output of an op's operand should be in DRAM and there is enough available L1 memory this operand then we can insert a ToLayout op on that edge so we can gain on the performance. I assume this may be true depending on the op.

Yes but in this scenario it would not be coming from policy like for example in DFSharding, but only as an external override, because if policy found out that reconfigEdge is needed it could directly change OP output instead. Just thinking if it should be completely removed from L1Interleaved policy as it is currently only used as an empty dummy struct to satisfy API.

lib/Dialect/TTNN/Analysis/L1InterleavedPolicy.cpp

include/ttmlir/Dialect/TTNN/Analysis/L1InterleavedPolicy.h

nobradovictt · 2024-11-27T21:53:39Z

Change has no description.

include/ttmlir/Dialect/TTNN/IR/TTNNOpsAttrs.td

lib/Dialect/TTNN/Analysis/CMakeLists.txt

odjuricicTT

Looks good! Please add a description for the PR with a few sentances describing the PR contains. Do this for all future PRs.

nobradovictt reviewed Nov 6, 2024

View reviewed changes

lib/Scheduler/Scheduler.cpp Outdated Show resolved Hide resolved

nobradovictt requested review from odjuricicTT and mtopalovicTT November 6, 2024 09:28

fbajraktariTT force-pushed the v2_interleaved_policy branch 4 times, most recently from 5a24709 to daed82a Compare November 12, 2024 16:45

fbajraktariTT force-pushed the v2_interleaved_policy branch 2 times, most recently from e6f54a9 to 4d1c249 Compare November 18, 2024 17:21

fbajraktariTT linked an issue Nov 19, 2024 that may be closed by this pull request

Move MemoryLayoutAnalysisPolicyParams from TT to TTNN #1323

Closed

fbajraktariTT force-pushed the v2_interleaved_policy branch 3 times, most recently from 2036257 to 0cb6250 Compare November 25, 2024 15:28

fbajraktariTT marked this pull request as ready for review November 25, 2024 15:29

fbajraktariTT requested review from sdjordjevicTT, svuckovicTT, rpavlovicTT, jserbedzijaTT, jnie-TT, nsmithtt and mrakitaTT as code owners November 25, 2024 15:29

fbajraktariTT marked this pull request as draft November 25, 2024 15:32

odjuricicTT reviewed Nov 25, 2024

View reviewed changes

lib/Dialect/TTNN/Analysis/L1InterleavedPolicy.cpp Show resolved Hide resolved

env/CMakeLists.txt Outdated Show resolved Hide resolved

include/ttmlir/Scheduler/PrecedenceScheduler.h Outdated Show resolved Hide resolved

fbajraktariTT force-pushed the v2_interleaved_policy branch from 0cb6250 to 485bfa0 Compare November 26, 2024 08:40

fbajraktariTT marked this pull request as ready for review November 26, 2024 16:09

fbajraktariTT requested review from odjuricicTT and nobradovictt November 26, 2024 16:09