
Merge main -> google #5078

Merged 18 commits into google from main on Mar 12, 2021
Conversation

ThomasRaoux (Contributor)

benvanik and others added 16 commits March 10, 2021 17:56
Would be really, really nice to have an ASAN bot.
shapex.tie_shape now implements the shape-carrying op interface and is no longer checked
for specifically. This allows any op to carry ranked shape-compatible
information. Its usage is kind of like ViewLikeOpInterface in that the
op stores the dynamic dimensions and can return them as needed via the
interface.
This removes most of the tie_shape uses outside of dispatch regions.
It required reworking ClosureOpDce so that the shape-aware ops can still
be optimized. Future changes will start edging us towards handling
shapes with this interface such that the ties are only required when
interoping with code that is not shape-aware.
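As a rough sketch of the difference (the shape-aware op here is hypothetical and
the `shapex` syntax is only approximate):
```mlir
// Before: the dynamic dimension had to be attached with shapex.tie_shape,
// which passes then special-cased.
%shape = shapex.make_ranked_shape %dim : (index) -> !shapex.ranked_shape<[?,4]>
%tied = shapex.tie_shape %value, %shape : tensor<?x4xf32>, !shapex.ranked_shape<[?,4]>

// After: a shape-aware op stores %dim itself and returns it through the
// interface on demand, much like ViewLikeOpInterface exposes its source.
%result = "my.shape_aware_op"(%value, %dim) : (tensor<?x4xf32>, index) -> tensor<?x4xf32>
```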
The ClosureOpInterface canonicalizer does this now.
This avoids any globs of source files in CMake (which are
[discouraged](https://cmake.org/cmake/help/latest/command/file.html#glob))
and any globs being evaluated in bazel_to_cmake (which would make it
depend on files other than the BUILD file). It still allows the safety
check that you actually included all the files you meant to (which is
particularly useful with tests, where a failure to do so means the test is
skipped forever instead of an immediate build failure). The cost is
having to explicitly list a new source file when you add it, which
seems not so bad.
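As a sketch of what this looks like in practice (the target and file names here
are made up), a rule now lists its sources explicitly instead of globbing them:
```python
# BUILD (Bazel) -- hypothetical target; bazel_to_cmake mirrors the explicit list.
cc_library(
    name = "Transforms",
    srcs = [
        # Adding a new file means touching the BUILD file, but a forgotten
        # test source now breaks the build instead of being silently skipped.
        "ConvertToFlow.cpp",
        "DispatchRegionFormation.cpp",
    ],
    hdrs = ["Passes.h"],
    # was: srcs = glob(["*.cpp"]),
)
```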

I didn't remove bazel_to_cmake support for glob yet, so I can clean up
any stragglers after this lands.

Fixes iree-org#1083
…ups (iree-org#4999)

This commit adds all necessary plumbing to connect 2-D convolution
and matmul vectorization inside flow.dispatch.workgroups. This includes:

* Let 2-D convolution be recognized as a root op during dispatch
  region formation.
* Add a pass to concretize the abstract tiling and distribution
  during flow dispatch region formation. It substitutes symbolic
  ops with concrete values from CodeGen policy.
* Recognize `hal.interface.workgroup.id` ops when folding GPU
  processor ID uses.
* Recognize `hal.interface.binding.subspan` when vectorizing
  `memref`s for better memory access.

Along the way, the old path is refactored to have a better structure
for further cleanup:

* `LinalgTileAndDistributePass` is split out of `LinalgTileAndFusePass`.
  It will be used for tiling and distribution among workgroups in the
  old path. `LinalgTileAndFusePass` will be for tiling and vectorization
  in a single workgroup (and it will be renamed later).
* A few tests are updated to use the new path.
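For reference, a minimal sketch of the `hal.interface.*` ops mentioned in the
last two bullets (binding symbols and exact syntax are illustrative, not taken
from this change):
```mlir
// Inside a hal.executable entry point, after dispatch region translation:
%c0 = constant 0 : index
// Workgroup IDs/counts that GPU processor-ID folding now recognizes.
%wg_id_x    = hal.interface.workgroup.id[0] : index
%wg_count_x = hal.interface.workgroup.count[0] : index
// A subspan binding that memref vectorization can now look through.
%buf = hal.interface.binding.subspan @io::@arg0[%c0] : memref<1024x1024xf32>
```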
This allows results of operations to be tied back to their
operands in storage but not in time. In-place operations can then
be defined on tensors that carry enough metadata to correctly form
streams, materialize HAL interfaces, and allocate buffers.

Example:
```mlir
%t = flow.dispatch @foo[...](%input) : (tensor<4xf32>) -> %input
```

This syntax also combines with the shape-carrying op interface
to make it possible to also indicate that an input and a result
share type and shape information:
```mlir
%t = flow.dispatch @foo[...](%input) : (tensor<?xf32>{%dim}) -> %input
```
which is effectively:
```mlir
%t = flow.dispatch @foo[...](%input) : (tensor<?xf32>{%dim}) -> tensor<?xf32>{%dim}
```
but with the extra bit that result 0 is tied to operand 0.

Here the result %t of the dispatch aliases the storage for %input,
making %input a read-write/mutable binding in the resulting HAL
executable. %t is a distinct SSA value from %input, though, and
represents the value of the storage backing %input after the
dispatch has completed. By keeping the SSA use-def chains correct
with respect to time, they remain meaningful for analysis, and
nothing at this level (or at the beginning of the HAL transformations)
needs to perform alias analysis, while still giving us all of the
information required to induce aliasing during later allocation
passes.
asaadaldien and others added 2 commits March 11, 2021 17:59
- Note the approximations don't rely on a finite-math assumption, so they are on by default.
@copybara-service copybara-service bot merged commit c9e5da3 into iree-org:google Mar 12, 2021
hanhanW pushed a commit to hanhanW/iree that referenced this pull request Mar 12, 2021
* c9e5da3 Merge pull request iree-org#5078 from ThomasRaoux:main-to-google
* 284e68c Synchronize submodules with LLVM at llvm/llvm-project@720a828
* c0ae7a2 Integrate LLVM at llvm/llvm-project@720a828
* 0aebce2 Merge pull request iree-org#5062 from ThomasRaoux:main-to-google
* 56e4fd1 Merge pull request iree-org#5060 from google:llvm-dependent-submodule-update
* 0f70af6 Integrate LLVM at llvm/llvm-project@4c973ae
* f156c24 Synchronize submodules with LLVM at llvm/llvm-project@4c973ae
* 894c758 Synchronize submodules with LLVM at llvm/llvm-project@df6d057