Use llvm-dialects as specification layer for Julia LLVM IR #52945

vchuravy · 2024-01-17T17:59:09Z

The LLVM IR we emit in codegen uses pseudo-intrinsics to represent the additional language-specific
semantics needed for the correct optimization of Julia code. Since Julia uses a precise GC we need to
track values in the generated code. We could do so early, but that would clutter the code quite a bit
and thus we decided to take a late-lowering approach. We represent enough semantics with our own LLVM
dialect so that at the end of the optimization pipeline we can legalize/lower our Julia LLVM dialect
to the general LLVM dialect that the backends can emit code for.

We thus far have an informal specification of this Julia LLVM dialect scattered across both codegen
and optimizations, and other producers like Enzyme code-generator. The https://github.com/GPUOpen-Drivers/llvm-dialects
project provides tools for using an approach similar to MLIR to specify a custom dialect on the LLVM substracte.

This PR is mostly meant to open up the discussion if we want this, but my goal for it is to make it
easier for producers like Enzyme (and other GPUCompiler) to emit our dialect correctly, as well
as unifying the definition across codegen and optimization passes, and having one place to document and specify
the behaviour of our dialect operations.

(Side-comment we have technically at least two dialects one produced by codegen and lowered by late-lowering and then a second between late-lowering and final-lowering)

gbaraldi · 2024-01-17T18:35:44Z

So tblgen generates a couple .inc files which have the definitions from the .td files. How does someone downstream get this? Do we ship the .inc files or something like this?

vchuravy · 2024-01-17T23:04:09Z

The generated headers we could ship, the generated cpp get compiled with the rest of the code. We don't check them into the source tree

vtjnash

The docs seem very sparse (https://github.com/GPUOpen-Drivers/llvm-dialects/blob/dev/docs/Constraints.md), but if it can equivalently encode our JuliaFunction descriptors, it seems useful enough to go with it

vchuravy added speculative Whether the change will be implemented is speculative compiler:codegen Generation of LLVM IR and native code compiler:llvm For issues that relate to LLVM labels Jan 17, 2024

vtjnash reviewed Jan 19, 2024

View reviewed changes

vchuravy mentioned this pull request Apr 15, 2024

[BOLT] Add builder JuliaPackaging/Yggdrasil#8391

Merged

vchuravy mentioned this pull request Oct 16, 2024

Support vectorization of the gc.loaded intrinsics #56188

Draft

vchuravy added 9 commits October 16, 2024 16:25

Start on dialect bootstrapping

6918a55

Wire up install step

8224b8a

Wire up build

c97ff55

direct build against our copy of LLVM

0fa0083

link againt llvm-dialects

29fcedc

make build step sensitive to LLVM config

edb088c

use dialect for first thing!

23ba531

update llvm-dialects

e2af2b4

WIP

d1f0801

vchuravy force-pushed the vc/llvm-dialects branch from d8b41bb to d1f0801 Compare October 16, 2024 18:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use llvm-dialects as specification layer for Julia LLVM IR #52945

Use llvm-dialects as specification layer for Julia LLVM IR #52945

vchuravy commented Jan 17, 2024

gbaraldi commented Jan 17, 2024

vchuravy commented Jan 17, 2024

vtjnash left a comment

Use llvm-dialects as specification layer for Julia LLVM IR #52945

Are you sure you want to change the base?

Use llvm-dialects as specification layer for Julia LLVM IR #52945

Conversation

vchuravy commented Jan 17, 2024

gbaraldi commented Jan 17, 2024

vchuravy commented Jan 17, 2024

vtjnash left a comment

Choose a reason for hiding this comment