First draft GPS conv layer #355
base: master
Conversation
init = glorot_uniform,
bias::Bool = true,
add_self_loops = true,
use_edge_weight = false)
This PR includes a lot of unrelated changes, please remove them.
gattn = DotProductAttention(ch)
convlayer = GNNChain(gconv, Dropout(0.5), LayerNorm(out))
attnlayer = GNNChain(gattn, Dropout(0.5), LayerNorm(out))
ffn = GNNChain(Dense(in => 2 * in, σ), Dropout(0.5), Dense(2 * in => in), Dropout(0.5))
Does the paper specify that we need dropout in the MLP?
Parallel(+, l.attnlayer, identity),
    ),
    l.ffn
)
We should avoid constructing a GNNChain in each forward pass. Also, the use of Parallel makes the sequence of operations hard to parse.
We should just have a plain sequence of transformations of the input here, like:
x1 = l.bn1(l.dropout1(l.conv(g, x)) .+ x)
x2 = l.bn2(l.dropout2(l.attn(x)) .+ x)
y = l.mlp(x1 + x2)  # not sure if we should also add a skip connection here
Notice the order of operations: it is different from what the layer is currently doing. The current implementation doesn't seem to follow eqs. (6)-(11) in the paper.
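For reference, a minimal sketch of what such a plain forward pass could look like, assuming the struct stores its sub-layers in fields named conv, attn, dropout1, dropout2, norm1, norm2, and mlp (these names are illustrative, not the PR's actual fields):

```julia
# Sketch of a GPSConv forward pass written as a plain sequence of operations,
# following the order suggested above. Field names are assumptions.
function (l::GPSConv)(g::GNNGraph, x::AbstractMatrix)
    # local message-passing branch: conv -> dropout -> residual -> norm
    xm = l.norm1(l.dropout1(l.conv(g, x)) .+ x)
    # global attention branch: attention -> dropout -> residual -> norm
    xt = l.norm2(l.dropout2(l.attn(x)) .+ x)
    # combine the two branches and apply the feed-forward block
    return l.mlp(xm .+ xt)
end
```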
Thanks, this is a nice start. A few comments:
Thanks for the comments. I'll be going over them.
This is only a first "mock" version of a `GPSConv` layer, to see if we would want it in the repo in that form.
- Adds a `DotProductAttention` layer that uses `NNlib.dot_product_attention()` (see the sketch below)
- Adds a `GPSConv` layer with `DotProductAttention` as the global attention layer
- Not sure about the `GNNChain()` implementation, whether it should stay where it is or move into the struct?
- `JuliaFormatter` got a bit too greedy and made some changes here and there, I can revert those of course
- Did not check the correctness of the implementation yet

Let me know what you think and I can adjust / keep going from here.
Close #351
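For context, a minimal sketch of how a self-attention wrapper around `NNlib.dot_product_attention` could look; the struct layout, the separate Q/K/V projections, and the single-graph handling are assumptions for illustration, not necessarily what this PR implements:

```julia
using Flux, NNlib

# Hypothetical self-attention layer built on NNlib.dot_product_attention.
# All field names and design choices here are illustrative assumptions.
struct DotProductAttention
    Wq::Dense
    Wk::Dense
    Wv::Dense
    nheads::Int
end

Flux.@functor DotProductAttention

DotProductAttention(dim::Int; nheads::Int = 1) =
    DotProductAttention(Dense(dim => dim), Dense(dim => dim), Dense(dim => dim), nheads)

function (l::DotProductAttention)(x::AbstractMatrix)
    # x has size (dim, num_nodes); add a singleton batch dimension so the
    # node axis is treated as the sequence axis by dot_product_attention.
    q = reshape(l.Wq(x), size(x, 1), size(x, 2), 1)
    k = reshape(l.Wk(x), size(x, 1), size(x, 2), 1)
    v = reshape(l.Wv(x), size(x, 1), size(x, 2), 1)
    y, _ = NNlib.dot_product_attention(q, k, v; nheads = l.nheads)
    return reshape(y, size(x))
end
```

Usage would then look something like `attn = DotProductAttention(64; nheads = 4); attn(x)` for node features `x` of size `(64, num_nodes)`.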