We want to be able to apply generic operations to parameters, such as weight norm, weight dropout, or L2 loss (see #59), in a unified and straightforward way.
When some modules hide their parameters inside the RETURNN layer (e.g. Linear), any such logic can become quite counter-intuitive, complicated, and potentially even buggy. I expect that this becomes much easier once we can see all parameters directly in the code (see e.g. the code behind torch.nn.utils.weight_norm, which is quite simple, but would be tricky if parameters were hidden in RETURNN layers).
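To illustrate how little code such generic parameter logic needs once parameters are directly accessible, here is a minimal sketch of the idea behind torch.nn.utils.weight_norm (simplified to a single scalar g instead of a per-output-dim norm; this is not the actual PyTorch implementation):

```python
import torch


def simple_weight_norm(module: torch.nn.Module, name: str = "weight"):
    """Reparametrize module.<name> as g * v / ||v||, recomputed before each forward."""
    weight = getattr(module, name)
    # Replace the original parameter by direction v and magnitude g.
    del module._parameters[name]
    module.register_parameter(name + "_v", torch.nn.Parameter(weight.detach().clone()))
    module.register_parameter(name + "_g", torch.nn.Parameter(weight.detach().norm().clone()))

    def recompute(mod, inputs):
        v = getattr(mod, name + "_v")
        g = getattr(mod, name + "_g")
        setattr(mod, name, v * (g / v.norm()))  # plain tensor, rebuilt from the two params

    module.register_forward_pre_hook(recompute)
    recompute(module, None)  # set the initial weight
    return module
```

This only works so easily because the weight is an ordinary, visible parameter of the module; with the parameter hidden inside a RETURNN layer, the same logic would have to reach into the layer's internals.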
There are actually not many such modules:
Linear
Conv
TransposedConv
BatchNorm
RelativePositionalEncoding
We also need to have a functional variant of the RecLayer (rwth-i6/returnn#817).
That's all. And they are all very simple to reimplement using pure functional modules, e.g. dot etc.
Specifically:
Linear: Use dot (see the sketch after this list)
Conv: Use the functional variant of ConvLayer
TransposedConv: Use the functional variant of TransposedConvLayer
BatchNorm: reimplement, maybe even more efficiently by wrapping the fused TF ops more directly
RelativePositionalEncoding: reimplement anyway, see the discussion in Transformer Modules #55
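As a rough illustration of the Linear case (plain TensorFlow here, not returnn-common code; the class and initialization are just assumptions for the sketch), the idea is that the module only owns explicit variables and the forward pass is a single functional op:

```python
import tensorflow as tf


class SimpleLinear(tf.Module):
    """Linear layer where the parameters are plain, visible tf.Variables."""

    def __init__(self, in_dim: int, out_dim: int):
        super().__init__()
        self.weight = tf.Variable(tf.random.normal([in_dim, out_dim], stddev=in_dim ** -0.5), name="weight")
        self.bias = tf.Variable(tf.zeros([out_dim]), name="bias")

    def __call__(self, x: tf.Tensor) -> tf.Tensor:
        # The forward pass is just the functional part ("dot" in the proposal above).
        return tf.matmul(x, self.weight) + self.bias
```

Conv, TransposedConv and BatchNorm would follow the same pattern, just with different functional ops.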
So then the only module which directly corresponds to a tf.Variable is the Variable module (or maybe rename it to Parameter, to be more consistent with PyTorch). We can also easily implement functions like parameters() and named_parameters() for modules, and then follow very similar, simple logic for things like weight norm etc. as in PyTorch.
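A rough sketch of how such a generic parameters() / named_parameters() could look (the Module / Parameter classes here are assumptions following the proposal, not existing returnn-common API):

```python
import tensorflow as tf


class Parameter:
    """Thin wrapper around a single tf.Variable -- the only place variables live."""

    def __init__(self, shape):
        self.var = tf.Variable(tf.zeros(shape))


class Module:
    def named_parameters(self, prefix=""):
        """Yield (name, Parameter) pairs by recursing over attributes."""
        for name, value in vars(self).items():
            sub = f"{prefix}.{name}" if prefix else name
            if isinstance(value, Parameter):
                yield sub, value
            elif isinstance(value, Module):
                yield from value.named_parameters(prefix=sub)

    def parameters(self):
        for _name, param in self.named_parameters():
            yield param
```

Generic things like an L2 loss then become a one-liner over parameters(), e.g. sum(tf.reduce_sum(p.var ** 2) for p in model.parameters()), and weight norm or weight dropout can follow the same simple pattern as in PyTorch.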