
RNNs redesign #2500

Open · wants to merge 11 commits into base: master
Conversation

CarloLucibello (Member) commented Oct 14, 2024

A complete rework of our recurrent layers, making them more similar to their PyTorch counterparts.
This is in line with the proposal in #1365 and should allow hooking into the cuDNN machinery (future PR).
Hopefully, this puts an end to the endless stream of trouble the recurrent layers have been.

  • Recur is no more. Mutating its internal state was a source of problems for AD (explicit differentiation for RNN gives wrong results, #2185).
  • Now RNNCell is exported and takes care of the minimal recursion step, i.e. a single time step (see the usage sketch after this list):
    • has forward pass cell(x, h)
    • x can be of size in or in x batch_size
    • h can be of size out or out x batch_size
    • returns hnew of size out or out x batch_size
  • RNN instead takes in a (batched) sequence and a (batched) hidden state and returns the hidden states for the whole sequence:
    • has forward pass rnn(x, h)
    • x can be of size in x len or in x len x batch_size
    • h can be of size out or out x batch_size
    • returns hnew of size out x len or out x len x batch_size
  • LSTM and GRU are similarly changed.
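
A minimal usage sketch of the two call forms above. The in => out constructor style is an assumption based on the convention of other Flux layers such as Dense; the array sizes and the zero default for an omitted hidden state follow the description in this PR.

```julia
using Flux

in_dim, out_dim, len, batch = 3, 5, 7, 4

# Cell-level API: one time step at a time.
cell = RNNCell(in_dim => out_dim)        # constructor form assumed to mirror Dense(in => out)
x_t  = rand(Float32, in_dim, batch)      # input for a single step, size in x batch_size
h    = zeros(Float32, out_dim, batch)    # hidden state, size out x batch_size
h    = cell(x_t, h)                      # new hidden state, size out x batch_size

# Sequence-level API: the layer iterates over the time dimension internally.
rnn = RNN(in_dim => out_dim)
x   = rand(Float32, in_dim, len, batch)  # whole sequence, size in x len x batch_size
h0  = zeros(Float32, out_dim, batch)
y   = rnn(x, h0)                         # hidden states for every step, size out x len x batch_size
y2  = rnn(x)                             # h omitted: initial state assumed to be zero (see checklist)
```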

Close #2185, close #2341, close #2258, close #1547, close #807, close #1329

Related to #1678

PR Checklist

  • CPU tests
  • GPU tests
  • If the hidden state is not given as input, it is assumed to be zero
  • Port LSTM and GRU
  • Entry in NEWS.md
  • Remove reset! (the state is now passed explicitly; see the sketch after this list)
  • Docstrings
  • Benchmarks
  • Use cuDNN (future PR)
  • Implement the num_layers argument for stacked RNNs (future PR)
  • Revisit the whole documentation (future PR)
  • Add dropout (future PR)
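
Since the state is now passed in and returned explicitly rather than mutated inside a Recur wrapper, there is nothing to reset! between sequences. The sketch below illustrates this with a hypothetical unroll helper (not part of this PR) that loops an RNNCell over the time dimension, which is conceptually what the sequence-level RNN layer does.

```julia
using Flux

# Hypothetical helper, not part of this PR: explicitly unroll an RNNCell over
# the time dimension. The state is threaded through as a plain value, so there
# is no hidden mutation and no reset! is needed between sequences.
function unroll(cell, x::AbstractArray{<:Any,3}, h)
    hs = map(1:size(x, 2)) do t
        h = cell(x[:, t, :], h)   # one step: in x batch_size -> out x batch_size
        h
    end
    return stack(hs; dims = 2)    # out x len x batch_size, like the RNN layer's output
end

cell = RNNCell(3 => 5)            # constructor form assumed, as in the sketch above
x    = rand(Float32, 3, 7, 4)     # in x len x batch_size
h0   = zeros(Float32, 5, 4)       # zero initial state, matching the default when h is omitted
y    = unroll(cell, x, h0)
size(y)                           # (5, 7, 4)
```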

CarloLucibello changed the title from RNN redesign to RNNs redesign on Oct 17, 2024