Adaptive Computation Time for Recurrent Neural Networks

TLDR; The paper presents Adaptive Computation Time (ACT), an algorithm that allows RNNs to adaptively decide how much computation to expend per time step, also called "pondering". To prevent the network from computng indefinitely an extra term that encourages shorter computation is added to the cost. The architecture is fully differentiable and applicable to any type of RNN (e.g. LSTMs). The authors evaluate ACT on tasks of Parity, Logic, Addition, Sorting and character prediction. An interesting observation is that the numbet of pondering steps seems to predict "boundaries" in the data.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

act-rnn.md

act-rnn.md

Adaptive Computation Time for Recurrent Neural Networks

Files

act-rnn.md

Latest commit

History

act-rnn.md

File metadata and controls

Adaptive Computation Time for Recurrent Neural Networks