Memory Augmented Neural Networks with Wormhole Connections

01/30/2017
by   Caglar Gulcehre, et al.
0

Recent empirical results on long-term dependency tasks have shown that neural networks augmented with an external memory can learn the long-term dependency tasks more easily and achieve better generalization than vanilla recurrent neural networks (RNN). We suggest that memory augmented neural networks can reduce the effects of vanishing gradients by creating shortcut (or wormhole) connections. Based on this observation, we propose a novel memory augmented neural network model called TARDIS (Temporal Automatic Relation Discovery in Sequences). The controller of TARDIS can store a selective set of embeddings of its own previous hidden states into an external memory and revisit them as and when needed. For TARDIS, memory acts as a storage for wormhole connections to the past to propagate the gradients more effectively and it helps to learn the temporal dependencies. The memory structure of TARDIS has similarities to both Neural Turing Machines (NTM) and Dynamic Neural Turing Machines (D-NTM), but both read and write operations of TARDIS are simpler and more efficient. We use discrete addressing for read/write operations which helps to substantially to reduce the vanishing gradient problem with very long sequences. Read and write operations in TARDIS are tied with a heuristic once the memory becomes full, and this makes the learning problem simpler when compared to NTM or D-NTM type of architectures. We provide a detailed analysis on the gradient propagation in general for MANNs. We evaluate our models on different long-term dependency tasks and report competitive results in all of them.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2018

Partially Non-Recurrent Controllers for Memory-Augmented Neural Networks

Memory-Augmented Neural Networks (MANNs) are a class of neural networks ...
research
06/13/2019

Multigrid Neural Memory

We introduce a novel architecture that integrates a large addressable me...
research
02/11/2018

Dual Control Memory Augmented Neural Networks for Treatment Recommendations

Machine-assisted treatment recommendations hold a promise to reduce phys...
research
04/04/2019

Learning Numeracy: Binary Arithmetic with Neural Turing Machines

One of the main problems encountered so far with recurrent neural networ...
research
11/16/2022

Token Turing Machines

We propose Token Turing Machines (TTM), a sequential, autoregressive Tra...
research
07/18/2018

General Value Function Networks

In this paper we show that restricting the representation-layer of a Rec...
research
06/30/2016

Dynamic Neural Turing Machine with Soft and Hard Addressing Schemes

We extend neural Turing machine (NTM) model into a dynamic neural Turing...

Please sign up or login with your details

Forgot password? Click here to reset