Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

10/27/2016
by Jack W Rae, et al.

Neural networks augmented with external memory can learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows, which limits their applicability to real-world domains. Here, we present an end-to-end differentiable memory access scheme, which we call Sparse Access Memory (SAM), that retains the representational power of the original approaches whilst training efficiently with very large memories. We show that SAM achieves asymptotic lower bounds in space and time complexity, and find that an implementation runs 1,000× faster and with 3,000× less physical memory than non-sparse models. SAM learns with comparable data efficiency to existing models on a range of synthetic tasks and one-shot Omniglot character recognition, and can scale to tasks requiring 100,000s of time steps and memories. We also show how our approach can be adapted for models that maintain temporal associations between memories, as with the recently introduced Differentiable Neural Computer.
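To make the idea of a sparse read concrete, the following is a minimal sketch, not the authors' implementation. It assumes content-based addressing by cosine similarity, keeps only the k best-matching memory rows, and applies a softmax over those rows alone. The function name `sparse_read`, the parameter `k`, and the exhaustive similarity scan are all assumptions made for illustration. A sub-linear read would need some nearest-neighbor index in place of the scan, which is omitted here.

```python
import math


def sparse_read(memory, query, k=4):
    """Read from only the k rows of `memory` most similar to `query`.

    Hypothetical helper for illustration; assumes a non-empty memory
    whose rows have the same length as `query`.
    """
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm > 0 else 0.0

    # Assumption: exhaustive O(N) scan for clarity. A scalable version
    # would query a nearest-neighbor index here instead.
    scores = [cosine(row, query) for row in memory]
    top = sorted(range(len(memory)), key=lambda i: scores[i], reverse=True)[:k]

    # Softmax restricted to the selected rows; every other row gets weight 0.
    peak = max(scores[i] for i in top)
    exps = [math.exp(scores[i] - peak) for i in top]
    total = sum(exps)
    weights = {i: e / total for i, e in zip(top, exps)}

    # The read vector is a weighted sum over at most k rows.
    read = [sum(w * memory[i][j] for i, w in weights.items())
            for j in range(len(query))]
    return read, weights


memory = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
read, weights = sparse_read(memory, [1.0, 0.0], k=2)
```

Because only k weights are non-zero, the weighted sum (and any gradient flowing through it) touches k rows rather than the whole memory, which is the property the abstract's space and time claims rest on.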


