Memory and attention in deep learning

07/03/2021
by   Hung Le, et al.
0

Intelligence necessitates memory. Without memory, humans fail to perform various nontrivial tasks such as reading novels, playing games or solving maths. As the ultimate goal of machine learning is to derive intelligent systems that learn and act automatically just like human, memory construction for machine is inevitable. Artificial neural networks model neurons and synapses in the brain by interconnecting computational units via weights, which is a typical class of machine learning algorithms that resembles memory structure. Their descendants with more complicated modeling techniques (a.k.a deep learning) have been successfully applied to many practical problems and demonstrated the importance of memory in the learning process of machinery systems. Recent progresses on modeling memory in deep learning have revolved around external memory constructions, which are highly inspired by computational Turing models and biological neuronal systems. Attention mechanisms are derived to support acquisition and retention operations on the external memory. Despite the lack of theoretical foundations, these approaches have shown promises to help machinery systems reach a higher level of intelligence. The aim of this thesis is to advance the understanding on memory and attention in deep learning. Its contributions include: (i) presenting a collection of taxonomies for memory, (ii) constructing new memory-augmented neural networks (MANNs) that support multiple control and memory units, (iii) introducing variability via memory in sequential generative models, (iv) searching for optimal writing operations to maximise the memorisation capacity in slot-based memory networks, and (v) simulating the Universal Turing Machine via Neural Stored-program Memory-a new kind of external memory for neural networks.

READ FULL TEXT
research
05/25/2019

Neural Stored-program Memory

Neural networks powered with external memory simulate computer behaviors...
research
04/10/2019

A review on Neural Turing Machine

One of the major objectives of Artificial Intelligence is to design lear...
research
05/29/2021

Ten Quick Tips for Deep Learning in Biology

Machine learning is a modern approach to problem-solving and task automa...
research
01/05/2019

Learning to Remember More with Less Memorization

Memory-augmented neural networks consisting of a neural controller and a...
research
09/24/2020

Neurocoder: Learning General-Purpose Computation Using Stored Neural Programs

Artificial Neural Networks are uniquely adroit at machine learning by pr...
research
11/15/2018

Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks

Deep neural networks have shown superior performance in many regimes to ...
research
11/23/2018

Learning Attractor Dynamics for Generative Memory

A central challenge faced by memory systems is the robust retrieval of a...

Please sign up or login with your details

Forgot password? Click here to reset