Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling

09/03/2020
by   Tsendsuren Munkhdalai, et al.
0

Training a deep neural network requires a large amount of single-task data and involves a long time-consuming optimization phase. This is not scalable to complex, realistic environments with new unexpected changes. Humans can perform fast incremental learning on the fly and memory systems in the brain play a critical role. We introduce Sparse Meta Networks – a meta-learning approach to learn online sequential adaptation algorithms for deep neural networks, by using deep neural networks. We augment a deep neural network with a layer-specific fast-weight memory. The fast-weights are generated sparsely at each time step and accumulated incrementally through time providing a useful inductive bias for online continual adaptation. We demonstrate strong performance on a variety of sequential adaptation scenarios, from a simple online reinforcement learning to a large scale adaptive language modelling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/18/2018

Deep Online Learning via Meta-Learning: Continual Adaptation for Model-Based RL

Humans and animals can learn complex predictive models that allow them t...
research
02/28/2018

Memory-based Parameter Adaptation

Deep neural networks have excelled on a wide range of problems, from vis...
research
07/05/2017

Labeled Memory Networks for Online Model Adaptation

Augmenting a neural network with memory that can grow without growing th...
research
11/16/2020

Learning Associative Inference Using Fast Weight Memory

Humans can quickly associate stimuli to solve problems in novel contexts...
research
07/27/2023

A Sparse Quantized Hopfield Network for Online-Continual Memory

An important difference between brains and deep neural networks is the w...
research
05/10/2020

An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning

DQN (Deep Q-Network) is a method to perform Q-learning for reinforcement...
research
09/17/2018

The Fast and the Flexible: training neural networks to learn to follow instructions from small data

Learning to follow human instructions is a challenging task because whil...

Please sign up or login with your details

Forgot password? Click here to reset