Labeled Memory Networks for Online Model Adaptation

07/05/2017
by   Shiv Shankar, et al.
0

Augmenting a neural network with memory that can grow without growing the number of trained parameters is a recent powerful concept with many exciting applications. We propose a design of memory augmented neural networks (MANNs) called Labeled Memory Networks (LMNs) suited for tasks requiring online adaptation in classification models. LMNs organize the memory with classes as the primary key.The memory acts as a second boosted stage following a regular neural network thereby allowing the memory and the primary network to play complementary roles. Unlike existing MANNs that write to memory for every instance and use LRU based memory replacement, LMNs write only for instances with non-zero loss and use label-based memory replacement. We demonstrate significant accuracy gains on various tasks including word-modelling and few-shot learning. In this paper, we establish their potential in online adapting a batch trained neural network to domain-relevant labeled data at deployment time. We show that LMNs are better than other MANNs designed for meta-learning. We also found them to be more accurate and faster than state-of-the-art methods of retuning model parameters for adapting to domain-specific labeled data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2023

Task-Equivariant Graph Few-shot Learning

Although Graph Neural Networks (GNNs) have been successful in node class...
research
09/03/2020

Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling

Training a deep neural network requires a large amount of single-task da...
research
10/19/2017

Meta-Learning via Feature-Label Memory Network

Deep learning typically requires training a very capable architecture us...
research
06/10/2019

Meta-Learning Neural Bloom Filters

There has been a recent trend in training neural networks to replace dat...
research
05/10/2019

Prototype Propagation Networks (PPN) for Weakly-supervised Few-shot Learning on Category Graph

A variety of machine learning applications expect to achieve rapid learn...
research
10/25/2019

Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection

Event detection (ED), a sub-task of event extraction, involves identifyi...
research
03/27/2018

Fast Parametric Learning with Activation Memorization

Neural networks trained with backpropagation often struggle to identify ...

Please sign up or login with your details

Forgot password? Click here to reset