Learning to Control Latent Representations for Few-Shot Learning of Named Entities

11/19/2019
by   Omar U. Florez, et al.
0

Humans excel in continuously learning with small data without forgetting how to solve old problems. However, neural networks require large datasets to compute latent representations across different tasks while minimizing a loss function. For example, a natural language understanding (NLU) system will often deal with emerging entities during its deployment as interactions with users in realistic scenarios will generate new and infrequent names, events, and locations. Here, we address this scenario by introducing an RL trainable controller that disentangles the representation learning of a neural encoder from its memory management role. Our proposed solution is straightforward and simple: we train a controller to execute an optimal sequence of reading and writing operations on an external memory with the goal of leveraging diverse activations from the past and provide accurate predictions. Our approach is named Learning to Control (LTC) and allows few-shot learning with two degrees of memory plasticity. We experimentally show that our system obtains accurate results for few-shot learning of entity recognition in the Stanford Task-Oriented Dialogue dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2022

Label Semantics for Few Shot Named Entity Recognition

We study the problem of few shot learning for named entity recognition. ...
research
05/03/2023

Causal Interventions-based Few-Shot Named Entity Recognition

Few-shot named entity recognition (NER) systems aims at recognizing new ...
research
09/17/2020

FewJoint: A Few-shot Learning Benchmark for Joint Language Understanding

Few-learn learning (FSL) is one of the key future steps in machine learn...
research
11/15/2021

Zero-Shot Learning in Named-Entity Recognition with External Knowledge

A significant shortcoming of current state-of-the-art (SOTA) named-entit...
research
10/21/2022

AROS: Affordance Recognition with One-Shot Human Stances

We present AROS, a one-shot learning approach that uses an explicit repr...
research
05/05/2021

MCGNet: Partial Multi-view Few-shot Learning via Meta-alignment and Context Gated-aggregation

In this paper, we propose a new challenging task named as partial multi-...

Please sign up or login with your details

Forgot password? Click here to reset