A Closer Look At Feature Space Data Augmentation For Few-Shot Intent Classification

10/09/2019
by   Varun Kumar, et al.
0

New conversation topics and functionalities are constantly being added to conversational AI agents like Amazon Alexa and Apple Siri. As data collection and annotation is not scalable and is often costly, only a handful of examples for the new functionalities are available, which results in poor generalization performance. We formulate it as a Few-Shot Integration (FSI) problem where a few examples are used to introduce a new intent. In this paper, we study six feature space data augmentation methods to improve classification performance in FSI setting in combination with both supervised and unsupervised representation learning methods such as BERT. Through realistic experiments on two public conversational datasets, SNIPS, and the Facebook Dialog corpus, we show that data augmentation in feature space provides an effective way to improve intent classification performance in few-shot setting beyond traditional transfer learning approaches. In particular, we show that (a) upsampling in latent space is a competitive baseline for feature space augmentation (b) adding the difference between two examples to a new example is a simple yet effective data augmentation method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2021

Few-shot learning via tensor hallucination

Few-shot classification addresses the challenge of classifying examples ...
research
01/28/2021

ProtoDA: Efficient Transfer Learning for Few-Shot Intent Classification

Practical sequence classification tasks in natural language processing o...
research
02/02/2021

Neural Data Augmentation via Example Extrapolation

In many applications of machine learning, certain categories of examples...
research
02/17/2017

Dataset Augmentation in Feature Space

Dataset augmentation, the practice of applying a wide array of domain-sp...
research
09/17/2021

Semi-Supervised Few-Shot Intent Classification and Slot Filling

Intent classification (IC) and slot filling (SF) are two fundamental tas...
research
12/08/2016

AGA: Attribute Guided Augmentation

We consider the problem of data augmentation, i.e., generating artificia...
research
03/29/2021

AlignMix: Improving representation by interpolating aligned features

Mixup is a powerful data augmentation method that interpolates between t...

Please sign up or login with your details

Forgot password? Click here to reset