Data Augmentation for Intent Classification

06/12/2022
by   Derek Chen, et al.
0

Training accurate intent classifiers requires labeled data, which can be costly to obtain. Data augmentation methods may ameliorate this issue, but the quality of the generated data varies significantly across techniques. We study the process of systematically producing pseudo-labeled data given a small seed set using a wide variety of data augmentation techniques, including mixing methods together. We find that while certain methods dramatically improve qualitative and quantitative performance, other methods have minimal or even negative impact. We also analyze key considerations when implementing data augmentation methods in production.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2020

An Empirical Study of Contextual Data Augmentation for Japanese Zero Anaphora Resolution

One critical issue of zero anaphora resolution (ZAR) is the scarcity of ...
research
09/25/2020

A little goes a long way: Improving toxic language classification despite data scarcity

Detection of some types of toxic language is hampered by extreme scarcit...
research
09/03/2022

Data Augmentation for Deep Receivers

Deep neural networks (DNNs) allow digital receivers to learn to operate ...
research
05/14/2020

Data Augmentation for Deep Candlestick Learner

To successfully build a deep learning model, it will need a large amount...
research
01/12/2021

Data augmentation and feature selection for automatic model recommendation in computational physics

Classification algorithms have recently found applications in computatio...
research
04/04/2019

HoloDetect: Few-Shot Learning for Error Detection

We introduce a few-shot learning framework for error detection. We show ...
research
03/03/2021

Bulk Production Augmentation Towards Explainable Melanoma Diagnosis

Although highly accurate automated diagnostic techniques for melanoma ha...

Please sign up or login with your details

Forgot password? Click here to reset