Efficient Semi-Supervised Learning for Natural Language Understanding by Optimizing Diversity

10/09/2019
by   Eunah Cho, et al.
0

Expanding new functionalities efficiently is an ongoing challenge for single-turn task-oriented dialogue systems. In this work, we explore functionality-specific semi-supervised learning via self-training. We consider methods that augment training data automatically from unlabeled data sets in a functionality-targeted manner. In addition, we examine multiple techniques for efficient selection of augmented utterances to reduce training time and increase diversity. First, we consider paraphrase detection methods that attempt to find utterance variants of labeled training data with good coverage. Second, we explore sub-modular optimization based on n-grams features for utterance selection. Experiments show that functionality-specific self-training is very effective for improving system performance. In addition, methods optimizing diversity can reduce training data in many cases to 50 impact on performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2020

Self-training Improves Pre-training for Natural Language Understanding

Unsupervised pre-training has led to much recent progress in natural lan...
research
04/27/2021

Semi-supervised Interactive Intent Labeling

Building the Natural Language Understanding (NLU) modules of task-orient...
research
03/29/2021

Industry Scale Semi-Supervised Learning for Natural Language Understanding

This paper presents a production Semi-Supervised Learning (SSL) pipeline...
research
05/16/2023

Semi-Supervised Object Detection for Sorghum Panicles in UAV Imagery

The sorghum panicle is an important trait related to grain yield and pla...
research
05/28/2018

UG18 at SemEval-2018 Task 1: Generating Additional Training Data for Predicting Emotion Intensity in Spanish

The present study describes our submission to SemEval 2018 Task 1: Affec...
research
04/12/2021

Learning from Executions for Semantic Parsing

Semantic parsing aims at translating natural language (NL) utterances on...
research
05/19/2020

Self-Updating Models with Error Remediation

Many environments currently employ machine learning models for data proc...

Please sign up or login with your details

Forgot password? Click here to reset