Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models

08/19/2021
by   Haitao Lin, et al.
0

Spoken Language Understanding (SLU) is one essential step in building a dialogue system. Due to the expensive cost of obtaining the labeled data, SLU suffers from the data scarcity problem. Therefore, in this paper, we focus on data augmentation for slot filling task in SLU. To achieve that, we aim at generating more diverse data based on existing data. Specifically, we try to exploit the latent language knowledge from pretrained language models by finetuning them. We propose two strategies for finetuning process: value-based and context-based augmentation. Experimental results on two public SLU datasets have shown that compared with existing data augmentation methods, our proposed method can generate more diverse sentences and significantly improve the performance on SLU.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2020

Data Augmentation for Spoken Language Understanding via Pretrained Models

The training of spoken language understanding (SLU) models often faces t...
research
04/09/2019

A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data

Spoken language understanding (SLU) systems can be trained on two types ...
research
12/13/2020

C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling

Slot filling, a fundamental module of spoken language understanding, oft...
research
09/07/2018

Data Augmentation for Spoken Language Understanding via Joint Variational Generation

Data scarcity is one of the main obstacles of domain adaptation in spoke...
research
01/05/2023

HIT-SCIR at MMNLU-22: Consistency Regularization for Multilingual Spoken Language Understanding

Multilingual spoken language understanding (SLU) consists of two sub-tas...
research
12/21/2020

Pattern-aware Data Augmentation for Query Rewriting in Voice Assistant Systems

Query rewriting (QR) systems are widely used to reduce the friction caus...
research
09/06/2022

Entity Aware Syntax Tree Based Data Augmentation for Natural Language Understanding

Understanding the intention of the users and recognizing the semantic en...

Please sign up or login with your details

Forgot password? Click here to reset