Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems

11/03/2020
by   Stéphane d'Ascoli, et al.
0

Scarcity of training data for task-oriented dialogue systems is a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to rely on automatic text generation which, although less accurate than human supervision, has the advantage of being cheap and fast. Our contribution is twofold. First we show how to optimally train and control the generation of intent-specific sentences using a conditional variational autoencoder. Then we introduce a new protocol called query transfer that allows to leverage a large unlabelled dataset, possibly containing irrelevant queries, to extract relevant information. Comparison with two different baselines shows that this method, in the appropriate regime, consistently improves the diversity of the generated queries without compromising their quality. We also demonstrate the effectiveness of our generation method as a data augmentation technique for language modelling tasks.

READ FULL TEXT
research
11/09/2019

Conditioned Query Generation for Task-Oriented Dialogue Systems

Scarcity of training data for task-oriented dialogue systems is a well k...
research
04/30/2020

Control, Generate, Augment: A Scalable Framework for Multi-Attribute Text Generation

In this work, we present a text generation approach with multi-attribute...
research
09/02/2021

ConQX: Semantic Expansion of Spoken Queries for Intent Detection based on Conditioned Text Generation

Intent detection of spoken queries is a challenging task due to their no...
research
12/20/2018

Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

Cross-domain natural language generation (NLG) is still a difficult task...
research
06/11/2021

Local Explanation of Dialogue Response Generation

In comparison to the interpretation of classification models, the explan...
research
10/26/2019

ViGGO: A Video Game Corpus for Data-To-Text Generation in Open-Domain Conversation

The uptake of deep learning in natural language generation (NLG) led to ...
research
10/04/2019

Controlled Text Generation for Data Augmentation in Intelligent Artificial Agents

Data availability is a bottleneck during early stages of development of ...

Please sign up or login with your details

Forgot password? Click here to reset