Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems

by Stéphane d'Ascoli, et al.

Scarcity of training data for task-oriented dialogue systems is a well-known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative is to rely on automatic text generation which, although less accurate than human supervision, is cheap and fast. Our contribution is twofold. First, we show how to optimally train and control the generation of intent-specific sentences using a conditional variational autoencoder. Second, we introduce a new protocol called query transfer, which makes it possible to leverage a large unlabelled dataset, possibly containing irrelevant queries, to extract relevant information. Comparison with two different baselines shows that this method, in the appropriate regime, consistently improves the diversity of the generated queries without compromising their quality. We also demonstrate the effectiveness of our generation method as a data augmentation technique for language modelling tasks.

