Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

07/12/2019
by   Paweł Budzianowski, et al.
0

Data scarcity is a long-standing and crucial challenge that hinders quick development of task-oriented dialogue systems across multiple domains: task-oriented dialogue models are expected to learn grammar, syntax, dialogue reasoning, decision making, and language generation from absurdly small amounts of task-specific data. In this paper, we demonstrate that recent progress in language modeling pre-training and transfer learning shows promise to overcome this problem. We propose a task-oriented dialogue model that operates solely on text input: it effectively bypasses explicit policy and language generation modules. Building on top of the TransferTransfo framework (Wolf et al., 2019) and generative model pre-training (Radford et al., 2019), we validate the approach on complex multi-domain task-oriented dialogues from the MultiWOZ dataset. Our automatic and human evaluations show that the proposed model is on par with a strong task-specific neural baseline. In the long run, our approach holds promise to mitigate the data scarcity problem, and to support the construction of more engaging and more eloquent task-oriented conversational agents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/14/2020

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems

Task-oriented dialogue systems use four connected modules, namely, Natur...
research
04/22/2021

A Short Survey of Pre-trained Language Models for Conversational AI-A NewAge in NLP

Building a dialogue system that can communicate naturally with humans is...
research
07/13/2023

Agreement Tracking for Multi-Issue Negotiation Dialogues

Automated negotiation support systems aim to help human negotiators reac...
research
05/20/2022

Robust Task-Oriented Dialogue Generation with Contrastive Pre-training and Adversarial Filtering

Data artifacts incentivize machine learning models to learn non-transfer...
research
11/30/2022

ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format

Diverse data formats and ontologies of task-oriented dialogue (TOD) data...
research
06/21/2023

Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling

Enhancing AI systems with efficient communication skills that align with...
research
06/06/2023

Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction

Task-oriented dialogues often require agents to enact complex, multi-ste...

Please sign up or login with your details

Forgot password? Click here to reset