Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture

10/13/2022
by Hong Liu, et al.

Recently, there has been progress in supervised fine-tuning of pretrained GPT-2 to build end-to-end task-oriented dialog (TOD) systems. However, online reinforcement learning of a GPT-2 based dialog system (DS), together with an end-to-end user simulator (US), has not yet been explored. Moreover, a drawback of existing GPT-2 based TOD systems is that they mostly take the whole dialog history as input, which is inefficient in both memory and compute. In this paper, we first propose Simplified Generative Architectures (SGA) for the DS and the US respectively, both based on GPT-2 but using a shortened history. Then, we successfully develop a Jointly Reinforced US and DS, called SGA-JRUD. Our DS with the proposed SGA, when trained with supervision only, achieves state-of-the-art performance on MultiWOZ2.1 and is more compute-efficient in both training and generation. Extensive experiments on MultiWOZ2.1 further show the superiority of SGA-JRUD in both offline and online evaluations.
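To make the efficiency argument concrete, the following is a minimal sketch of how the per-turn input length grows when the whole dialog history is fed to the model, compared with a shortened history. The abstract does not say what SGA keeps in its shortened history, so the fixed window of recent turns used here, the function names, and the token counts are all hypothetical stand-ins chosen only for illustration.

```python
from typing import List


def full_history_inputs(turn_lengths: List[int]) -> List[int]:
    """Input length at each turn when all previous turns are kept."""
    lengths = []
    running = 0
    for n in turn_lengths:
        running += n
        lengths.append(running)
    return lengths


def shortened_history_inputs(turn_lengths: List[int], window: int = 2) -> List[int]:
    """Input length at each turn when only the last `window` turns are kept.

    The window is an assumption for illustration, not the SGA definition.
    """
    return [
        sum(turn_lengths[max(0, i - window + 1): i + 1])
        for i in range(len(turn_lengths))
    ]


# Hypothetical dialog: 8 turns of 40 tokens each.
turns = [40] * 8
full = full_history_inputs(turns)
short = shortened_history_inputs(turns, window=2)

print("full history per turn:     ", full)
print("shortened history per turn:", short)
print("total tokens processed:", sum(full), "vs", sum(short))
```

Under these assumptions the full-history input grows linearly with the turn index, so the total number of tokens processed over a dialog grows quadratically in the number of turns, while the shortened input stays bounded by the window size.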

research
04/13/2022

Revisiting Markovian Generative Architectures for Efficient Task-Oriented Dialog Systems

Recently, Transformer based pretrained language models (PLMs), such as G...
research
10/17/2022

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems

Building user simulators (USs) for reinforcement learning (RL) of task-o...
research
04/28/2018

Sentiment Adaptive End-to-End Dialog Systems

End-to-end learning framework is useful for building dialog systems for ...
research
11/12/2018

Learning Personalized End-to-End Goal-Oriented Dialog

Most existing works on dialog systems only consider conversation content...
research
04/23/2018

Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems

End-to-end task-oriented dialog systems usually suffer from the challeng...
research
11/30/2022

Reinforced Language Modeling for End-to-End Task Oriented Dialog

In task-oriented dialogs such as MultiWoZ (Budzianowski et al., 2018), a...
research
09/15/2022

UBARv2: Towards Mitigating Exposure Bias in Task-Oriented Dialogs

This paper studies the exposure bias problem in task-oriented dialog sys...
