Adversarial learning of neural user simulators for dialogue policy optimisation

06/01/2023
by   Simon Keizer, et al.
0

Reinforcement learning based dialogue policies are typically trained in interaction with a user simulator. To obtain an effective and robust policy, this simulator should generate user behaviour that is both realistic and varied. Current data-driven simulators are trained to accurately model the user behaviour in a dialogue corpus. We propose an alternative method using adversarial learning, with the aim to simulate realistic user behaviour with more variation. We train and evaluate several simulators on a corpus of restaurant search dialogues, and then use them to train dialogue system policies. In policy cross-evaluation experiments we demonstrate that an adversarially trained simulator produces policies with 8.3 than those trained with a maximum likelihood simulator. Subjective results from a crowd-sourced dialogue system user evaluation confirm the effectiveness of adversarially training user simulators.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/17/2018

Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems

User Simulators are one of the major tools that enable offline training ...
research
09/10/2019

A Corpus-free State2Seq User Simulator for Task-oriented Dialogue

Recent reinforcement learning algorithms for task-oriented dialogue syst...
research
04/02/2022

Metaphorical User Simulators for Evaluating Task-oriented Dialogue Systems

Task-oriented dialogue systems (TDSs) are assessed mainly in an offline ...
research
06/16/2021

Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

Dialogue policy optimisation via reinforcement learning requires a large...
research
04/14/2022

Dialogue Strategy Adaptation to New Action Sets Using Multi-dimensional Modelling

A major bottleneck for building statistical spoken dialogue systems for ...
research
06/30/2016

A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems

User simulation is essential for generating enough data to train a stati...
research
06/02/2023

EmoUS: Simulating User Emotions in Task-Oriented Dialogues

Existing user simulators (USs) for task-oriented dialogue systems only m...

Please sign up or login with your details

Forgot password? Click here to reset