How to Build User Simulators to Train RL-based Dialog Systems

09/03/2019
by   Weiyan Shi, et al.
0

User simulators are essential for training reinforcement learning (RL) based dialog models. The performance of the simulator directly impacts the RL policy. However, building a good user simulator that models real user behaviors is challenging. We propose a method of standardizing user simulator building that can be used by the community to compare dialog system quality using the same set of user simulators fairly. We present implementations of six user simulators trained with different dialog planning and generation methods. We then calculate a set of automatic metrics to evaluate the quality of these simulators both directly and indirectly. We also ask human users to assess the simulators directly and indirectly by rating the simulated dialogs and interacting with the trained systems. This paper presents a comprehensive evaluation framework for user simulator study and provides a better understanding of the pros and cons of different user simulators, as well as their impacts on the trained systems.

READ FULL TEXT

page 12

page 13

research
09/18/2017

Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models

In this paper, we present a deep reinforcement learning (RL) framework f...
research
10/17/2022

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems

Building user simulators (USs) for reinforcement learning (RL) of task-o...
research
03/13/2023

Multimodal Reinforcement Learning for Robots Collaborating with Humans

Robot assistants for older adults and people with disabilities need to i...
research
09/03/2019

CMU GetGoing: An Understandable and Memorable Dialog System for Seniors

Voice-based technologies are typically developed for the average user, a...
research
04/24/2023

Development of a Trust-Aware User Simulator for Statistical Proactive Dialog Modeling in Human-AI Teams

The concept of a Human-AI team has gained increasing attention in recent...
research
04/08/2020

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Many studies have applied reinforcement learning to train a dialog polic...
research
09/05/2023

Dialog Action-Aware Transformer for Dialog Policy Learning

Recent works usually address Dialog policy learning DPL by training a re...

Please sign up or login with your details

Forgot password? Click here to reset