Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play

09/20/2021
by Arkady Arkhangorodsky, et al.

Task-oriented dialog systems are often trained on human/human dialogs, such as those collected through Wizard-of-Oz interfaces. However, human/human corpora are frequently too small for supervised training to be effective. This paper investigates two approaches to training agent-bots and user-bots through self-play, in which the bots autonomously explore an API environment and discover communication strategies that enable them to solve the task. We give empirical results for both reinforcement learning and game-theoretic equilibrium finding.


