Towards Improving Proactive Dialog Agents Using Socially-Aware Reinforcement Learning

11/25/2022
by   Matthias Kraus, et al.
0

The next step for intelligent dialog agents is to escape their role as silent bystanders and become proactive. Well-defined proactive behavior may improve human-machine cooperation, as the agent takes a more active role during interaction and takes off responsibility from the user. However, proactivity is a double-edged sword because poorly executed pre-emptive actions may have a devastating effect not only on the task outcome but also on the relationship with the user. For designing adequate proactive dialog strategies, we propose a novel approach including both social as well as task-relevant features in the dialog. Here, the primary goal is to optimize proactive behavior so that it is task-oriented - this implies high task success and efficiency - while also being socially effective by fostering user trust. Including both aspects in the reward function for training a proactive dialog agent using reinforcement learning showed the benefit of our approach for more successful human-machine cooperation.

READ FULL TEXT
research
08/28/2019

Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog

Dialog policy decides what and how a task-oriented dialog system will re...
research
04/24/2023

Development of a Trust-Aware User Simulator for Statistical Proactive Dialog Modeling in Human-AI Teams

The concept of a Human-AI team has gained increasing attention in recent...
research
04/08/2020

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Many studies have applied reinforcement learning to train a dialog polic...
research
09/20/2021

Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play

Task-oriented dialog systems are often trained on human/human dialogs, s...
research
05/07/2020

Adaptive Dialog Policy Learning with Hindsight and User Modeling

Reinforcement learning methods have been used to compute dialog policies...
research
08/07/2019

Task-Oriented Optimal Sequencing of Visualization Charts

A chart sequence is used to describe a series of visualization charts ge...
research
08/10/2018

Mind Your Language: Learning Visually Grounded Dialog in a Multi-Agent Setting

The task of visually grounded dialog involves learning goal-oriented coo...

Please sign up or login with your details

Forgot password? Click here to reset