Dynamic Planning in Open-Ended Dialogue using Reinforcement Learning

07/25/2022
by   Deborah Cohen, et al.
0

Despite recent advances in natural language understanding and generation, and decades of research on the development of conversational bots, building automated agents that can carry on rich open-ended conversations with humans "in the wild" remains a formidable challenge. In this work we develop a real-time, open-ended dialogue system that uses reinforcement learning (RL) to power a bot's conversational skill at scale. Our work pairs the succinct embedding of the conversation state generated using SOTA (supervised) language models with RL techniques that are particularly suited to a dynamic action space that changes as the conversation progresses. Trained using crowd-sourced data, our novel system is able to substantially exceeds the (strong) baseline supervised model with respect to several metrics of interest in a live experiment with real users of the Google Assistant.

READ FULL TEXT
research
02/21/2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Reinforcement learning (RL) has shown great promise for developing dialo...
research
10/05/2021

NaRLE: Natural Language Models using Reinforcement Learning with Emotion Feedback

Current research in dialogue systems is focused on conversational assist...
research
05/31/2022

A Mixture-of-Expert Approach to RL-based Dialogue Management

Despite recent advancements in language models (LMs), their application ...
research
02/18/2018

Improving Mild Cognitive Impairment Prediction via Reinforcement Learning and Dialogue Simulation

Mild cognitive impairment (MCI) is a prodromal phase in the progression ...
research
07/14/2023

Understanding Multi-Turn Toxic Behaviors in Open-Domain Chatbots

Recent advances in natural language processing and machine learning have...
research
06/07/2023

Improving Open Language Models by Learning from Organic Interactions

We present BlenderBot 3x, an update on the conversational model BlenderB...
research
11/13/2022

Conversational Pattern Mining using Motif Detection

The subject of conversational mining has become of great interest recent...

Please sign up or login with your details

Forgot password? Click here to reset