Generating Persona-Consistent Dialogue Responses Using Deep Reinforcement Learning

04/30/2020
by   Mohsen Mesgar, et al.

Recent transformer-based open-domain dialogue agents are trained on reference responses in a fully supervised scenario. Such agents often display inconsistent personalities, because the training data may contain contradictory responses to identical input utterances and the training losses include no persona-relevant criteria. We propose a novel approach to train transformer-based dialogue agents using actor-critic reinforcement learning. We define a new reward function to assess generated responses in terms of persona consistency, topic consistency, and fluency. Our reference-agnostic reward relies only on a dialogue history and a persona defined by a list of facts. Automatic and human evaluations on the PERSONACHAT dataset show that our proposed approach increases the rate of persona-consistent responses compared with its peers that are trained in a fully supervised scenario using reference responses.
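To make the reward idea concrete, here is a minimal sketch of how three sub-scores could be combined into one scalar reward and fed into a one-step actor-critic update. It is an illustration under stated assumptions, not the authors' implementation: the weighted-sum form, the weight values, the [0, 1] score range, and the names `RewardWeights`, `combined_reward`, and `actor_critic_losses` are all hypothetical, and the three sub-scores are assumed to come from scorers that see only the dialogue history, the persona facts, and the generated response.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardWeights:
    """Hypothetical mixing weights; the abstract does not state any values."""
    persona: float = 0.5
    topic: float = 0.3
    fluency: float = 0.2


def combined_reward(persona_score: float,
                    topic_score: float,
                    fluency_score: float,
                    weights: RewardWeights = RewardWeights()) -> float:
    """Combine three sub-scores into a single scalar reward.

    Assumption: each sub-score lies in [0, 1] and is computed without
    looking at any reference response (reference-agnostic).
    """
    for name, score in (("persona", persona_score),
                        ("topic", topic_score),
                        ("fluency", fluency_score)):
        if not 0.0 <= score <= 1.0:
            raise ValueError(f"{name} score must be in [0, 1], got {score}")
    return (weights.persona * persona_score
            + weights.topic * topic_score
            + weights.fluency * fluency_score)


def actor_critic_losses(reward: float,
                        value_estimate: float,
                        log_prob: float) -> tuple[float, float]:
    """Return (actor_loss, critic_loss) for one sampled response.

    advantage   = reward - critic's value estimate
    actor_loss  = -advantage * log-probability of the sampled response
    critic_loss = squared error between reward and value estimate
    """
    advantage = reward - value_estimate
    actor_loss = -advantage * log_prob
    critic_loss = advantage ** 2
    return actor_loss, critic_loss


if __name__ == "__main__":
    # Example: a response that matches the persona well but drifts off topic.
    r = combined_reward(persona_score=0.9, topic_score=0.4, fluency_score=0.8)
    actor_loss, critic_loss = actor_critic_losses(r, value_estimate=0.5,
                                                  log_prob=-12.3)
    print(f"reward={r:.3f} actor_loss={actor_loss:.3f} "
          f"critic_loss={critic_loss:.3f}")
```

In this sketch, a response whose reward exceeds the critic's estimate has a positive advantage, so minimizing the actor loss raises the probability of that response. In a real system the scalar losses would be tensors backpropagated through the transformer, which is outside the scope of this example.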


research
11/30/2021

Learning to Predict Persona Information for Dialogue Personalization without Explicit Persona Description

Personalizing dialogue agents is important for dialogue systems to gener...
research
06/05/2016

Deep Reinforcement Learning for Dialogue Generation

Recent neural models of dialogue generation offer great promise for gene...
research
01/18/2016

SimpleDS: A Simple Deep Reinforcement Learning Dialogue System

This paper presents 'SimpleDS', a simple and publicly available dialogue...
research
10/31/2017

Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning

This paper presents a new method --- adversarial advantage actor-critic ...
research
12/28/2022

Improving a sequence-to-sequence nlp model using a reinforcement learning policy algorithm

Nowadays, the current neural network models of dialogue generation(chatb...
research
06/16/2022

DialogueScript: Using Dialogue Agents to Produce a Script

We present a novel approach to generating scripts by using agents with d...
research
04/16/2020

Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation

Maintaining a consistent personality in conversations is quite natural f...
