GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

05/24/2020
by Jianfeng Liu, et al.

A chatbot that converses like a human should be goal-oriented (i.e., purposeful in conversation), which goes beyond language generation. However, existing dialogue systems often rely heavily on cumbersome hand-crafted rules or costly labelled datasets to reach their goals. In this paper, we propose Goal-oriented Chatbots (GoChat), a framework for end-to-end training of chatbots to maximize the long-term return from offline multi-turn dialogue datasets. Our framework uses hierarchical reinforcement learning (HRL): the high-level policy guides the conversation towards the final goal by determining sub-goals, and the low-level policy fulfills each sub-goal by generating the corresponding response utterance. In our experiments on a real-world dialogue dataset for anti-fraud in finance, our approach outperforms previous methods in both the quality of response generation and the success rate of accomplishing the goal.
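
The two-level structure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the sub-goal names, the scripted high-level policy, the template low-level policy, and the discount factor are all hypothetical stand-ins for components that GoChat learns from data. The sketch only shows the control flow (pick a sub-goal, then generate an utterance for it) and how a long-term return is accumulated from per-turn rewards.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical sub-goal labels; the abstract does not enumerate the real ones.
SUB_GOALS = ["greet", "ask_identity", "verify_transaction", "close"]


@dataclass
class HierarchicalAgent:
    # high_level maps the dialogue history to a sub-goal.
    high_level: Callable[[List[str]], str]
    # low_level maps (history, sub-goal) to a response utterance.
    low_level: Callable[[List[str], str], str]

    def respond(self, history: List[str]) -> Tuple[str, str]:
        sub_goal = self.high_level(history)
        utterance = self.low_level(history, sub_goal)
        return sub_goal, utterance


def scripted_high_level(history: List[str]) -> str:
    # Toy stand-in for a learned policy: advance one sub-goal per
    # completed (user, agent) exchange, then stay on the last one.
    completed_exchanges = len(history) // 2
    return SUB_GOALS[min(completed_exchanges, len(SUB_GOALS) - 1)]


def template_low_level(history: List[str], sub_goal: str) -> str:
    # Toy stand-in for a learned generator conditioned on the sub-goal.
    return f"<utterance for {sub_goal}>"


def discounted_return(rewards: List[float], gamma: float = 0.9) -> float:
    # Long-term return: r_0 + gamma * r_1 + gamma^2 * r_2 + ...
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total
```

In this sketch, `HierarchicalAgent(scripted_high_level, template_low_level).respond(history)` returns the chosen sub-goal together with the utterance produced for it; in a trained system both callables would be replaced by learned policies.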


research
09/19/2019

Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation

Hierarchical neural networks are often used to model inherent structures...
research
02/07/2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents

Dialogue research tends to distinguish between chit-chat and goal-orient...
research
11/22/2018

Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning

In hierarchical reinforcement learning a major challenge is determining ...
research
12/15/2017

Hierarchical Text Generation and Planning for Strategic Dialogue

End-to-end models for strategic dialogue are challenging to train, becau...
research
04/29/2020

Task-oriented Dialogue System for Automatic Disease Diagnosis via Hierarchical Reinforcement Learning

In this paper, we focus on automatic disease diagnosis with reinforcemen...
research
06/20/2023

Int-HRL: Towards Intention-based Hierarchical Reinforcement Learning

While deep reinforcement learning (RL) agents outperform humans on an in...
