Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

04/18/2018
by Bing Liu, et al.

In this work, we present a hybrid learning method for training task-oriented dialogue systems through online user interactions. Popular methods for learning task-oriented dialogues include applying reinforcement learning with user feedback to supervised pre-trained models. The efficiency of such learning methods may suffer from the mismatch in dialogue state distribution between the offline training and online interactive learning stages. To address this challenge, we propose a hybrid imitation and reinforcement learning method, with which a dialogue agent can effectively learn from its interactions with users by learning from human teaching and feedback. We design a neural network based task-oriented dialogue agent that can be optimized end-to-end with the proposed learning method. Experimental results show that our end-to-end dialogue agent can learn effectively from the mistakes it makes via imitation learning from user teaching. Applying reinforcement learning with user feedback after the imitation learning stage further improves the agent's capability in successfully completing a task.
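The two-stage schedule described in the abstract (imitation learning from user teaching, then reinforcement learning from user feedback) can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: a tabular softmax policy stands in for the neural agent, the three-state task and `CORRECT_ACTION` simulate the teaching user, feedback is a binary success reward, and the names `nudge`, `run_episode`, and `train` are hypothetical. Unlike a real dialogue, the toy visits a fixed sequence of states, so it does not reproduce the state distribution mismatch itself.

```python
import math
import random

# Hypothetical toy task: three dialogue states visited in order, three agent actions.
CORRECT_ACTION = [2, 0, 1]  # action a simulated user would teach in each state
N_ACTIONS = 3


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def nudge(logits, state, action, step):
    """Move logits[state] along the gradient of log pi(action | state), scaled by step."""
    p = softmax(logits[state])
    for a in range(N_ACTIONS):
        logits[state][a] += step * ((1.0 if a == action else 0.0) - p[a])


def run_episode(logits, rng):
    """Let the agent act with its own policy; return the (state, action) pairs."""
    return [
        (s, rng.choices(range(N_ACTIONS), weights=softmax(logits[s]))[0])
        for s in range(len(CORRECT_ACTION))
    ]


def success_probability(logits):
    """Exact probability that the policy picks the correct action in every state."""
    prob = 1.0
    for s, a in enumerate(CORRECT_ACTION):
        prob *= softmax(logits[s])[a]
    return prob


def train(seed=0, imitation_episodes=30, rl_episodes=200, lr=0.5):
    rng = random.Random(seed)
    logits = [[0.0] * N_ACTIONS for _ in CORRECT_ACTION]

    # Stage 1 (imitation): the agent acts, and the simulated user corrects each
    # mistake by demonstrating the right action for that state.
    for _ in range(imitation_episodes):
        for state, action in run_episode(logits, rng):
            if action != CORRECT_ACTION[state]:
                nudge(logits, state, CORRECT_ACTION[state], lr)
    after_imitation = success_probability(logits)

    # Stage 2 (reinforcement): REINFORCE with a binary end-of-dialogue reward
    # (assumed here: 1.0 if every action was correct, otherwise 0.0).
    for _ in range(rl_episodes):
        episode = run_episode(logits, rng)
        reward = 1.0 if all(a == CORRECT_ACTION[s] for s, a in episode) else 0.0
        for state, action in episode:
            nudge(logits, state, action, lr * reward)
    after_rl = success_probability(logits)

    return after_imitation, after_rl
```

Calling `train()` returns the task success probability after each stage. In this toy setting the reinforcement stage only reinforces fully successful episodes, so it cannot lower the success probability reached by imitation; that property follows from the assumed reward scheme and is not a claim about the paper's results.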


