Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning

02/10/2017
by   Jason D. Williams, et al.
0

End-to-end learning of recurrent neural networks (RNNs) is an attractive solution for dialog systems; however, current techniques are data-intensive and require thousands of dialogs to learn simple behaviors. We introduce Hybrid Code Networks (HCNs), which combine an RNN with domain-specific knowledge encoded as software and system action templates. Compared to existing end-to-end approaches, HCNs considerably reduce the amount of training data required, while retaining the key benefit of inferring a latent representation of dialog state. In addition, HCNs can be optimized with supervised learning, reinforcement learning, or a mixture of both. HCNs attain state-of-the-art performance on the bAbI dialog dataset, and outperform two commercially deployed customer-facing dialog systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2016

Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning

This paper presents an end-to-end framework for task-oriented dialog sys...
research
12/18/2016

Sample-efficient Deep Reinforcement Learning for Dialog Control

Representing a dialog policy as a recurrent neural network (RNN) is attr...
research
06/03/2016

End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning

This paper presents a model for end-to-end learning of task-oriented dia...
research
07/23/2019

Structured Fusion Networks for Dialog

Neural dialog models have exhibited strong performance, however their en...
research
04/28/2018

Sentiment Adaptive End-to-End Dialog Systems

End-to-end learning framework is useful for building dialog systems for ...
research
11/02/2018

Unsupervised Learning of Interpretable Dialog Models

Recently several deep learning based models have been proposed for end-t...
research
11/29/2018

Improving Robustness of Neural Dialog Systems in a Data-Efficient Way with Turn Dropout

Neural network-based dialog models often lack robustness to anomalous, o...

Please sign up or login with your details

Forgot password? Click here to reset