Deep Active Learning for Dialogue Generation

by   Nabiha Asghar, et al.

We propose an online, end-to-end, neural generative conversational model for open-domain dialogue. It is trained using a unique combination of offline two-phase supervised learning and online human-in-the-loop active learning. While most existing research proposes offline supervision or hand-crafted reward functions for online reinforcement, we devise a novel interactive learning mechanism based on hamming-diverse beam search for response generation and one-character user-feedback at each step. Experiments show that our model inherently promotes the generation of semantically relevant and interesting responses, and can be used to train agents with customized personas, moods and conversational styles.



page 1

page 2

page 3

page 4


Deep Reinforcement Learning for Dialogue Generation

Recent neural models of dialogue generation offer great promise for gene...

Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models

Dialog response selection is an important step towards natural response ...

Neural Personalized Response Generation as Domain Adaptation

In this paper, we focus on the personalized response generation for conv...

Affective Neural Response Generation

Existing neural conversational models process natural language primarily...

Knowledge-Grounded Response Generation with Deep Attentional Latent-Variable Model

End-to-end dialogue generation has achieved promising results without us...

Dialogue Learning With Human-In-The-Loop

An important aspect of developing conversational agents is to give a bot...

Data Distillation for Controlling Specificity in Dialogue Generation

People speak at different levels of specificity in different situations....
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.