Dialogue Learning With Human-In-The-Loop

11/29/2016
by Jiwei Li, et al.

An important aspect of developing conversational agents is to give a bot the ability to improve through communicating with humans and to learn from the mistakes that it makes. Most research has focused on learning from fixed training sets of labeled data rather than interacting with a dialogue partner in an online fashion. In this paper we explore this direction in a reinforcement learning setting where the bot improves its question-answering ability from feedback a teacher gives following its generated responses. We build a simulator that tests various aspects of such learning in a synthetic environment, and introduce models that work in this regime. Finally, real experiments with Mechanical Turk validate the approach.
