I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents

02/07/2020
by Shrimai Prabhumoye, et al.

Dialogue research tends to distinguish between chit-chat and goal-oriented tasks. While the former is arguably more naturalistic and has a wider use of language, the latter has clearer metrics and a straightforward learning signal. Humans effortlessly combine the two, for example engaging in chit-chat with the goal of exchanging information or eliciting a specific response. Here, we bridge the divide between these two domains in the setting of a rich multi-player text-based fantasy environment where agents and humans engage in both actions and dialogue. Specifically, we train a goal-oriented model with reinforcement learning against an imitation-learned “chit-chat” model with two approaches: the policy either learns to pick a topic or learns to pick an utterance given the top-K utterances from the chit-chat model. We show that both models outperform an inverse model baseline and can converse naturally with their dialogue partner in order to achieve goals.
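
The second approach above (a policy that picks one utterance from the chit-chat model's top-K candidates) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the candidate utterances and their scores, the toy `partner_reacts` environment, the per-rank logit parameterization in `TopKPolicy`, and the plain REINFORCE update. The actual models are neural and condition on the dialogue context.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(candidates, scores, k):
    """Return the k candidates that the frozen chit-chat model scores highest."""
    ranked = sorted(zip(scores, candidates), key=lambda p: p[0], reverse=True)
    return [c for _, c in ranked[:k]]


class TopKPolicy:
    """Toy policy (hypothetical): one learnable logit per slot in the shortlist."""

    def __init__(self, k, lr=0.5):
        self.logits = [0.0] * k
        self.lr = lr

    def probs(self):
        return softmax(self.logits)

    def sample(self, rng):
        return rng.choices(range(len(self.logits)), weights=self.probs())[0]

    def reinforce(self, action, reward):
        # REINFORCE: d log pi(a) / d logit_i = 1[i == a] - pi(i)
        p = self.probs()
        for i in range(len(self.logits)):
            grad = (1.0 if i == action else 0.0) - p[i]
            self.logits[i] += self.lr * reward * grad


def partner_reacts(utterance):
    """Stand-in environment (assumption): the knight smiles only at a compliment."""
    return "smile" if utterance == "I love your chain mail!" else "nod"


def train(steps=500, seed=0, goal_action="smile"):
    rng = random.Random(seed)
    # Hypothetical candidates and chit-chat scores for a single dialogue turn.
    candidates = [
        "Good day, sir knight.",
        "I love your chain mail!",
        "The weather is dreadful.",
        "Hand over your sword.",
    ]
    scores = [0.40, 0.30, 0.20, 0.10]
    shortlist = top_k(candidates, scores, k=3)
    policy = TopKPolicy(k=len(shortlist))
    for _ in range(steps):
        action = policy.sample(rng)
        # Reward 1 when the partner performs the goal action, else 0.
        reward = 1.0 if partner_reacts(shortlist[action]) == goal_action else 0.0
        policy.reinforce(action, reward)
    return policy, shortlist


if __name__ == "__main__":
    policy, shortlist = train()
    for utterance, p in zip(shortlist, policy.probs()):
        print(f"{p:.3f}  {utterance}")
```

Because the policy only chooses among utterances the chit-chat model already ranks highly, its output stays within plausible dialogue while the reward steers it toward the goal. The topic-picking variant would instead have the policy select a topic and let the chit-chat model generate the utterance conditioned on it.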

research
02/07/2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-orientated dialogue agents

Dialogue research tends to distinguish between chit-chat and goal-orient...
research
05/24/2020

GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

A chatbot that converses like a human should be goal-oriented (i.e., be ...
research
07/02/2018

Improving Goal-Oriented Visual Dialog Agents via Advanced Recurrent Nets with Tempered Policy Gradient

Learning goal-oriented dialogues by means of deep reinforcement learning...
research
07/03/2019

Learning Multi-Party Turn-Taking Models from Dialogue Logs

This paper investigates the application of machine learning (ML) techniq...
research
05/06/2023

Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization

Policy learning (PL) is a module of a task-oriented dialogue system that...
research
10/11/2022

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

Task-oriented dialogue systems are designed to achieve specific goals wh...
research
01/07/2020

Attention over Parameters for Dialogue Systems

Dialogue systems require a great deal of different but complementary exp...
