Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings

07/01/2022
by Jorge A. Mendez, et al.

Learning task-oriented dialog policies via reinforcement learning typically requires large amounts of interaction with users, which in practice renders such methods unusable for real-world applications. To reduce these data requirements, we propose to leverage data from across different dialog domains, thereby reducing the amount of data required from each individual domain. In particular, we propose to learn domain-agnostic action embeddings, which capture general-purpose structure that informs the system how to act given the current dialog context, and which are then specialized to a specific domain. On a set of simulated domains, we show that this approach learns with significantly less interaction with users (a reduction of 35%) and reaches a higher level of proficiency than training separate policies for each domain.
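To make the idea of a shared embedding that is specialized per domain more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a fixed-size dialog state vector, a single linear map shared by all domains that produces an action embedding, and one linear head per domain that scores that domain's actions. The class name `SharedEmbeddingPolicy`, the constant `EMBED_DIM`, the domain names, and the action names are all hypothetical, and no training loop is shown.

```python
import math
import random

EMBED_DIM = 4  # hypothetical size of the shared action-embedding space


def make_matrix(rows, cols, rng):
    """Build a small random matrix as a list of rows."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class SharedEmbeddingPolicy:
    """Illustrative policy: shared state-to-embedding map, per-domain heads."""

    def __init__(self, state_dim, domain_actions, seed=0):
        rng = random.Random(seed)
        self.domain_actions = domain_actions
        # Shared across all domains: dialog state -> action embedding.
        self.shared = make_matrix(EMBED_DIM, state_dim, rng)
        # Domain-specific: one row of EMBED_DIM weights per domain action.
        self.heads = {
            domain: make_matrix(len(actions), EMBED_DIM, rng)
            for domain, actions in domain_actions.items()
        }

    def action_probs(self, domain, state):
        """Return a probability for each action of the given domain."""
        embedding = matvec(self.shared, state)
        scores = matvec(self.heads[domain], embedding)
        return softmax(scores)

    def act(self, domain, state):
        """Pick the most probable action name for the given domain."""
        probs = self.action_probs(domain, state)
        best = max(range(len(probs)), key=probs.__getitem__)
        return self.domain_actions[domain][best]


policy = SharedEmbeddingPolicy(
    state_dim=3,
    domain_actions={
        "restaurant": ["request_food", "inform_name"],
        "hotel": ["request_area", "inform_price", "book"],
    },
)
state = [1.0, 0.0, 0.5]
hotel_probs = policy.action_probs("hotel", state)
restaurant_action = policy.act("restaurant", state)
```

In this sketch, only the `heads` entries depend on the domain, so any update to `shared` driven by data from one domain also changes the embedding used by every other domain. That is the kind of cross-domain data sharing the abstract describes, under the assumptions stated above.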


