Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task

04/28/2020
by   Katya Kudashkina, et al.
0

Human-computer interactive systems that rely on machine learning are becoming paramount to the lives of millions of people who use digital assistants on a daily basis. Yet, further advances are limited by the availability of data and the cost of acquiring new samples. One way to address this problem is by improving the sample efficiency of current approaches. As a solution path, we present a model-based reinforcement learning algorithm for an interactive dialogue task. We build on commonly used actor-critic methods, adding an environment model and planner that augments a learning agent to learn the model of the environment dynamics. Our results show that, on a simulation that mimics the interactive task, our algorithm requires 70 times fewer samples, compared to the baseline of commonly used model-free algorithm, and demonstrates 2 times better performance asymptotically. Moreover, we introduce a novel contribution of computing a soft planner policy and further updating a model-free policy yielding a less computationally expensive model-free agent as good as the model-based one. This model-based architecture serves as a foundation that can be extended to other human-computer interactive tasks allowing further advances in this direction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2021

Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning

Substantial advancements to model-based reinforcement learning algorithm...
research
01/10/2023

Hint assisted reinforcement learning: an application in radio astronomy

Model based reinforcement learning has proven to be more sample efficien...
research
05/30/2018

Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

Model-based reinforcement learning (RL) algorithms can attain excellent ...
research
04/04/2020

Model-based actor-critic: GAN + DRL (actor-critic) => AGI

Our effort is toward unifying GAN and DRL algorithms into a unifying AI ...
research
10/10/2020

Trust the Model When It Is Confident: Masked Model-based Actor-Critic

It is a popular belief that model-based Reinforcement Learning (RL) is m...
research
04/28/2020

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels

We propose a simple data augmentation technique that can be applied to s...
research
08/30/2019

High efficiency rl agent

Now a day, model free algorithm achieve state of art performance on many...

Please sign up or login with your details

Forgot password? Click here to reset