Augmented Q Imitation Learning (AQIL)

03/31/2020
by   Xiao-Lei Zhang, et al.
0

The study of unsupervised learning can be generally divided into two categories: imitation learning and reinforcement learning. In imitation learning the machine learns by mimicking the behavior of an expert system whereas in reinforcement learning the machine learns via direct environment feedback. Traditional deep reinforcement learning takes a significant time before the machine starts to converge to an optimal policy. This paper proposes Augmented Q-Imitation-Learning, a method by which deep reinforcement learning convergence can be accelerated by applying Q-imitation-learning as the initial training process in traditional Deep Q-learning.

READ FULL TEXT

page 3

page 4

page 5

research
08/10/2021

Imitation Learning by Reinforcement Learning

Imitation Learning algorithms learn a policy from demonstrations of expe...
research
08/03/2020

Tracking the Race Between Deep Reinforcement Learning and Imitation Learning – Extended Version

Learning-based approaches for solving large sequential decision making p...
research
08/21/2020

Adversarial Imitation Learning via Random Search

Developing agents that can perform challenging complex tasks is the goal...
research
10/27/2019

BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning

The field of Deep Reinforcement Learning (DRL) has recently seen a surge...
research
03/14/2018

Imitation Learning with Concurrent Actions in 3D Games

In this work we describe a novel deep reinforcement learning neural netw...
research
09/02/2021

Reinforcement Learning for Battery Energy Storage Dispatch augmented with Model-based Optimizer

Reinforcement learning has been found useful in solving optimal power fl...
research
08/28/2019

An Empirical Comparison on Imitation Learning and Reinforcement Learning for Paraphrase Generation

Generating paraphrases from given sentences involves decoding words step...

Please sign up or login with your details

Forgot password? Click here to reset