Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning

04/03/2019
by   Gabriel V. de la Cruz Jr, et al.

Deep Reinforcement Learning (DRL) algorithms are known to be data inefficient. One reason is that a DRL agent must learn both its features and its policy tabula rasa. Integrating prior knowledge into DRL algorithms is one way to improve learning efficiency, since it helps the agent build useful representations. In this work, we incorporate human knowledge to accelerate the asynchronous advantage actor-critic (A3C) algorithm by pre-training on a small number of non-expert human demonstrations. We leverage the supervised autoencoder framework and propose a novel pre-training strategy that jointly trains a weighted supervised classification loss, an unsupervised reconstruction loss, and an expected return loss. The resulting pre-trained model learns more useful features than models trained independently in a supervised or unsupervised fashion. Our pre-training method drastically improved the learning performance of the A3C agent in the Atari games Pong and MsPacman, exceeding the performance of state-of-the-art algorithms with far fewer game interactions. Our method is lightweight and easy to implement on a single machine. For reproducibility, our code is available at github.com/gabrieledcjr/DeepRL/tree/A3C-ALA2019
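
The joint objective described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `joint_pretraining_loss`, the per-class weighting scheme, the use of mean squared error for the reconstruction and return terms, and the mixing coefficients `w_sup`, `w_ae`, and `w_val` are all assumptions made for illustration. The abstract only states that the three losses are trained jointly.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the action dimension.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def joint_pretraining_loss(logits, actions, class_weights,
                           reconstruction, frames,
                           values, returns,
                           w_sup=1.0, w_ae=1.0, w_val=1.0):
    """Hypothetical joint loss: weighted classification + reconstruction + return.

    logits:         (N, A) action scores from the policy head
    actions:        (N,)   integer actions taken by the human demonstrator
    class_weights:  (A,)   per-action weights (assumed weighting scheme)
    reconstruction: decoder output, same shape as frames
    frames:         input observations
    values:         (N,)   predicted returns from the value head
    returns:        (N,)   returns computed from the demonstration data
    """
    n = logits.shape[0]
    probs = softmax(logits)
    # Weighted cross-entropy on the demonstrated actions.
    nll = -np.log(probs[np.arange(n), actions] + 1e-12)
    supervised = np.mean(class_weights[actions] * nll)
    # Unsupervised reconstruction term (assumed to be mean squared error).
    autoencoder = np.mean((reconstruction - frames) ** 2)
    # Expected-return regression term (assumed to be mean squared error).
    value = np.mean((values - returns) ** 2)
    total = w_sup * supervised + w_ae * autoencoder + w_val * value
    return total, {"supervised": supervised,
                   "autoencoder": autoencoder,
                   "value": value}
```

In a real pre-training loop the three heads would share an encoder, so gradients from all three terms would shape the same features; here the inputs are plain arrays so that the arithmetic of the combined loss is easy to inspect.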

