How to Make Deep RL Work in Practice

10/25/2020
by   Nirnai Rao, et al.
0

In recent years, challenging control problems became solvable with deep reinforcement learning (RL). To be able to use RL for large-scale real-world applications, a certain degree of reliability in their performance is necessary. Reported results of state-of-the-art algorithms are often difficult to reproduce. One reason for this is that certain implementation details influence the performance significantly. Commonly, these details are not highlighted as important techniques to achieve state-of-the-art performance. Additionally, techniques from supervised learning are often used by default but influence the algorithms in a reinforcement learning setting in different and not well-understood ways. In this paper, we investigate the influence of certain initialization, input normalization, and adaptive learning techniques on the performance of state-of-the-art RL algorithms. We make suggestions which of those techniques to use by default and highlight areas that could benefit from a solution specifically tailored to RL.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2019

Training Agents using Upside-Down Reinforcement Learning

Traditional Reinforcement Learning (RL) algorithms either predict reward...
research
04/18/2018

A Study on Overfitting in Deep Reinforcement Learning

Recent years have witnessed significant progresses in deep Reinforcement...
research
09/19/2017

Deep Reinforcement Learning that Matters

In recent years, significant progress has been made in solving challengi...
research
09/13/2023

Investigating the Impact of Action Representations in Policy Gradient Algorithms

Reinforcement learning (RL) is a versatile framework for learning to sol...
research
02/19/2019

Investigating Generalisation in Continuous Deep Reinforcement Learning

Deep Reinforcement Learning has shown great success in a variety of cont...
research
02/03/2022

Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems

Learning effective policies for real-world problems is still an open cha...

Please sign up or login with your details

Forgot password? Click here to reset