Tracking the Race Between Deep Reinforcement Learning and Imitation Learning – Extended Version

08/03/2020
by   Timo P. Gros, et al.
0

Learning-based approaches for solving large sequential decision making problems have become popular in recent years. The resulting agents perform differently and their characteristics depend on those of the underlying learning approach. Here, we consider a benchmark planning problem from the reinforcement learning domain, the Racetrack, to investigate the properties of agents derived from different deep (reinforcement) learning approaches. We compare the performance of deep supervised learning, in particular imitation learning, to reinforcement learning for the Racetrack model. We find that imitation learning yields agents that follow more risky paths. In contrast, the decisions of deep reinforcement learning are more foresighted, i.e., avoid states in which fatal decisions are more likely. Our evaluations show that for this sequential decision making problem, deep reinforcement learning performs best in many aspects even though for imitation learning optimal decisions are considered.

READ FULL TEXT

page 13

page 15

page 17

research
03/31/2020

Augmented Q Imitation Learning (AQIL)

The study of unsupervised learning can be generally divided into two cat...
research
10/05/2020

Learning to Generalize for Sequential Decision Making

We consider problems of making sequences of decisions to accomplish task...
research
05/05/2021

Learning Algorithms for Regenerative Stopping Problems with Applications to Shipping Consolidation in Logistics

We study regenerative stopping problems in which the system starts anew ...
research
09/22/2018

Geometric Multi-Model Fitting by Deep Reinforcement Learning

This paper deals with the geometric multi-model fitting from noisy, unst...
research
09/11/2019

Correlation Priors for Reinforcement Learning

Many decision-making problems naturally exhibit pronounced structures in...
research
01/22/2023

Deep Reinforcement Learning for Concentric Tube Robot Path Planning

As surgical interventions trend towards minimally invasive approaches, C...
research
03/19/2022

Teachable Reinforcement Learning via Advice Distillation

Training automated agents to complete complex tasks in interactive envir...

Please sign up or login with your details

Forgot password? Click here to reset