A Framework for Reinforcement Learning and Planning

06/26/2020
by   Thomas M. Moerland, et al.
0

Sequential decision making, commonly formalized as Markov Decision Process optimization, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are planning and reinforcement learning. Both research fields largely have their own research communities. However, if both research fields solve the same problem, then we should be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying framework for reinforcement learning and planning (FRAP), which identifies the underlying dimensions on which any planning or learning algorithm has to decide. At the end of the paper, we compare - in a single table - a variety of well-known planning, model-free and model-based RL algorithms along the dimensions of our framework, illustrating the validity of the framework. Altogether, FRAP provides deeper insight into the algorithmic space of planning and reinforcement learning, and also suggests new approaches to integration of both fields.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2020

Model-based Reinforcement Learning: A Survey

Sequential decision making, commonly formalized as Markov Decision Proce...
research
05/15/2020

Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning

Planning and reinforcement learning are two key approaches to sequential...
research
10/19/2016

A Reinforcement Learning Approach to the View Planning Problem

We present a Reinforcement Learning (RL) solution to the view planning p...
research
01/12/2022

Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning

We address the problem of production planning and distribution in multi-...
research
09/11/2023

Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach

This study explores the potential of reinforcement learning algorithms t...
research
03/02/2023

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

Progress in fields of machine learning and adversarial planning has bene...

Please sign up or login with your details

Forgot password? Click here to reset