Model-based Reinforcement Learning: A Survey

06/30/2020
by   Thomas M. Moerland, et al.
0

Sequential decision making, commonly formalized as Markov Decision Process (MDP) optimization, is a key challenge in artificial intelligence. Two key approaches to this problem are reinforcement learning (RL) and planning. This paper presents a survey of the integration of both fields, better known as model-based reinforcement learning. Model-based RL has two main steps. First, we systematically cover approaches to dynamics model learning, including challenges like dealing with stochasticity, uncertainty, partial observability, and temporal abstraction. Second, we present a systematic categorization of planning-learning integration, including aspects like: where to start planning, what budgets to allocate to planning and real data collection, how to plan, and how to integrate planning in the learning and acting loop. After these two key sections, we also discuss the potential benefits of model-based RL, like enhanced data efficiency, targeted exploration, and improved stability. Along the survey, we also draw connections to several related RL fields, like hierarchical RL and transfer, and other research disciplines, like behavioural psychology. Altogether, the survey presents a broad conceptual overview of planning-learning combinations for MDP optimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2020

A Framework for Reinforcement Learning and Planning

Sequential decision making, commonly formalized as Markov Decision Proce...
research
02/08/2023

Predictable MDP Abstraction for Unsupervised Model-Based RL

A key component of model-based reinforcement learning (RL) is a dynamics...
research
12/12/2022

Reinforcement Learning and Tree Search Methods for the Unit Commitment Problem

The unit commitment (UC) problem, which determines operating schedules o...
research
05/05/2023

A Survey on Offline Model-Based Reinforcement Learning

Model-based approaches are becoming increasingly popular in the field of...
research
08/30/2022

An Analysis of Abstracted Model-Based Reinforcement Learning

Many methods for Model-based Reinforcement learning (MBRL) provide guara...
research
11/15/2021

Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics

Applications of Reinforcement Learning (RL) in robotics are often limite...
research
03/16/2021

Hierarchical Reinforcement Learning Framework for Stochastic Spaceflight Campaign Design

This paper develops a hierarchical reinforcement learning architecture f...

Please sign up or login with your details

Forgot password? Click here to reset