A General Perspective on Objectives of Reinforcement Learning

06/05/2023
by   Long Yang, et al.
0

In this lecture, we present a general perspective on reinforcement learning (RL) objectives, where we show three versions of objectives. The first version is the standard definition of objective in RL literature. Then we extend the standard definition to the λ-return version, which unifies the standard definition of objective. Finally, we propose a general objective that unifies the previous two versions. The last version provides a high level to understand of RL's objective, where it shows a fundamental formulation that connects some widely used RL techniques (e.g., TD(λ) and GAE), and this objective can be potentially applied to extensive RL algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2019

Rethinking Expected Cumulative Reward Formalism of Reinforcement Learning: A Micro-Objective Perspective

The standard reinforcement learning (RL) formulation considers the expec...
research
03/17/2023

Prevalence of Code Smells in Reinforcement Learning Projects

Reinforcement Learning (RL) is being increasingly used to learn and adap...
research
04/24/2017

Reinforcement Learning Based Dynamic Selection of Auxiliary Objectives with Preserving of the Best Found Solution

Efficiency of single-objective optimization can be improved by introduci...
research
06/15/2021

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning

Many advances that have improved the robustness and efficiency of deep r...
research
01/26/2023

Train Hard, Fight Easy: Robust Meta Reinforcement Learning

A major challenge of reinforcement learning (RL) in real-world applicati...
research
02/03/2022

Challenging Common Assumptions in Convex Reinforcement Learning

The classic Reinforcement Learning (RL) formulation concerns the maximiz...
research
03/09/2023

Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective

As a popular concept proposed in the field of psychology, affordance has...

Please sign up or login with your details

Forgot password? Click here to reset