Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning

10/06/2020
by   Rodrigo Toro Icarte, et al.
0

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As such, these methods must extensively interact with the environment in order to discover rewards and optimal policies. In most RL applications, however, users have to program the reward function and, hence, there is the opportunity to treat reward functions as white boxes instead – to show the reward function's code to the RL agent so it can exploit its internal structures to learn optimal policies faster. In this paper, we show how to accomplish this idea in two steps. First, we propose reward machines (RMs), a type of finite state machine that supports the specification of reward functions while exposing reward function structure. We then describe different methodologies to exploit such structures, including automated reward shaping, task decomposition, and counterfactual reasoning for data augmentation. Experiments on tabular and continuous domains show the benefits of exploiting reward structure across different tasks and RL agents.

READ FULL TEXT
research
06/17/2022

Logic-based Reward Shaping for Multi-Agent Reinforcement Learning

Reinforcement learning (RL) relies heavily on exploration to learn from ...
research
12/17/2021

Learning Reward Machines: A Study in Partially Observable Reinforcement Learning

Reinforcement learning (RL) is a central problem in artificial intellige...
research
01/08/2023

Learning Symbolic Representations for Reinforcement Learning of Non-Markovian Behavior

Many real-world reinforcement learning (RL) problems necessitate learnin...
research
02/01/2022

A General, Evolution-Inspired Reward Function for Social Robotics

The field of social robotics will likely need to depart from a paradigm ...
research
05/30/2019

Defining Admissible Rewards for High Confidence Policy Evaluation

A key impediment to reinforcement learning (RL) in real applications wit...
research
05/31/2022

Hierarchies of Reward Machines

Reward machines (RMs) are a recent formalism for representing the reward...
research
05/15/2023

What Matters in Reinforcement Learning for Tractography

Recently, deep reinforcement learning (RL) has been proposed to learn th...

Please sign up or login with your details

Forgot password? Click here to reset