A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines

04/20/2022
by   Weichao Zhou, et al.
0

A misspecified reward can degrade sample efficiency and induce undesired behaviors in reinforcement learning (RL) problems. We propose symbolic reward machines for incorporating high-level task knowledge when specifying the reward signals. Symbolic reward machines augment existing reward machine formalism by allowing transitions to carry predicates and symbolic reward outputs. This formalism lends itself well to inverse reinforcement learning, whereby the key challenge is determining appropriate assignments to the symbolic values from a few expert demonstrations. We propose a hierarchical Bayesian approach for inferring the most likely assignments such that the concretized reward machine can discriminate expert demonstrated trajectories from other trajectories with high accuracy. Experimental results show that learned reward machines can significantly improve training efficiency for complex RL tasks and generalize well across different task environment configurations.

READ FULL TEXT
research
12/14/2021

Programmatic Reward Design by Example

Reward design is a fundamental problem in reinforcement learning (RL). A...
research
09/12/2019

Joint Inference of Reward Machines and Policies for Reinforcement Learning

Incorporating high-level knowledge is an effective way to expedite reinf...
research
07/11/2023

Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Recent studies show that deep reinforcement learning (DRL) agents tend t...
research
05/31/2022

Hierarchies of Reward Machines

Reward machines (RMs) are a recent formalism for representing the reward...
research
11/20/2022

Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

Natural and formal languages provide an effective mechanism for humans t...
research
08/18/2023

Learning Reward Machines through Preference Queries over Sequences

Reward machines have shown great promise at capturing non-Markovian rewa...
research
10/18/2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

Reinforcement learning provides an automated framework for learning beha...

Please sign up or login with your details

Forgot password? Click here to reset