Reachability-Aware Laplacian Representation in Reinforcement Learning

10/24/2022
by   Kaixin Wang, et al.
0

In Reinforcement Learning (RL), Laplacian Representation (LapRep) is a task-agnostic state representation that encodes the geometry of the environment. A desirable property of LapRep stated in prior works is that the Euclidean distance in the LapRep space roughly reflects the reachability between states, which motivates the usage of this distance for reward shaping. However, we find that LapRep does not necessarily have this property in general: two states having small distance under LapRep can actually be far away in the environment. Such mismatch would impede the learning process in reward shaping. To fix this issue, we introduce a Reachability-Aware Laplacian Representation (RA-LapRep), by properly scaling each dimension of LapRep. Despite the simplicity, we demonstrate that RA-LapRep can better capture the inter-state reachability as compared to LapRep, through both theoretical explanations and experimental results. Additionally, we show that this improvement yields a significant boost in reward shaping performance and also benefits bottleneck state discovery.

READ FULL TEXT

page 6

page 8

page 16

page 17

page 20

page 21

page 22

page 23

research
07/12/2021

Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing

The Laplacian representation recently gains increasing attention for rei...
research
09/14/2023

Goal Space Abstraction in Hierarchical Reinforcement Learning via Set-Based Reachability Analysis

Open-ended learning benefits immensely from the use of symbolic methods ...
research
09/12/2023

Goal Space Abstraction in Hierarchical Reinforcement Learning via Reachability Analysis

Open-ended learning benefits immensely from the use of symbolic methods ...
research
03/21/2022

Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL

In reinforcement learning, the graph Laplacian has proved to be a valuab...
research
10/10/2018

The Laplacian in RL: Learning Representations with Efficient Approximations

The smallest eigenvectors of the graph Laplacian are well-known to provi...
research
06/04/2018

Measuring and avoiding side effects using relative reachability

How can we design reinforcement learning agents that avoid causing unnec...

Please sign up or login with your details

Forgot password? Click here to reset