Discovering Intrinsic Reward with Contrastive Random Walk

04/23/2022
by   Zixuan Pan, et al.
0

The aim of this paper is to demonstrate the efficacy of using Contrastive Random Walk as a curiosity method to achieve faster convergence to the optimal policy.Contrastive Random Walk defines the transition matrix of a random walk with the help of neural networks. It learns a meaningful state representation with a closed loop. The loss of Contrastive Random Walk serves as an intrinsic reward and is added to the environment reward. Our method works well in non-tabular sparse reward scenarios, in the sense that our method receives the highest reward within the same iterations compared to other methods. Meanwhile, Contrastive Random Walk is more robust. The performance doesn't change much with different random initialization of environments. We also find that adaptive restart and appropriate temperature are crucial to the performance of Contrastive Random Walk.

READ FULL TEXT

page 7

page 8

research
02/11/2020

Vertex-reinforced Random Walk for Network Embedding

In this paper, we study the fundamental problem of random walk for netwo...
research
01/09/2018

Compressing Deep Neural Networks: A New Hashing Pipeline Using Kac's Random Walk Matrices

The popularity of deep learning is increasing by the day. However, despi...
research
01/25/2018

Random Walk Fundamental Tensor and its Applications to Network Analysis

We first present a comprehensive review of various random walk metrics u...
research
03/08/2022

On the elephant random walk with stops playing hide and seek with the Mittag-Leffler distribution

The aim of this paper is to investigate the asymptotic behavior of the s...
research
03/02/2019

Discovering Options for Exploration by Minimizing Cover Time

One of the main challenges in reinforcement learning is solving tasks wi...
research
06/22/2018

PCA of high dimensional random walks with comparison to neural network training

One technique to visualize the training of neural networks is to perform...
research
11/07/2018

Analysis of visitors' mobility patterns through random walk in the Louvre museum

This paper proposes a random walk model to analyze visitors' mobility pa...

Please sign up or login with your details

Forgot password? Click here to reset