Wasserstein Distance Maximizing Intrinsic Control

10/28/2021
by   Ishan Durugkar, et al.
0

This paper deals with the problem of learning a skill-conditioned policy that acts meaningfully in the absence of a reward signal. Mutual information based objectives have shown some success in learning skills that reach a diverse set of states in this setting. These objectives include a KL-divergence term, which is maximized by visiting distinct states even if those states are not far apart in the MDP. This paper presents an approach that rewards the agent for learning skills that maximize the Wasserstein distance of their state visitation from the start state of the skill. It shows that such an objective leads to a policy that covers more distance in the MDP than diversity based objectives, and validates the results on a variety of Atari environments.

READ FULL TEXT

page 4

page 5

research
12/14/2020

Relative Variational Intrinsic Control

In the absence of external rewards, agents can still learn useful behavi...
research
10/27/2021

Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching

Learning meaningful behaviors in the absence of reward is a difficult pr...
research
07/29/2021

Learning more skills through optimistic exploration

Unsupervised skill learning objectives (Gregor et al., 2016, Eysenbach e...
research
07/21/2023

Diverse Offline Imitation via Fenchel Duality

There has been significant recent progress in the area of unsupervised s...
research
09/03/2020

Action and Perception as Divergence Minimization

We introduce a unified objective for action and perception of intelligen...
research
10/23/2021

Guided Policy Search for Parameterized Skills using Adverbs

We present a method for using adverb phrases to adjust skill parameters ...
research
04/22/2019

Measuring and Assessing Latent Variation in Alliance Design and Objectives

The alliance literature is bifurcated between an empirically-driven appr...

Please sign up or login with your details

Forgot password? Click here to reset