Visual Transfer for Reinforcement Learning via Wasserstein Domain Confusion

06/04/2020
by   Josh Roy, et al.

We introduce Wasserstein Adversarial Proximal Policy Optimization (WAPPO), a novel algorithm for visual transfer in Reinforcement Learning that explicitly learns to align the distributions of extracted features between a source and target task. WAPPO approximates and minimizes the Wasserstein-1 distance between the distributions of features from source and target domains via a novel Wasserstein Confusion objective. WAPPO outperforms the prior state-of-the-art in visual transfer and successfully transfers policies across Visual Cartpole and two instantiations of 16 OpenAI Procgen environments.
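
To make the quantity in the abstract concrete, here is a minimal sketch of an empirical Wasserstein-1 distance between two sets of scalar feature values. It is not WAPPO's Wasserstein Confusion objective: the abstract only says that WAPPO approximates the distance, and this sketch instead uses the closed form that holds for one-dimensional samples of equal size. The function name `wasserstein1_1d` and the sample values are hypothetical, chosen only for illustration.

```python
def wasserstein1_1d(xs, ys):
    """Empirical Wasserstein-1 distance between two equal-sized 1-D samples."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and of equal size")
    # In one dimension the optimal coupling pairs the sorted values in order,
    # so the distance is the mean absolute gap between matched order statistics.
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


# Hypothetical scalar features from a source and a target domain.
source_feats = [0.0, 1.0, 2.0, 3.0]
target_feats = [3.5, 0.5, 2.5, 1.5]  # the source values shifted by 0.5

shifted = wasserstein1_1d(source_feats, target_feats)
aligned = wasserstein1_1d(source_feats, [3.0, 2.0, 1.0, 0.0])
```

Here `shifted` measures the gap between the two feature sets, while `aligned` is zero because both samples contain the same values. Aligning feature distributions, in the sense the abstract describes, means driving such a distance toward zero.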

research
02/12/2018

A note on reinforcement learning with Wasserstein distance regularisation, with applications to multipolicy learning

In this note we describe an application of Wasserstein distance to Reinf...
research
03/22/2023

P^3O: Transferring Visual Representations for Reinforcement Learning via Prompting

It is important for deep reinforcement learning (DRL) algorithms to tran...
research
10/14/2019

Wasserstein Distance Guided Cross-Domain Learning

Domain adaptation aims to generalise a high-performance learner on targe...
research
08/02/2023

Wasserstein Diversity-Enriched Regularizer for Hierarchical Reinforcement Learning

Hierarchical reinforcement learning composites subpolicies in different ...
research
08/18/2019

VUSFA: Variational Universal Successor Features Approximator to Improve Transfer DRL for Target Driven Visual Navigation

In this paper, we show how novel transfer reinforcement learning techniq...
research
05/27/2021

Adversarial Intrinsic Motivation for Reinforcement Learning

Learning with an objective to minimize the mismatch with a reference dis...
research
11/26/2022

Transfer RL via the Undo Maps Formalism

Transferring knowledge across domains is one of the most fundamental pro...
