DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

07/26/2017
by   Irina Higgins, et al.
0

Domain adaptation is an important open problem in deep reinforcement learning (RL). In many scenarios of interest data is hard to obtain, so agents may learn a source policy in a setting where data is readily available, with the hope that it generalises well to the target domain. We propose a new multi-stage RL agent, DARLA (DisentAngled Representation Learning Agent), which learns to see before learning to act. DARLA's vision is based on learning a disentangled representation of the observed environment. Once DARLA can see, it is able to acquire source policies that are robust to many domain shifts - even with no access to the target domain. DARLA significantly outperforms conventional baselines in zero-shot domain adaptation scenarios, an effect that holds across a variety of RL environments (Jaco arm, DeepMind Lab) and base RL algorithms (DQN, A3C and EC).

READ FULL TEXT

page 5

page 7

page 13

page 15

research
02/10/2021

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Despite the recent success of deep reinforcement learning (RL), domain a...
research
09/12/2022

Unified State Representation Learning under Data Augmentation

The capacity for rapid domain adaptation is important to increasing the ...
research
12/07/2018

Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control

Although deep reinforcement learning (deep RL) methods have lots of stre...
research
03/02/2023

Domain Adaptation of Reinforcement Learning Agents based on Network Service Proximity

The dynamic and evolutionary nature of service requirements in wireless ...
research
05/27/2022

Provably Sample-Efficient RL with Side Information about Latent Dynamics

We study reinforcement learning (RL) in settings where observations are ...
research
05/30/2023

Subequivariant Graph Reinforcement Learning in 3D Environments

Learning a shared policy that guides the locomotion of different agents ...
research
10/26/2018

Transfer of Deep Reactive Policies for MDP Planning

Domain-independent probabilistic planners input an MDP description in a ...

Please sign up or login with your details

Forgot password? Click here to reset