Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

02/10/2021
by   Jinwei Xing, et al.
0

Despite the recent success of deep reinforcement learning (RL), domain adaptation remains an open problem. Although the generalization ability of RL agents is critical for the real-world applicability of Deep RL, zero-shot policy transfer is still a challenging problem since even minor visual changes could make the trained agent completely fail in the new task. To address this issue, we propose a two-stage RL agent that first learns a latent unified state representation (LUSR) which is consistent across multiple domains in the first stage, and then do RL training in one source domain based on LUSR in the second stage. The cross-domain consistency of LUSR allows the policy acquired from the source domain to generalize to other target domains without extra training. We first demonstrate our approach in variants of CarRacing games with customized manipulations, and then verify it in CARLA, an autonomous driving simulator with more complex and realistic visual observations. Our results show that this approach can achieve state-of-the-art domain adaptation performance in related RL tasks and outperforms prior approaches based on latent-representation based RL and image-to-image translation.

READ FULL TEXT

page 4

page 5

page 6

research
09/12/2022

Unified State Representation Learning under Data Augmentation

The capacity for rapid domain adaptation is important to increasing the ...
research
07/26/2017

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Domain adaptation is an important open problem in deep reinforcement lea...
research
03/02/2023

Domain Adaptation of Reinforcement Learning Agents based on Network Service Proximity

The dynamic and evolutionary nature of service requirements in wireless ...
research
07/01/2021

Distilling Reinforcement Learning Tricks for Video Games

Reinforcement learning (RL) research focuses on general solutions that c...
research
06/04/2021

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL

A highly desirable property of a reinforcement learning (RL) agent – and...
research
10/02/2021

Cycle-Consistent World Models for Domain Independent Latent Imagination

End-to-end autonomous driving seeks to solve the perception, decision, a...
research
04/07/2021

Unsupervised Visual Attention and Invariance for Reinforcement Learning

Vision-based reinforcement learning (RL) is successful, but how to gener...

Please sign up or login with your details

Forgot password? Click here to reset