Learning Domain Invariant Representations in Goal-conditioned Block MDPs

10/27/2021
by   Beining Han, et al.
5

Deep Reinforcement Learning (RL) is successful in solving many complex Markov Decision Processes (MDPs) problems. However, agents often face unanticipated environmental changes after deployment in the real world. These changes are often spurious and unrelated to the underlying problem, such as background shifts for visual input agents. Unfortunately, deep RL policies are usually sensitive to these changes and fail to act robustly against them. This resembles the problem of domain generalization in supervised learning. In this work, we study this problem for goal-conditioned RL agents. We propose a theoretical framework in the Block MDP setting that characterizes the generalizability of goal-conditioned policies to new environments. Under this framework, we develop a practical method PA-SkewFit that enhances domain generalization. The empirical evaluation shows that our goal-conditioned RL agent can perform well in various unseen test environments, improving by 50 over baselines.

READ FULL TEXT

page 9

page 29

page 30

research
08/03/2022

AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning

Reinforcement Learning (RL) techniques have drawn great attention in man...
research
11/02/2020

Instance based Generalization in Reinforcement Learning

Agents trained via deep reinforcement learning (RL) routinely fail to ge...
research
04/01/2021

AdaPool: A Diurnal-Adaptive Fleet Management Framework using Model-Free Deep Reinforcement Learning and Change Point Detection

This paper introduces an adaptive model-free deep reinforcement approach...
research
09/24/2021

Regularization Guarantees Generalization in Bayesian Reinforcement Learning through Algorithmic Stability

In the Bayesian reinforcement learning (RL) setting, a prior distributio...
research
11/13/2022

Goal-Conditioned Reinforcement Learning in the Presence of an Adversary

Reinforcement learning has seen increasing applications in real-world co...
research
07/13/2021

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

Generalization is a central challenge for the deployment of reinforcemen...
research
04/15/2020

BabyAI++: Towards Grounded-Language Learning beyond Memorization

Despite success in many real-world tasks (e.g., robotics), reinforcement...

Please sign up or login with your details

Forgot password? Click here to reset