When Is Generalizable Reinforcement Learning Tractable?

01/01/2021
by Dhruv Malik, et al.

Agents trained by reinforcement learning (RL) often fail to generalize beyond the environment they were trained in, even when presented with new scenarios that seem very similar to the training environment. We study the query complexity required to train RL agents that can generalize to multiple environments. Intuitively, tractable generalization is only possible when the environments are similar or close in some sense. To capture this, we introduce Strong Proximity, a structural condition which precisely characterizes the relative closeness of different environments. We provide an algorithm which exploits Strong Proximity to provably and efficiently generalize. We also show that under a natural weakening of this condition, which we call Weak Proximity, RL can require query complexity that is exponential in the horizon to generalize. A key consequence of our theory is that even when the environments share optimal trajectories, and have highly similar reward and transition functions (as measured by classical metrics), tractable generalization is impossible.

