Robust Deep Reinforcement Learning via Multi-View Information Bottleneck

02/26/2021
by   Jiameng Fan, et al.
0

Deep reinforcement learning (DRL) agents are often sensitive to visual changes that were unseen in their training environments. To address this problem, we introduce a robust representation learning approach for RL. We introduce an auxiliary objective based on the multi-view information bottleneck (MIB) principle which encourages learning representations that are both predictive of the future and less sensitive to task-irrelevant distractions. This enables us to train high-performance policies that are robust to visual distractions and can generalize to unseen environments. We demonstrate that our approach can achieve SOTA performance on challenging visual control tasks, even when the background is replaced with natural videos. In addition, we show that our approach outperforms well-established baselines on generalization to unseen environments using the large-scale Procgen benchmark.

READ FULL TEXT

page 6

page 7

page 16

research
08/03/2020

Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning

Despite the significant progress of deep reinforcement learning (RL) in ...
research
06/14/2023

VIBR: Learning View-Invariant Value Functions for Robust Visual Control

End-to-end reinforcement learning on images showed significant progress ...
research
05/30/2016

Control of Memory, Active Perception, and Action in Minecraft

In this paper, we introduce a new set of reinforcement learning (RL) tas...
research
10/21/2020

Improving Generalization in Reinforcement Learning with Mixture Regularization

Deep reinforcement learning (RL) agents trained in a limited set of envi...
research
07/19/2019

An Actor-Critic-Attention Mechanism for Deep Reinforcement Learning in Multi-view Environments

In reinforcement learning algorithms, leveraging multiple views of the e...
research
02/04/2020

Learning Task-Driven Control Policies via Information Bottlenecks

This paper presents a reinforcement learning approach to synthesizing ta...
research
01/18/2022

Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning

Learning informative representations from image-based observations is of...

Please sign up or login with your details

Forgot password? Click here to reset