Improving Generalization in Reinforcement Learning with Mixture Regularization

10/21/2020
by   Kaixin Wang, et al.
0

Deep reinforcement learning (RL) agents trained in a limited set of environments tend to suffer overfitting and fail to generalize to unseen testing environments. To improve their generalizability, data augmentation approaches (e.g. cutout and random convolution) are previously explored to increase the data diversity. However, we find these approaches only locally perturb the observations regardless of the training environments, showing limited effectiveness on enhancing the data diversity and the generalization performance. In this work, we introduce a simple approach, named mixreg, which trains agents on a mixture of observations from different training environments and imposes linearity constraints on the observation interpolations and the supervision (e.g. associated reward) interpolations. Mixreg increases the data diversity more effectively and helps learn smoother policies. We verify its effectiveness on improving generalization by conducting extensive experiments on the large-scale Procgen benchmark. Results show mixreg outperforms the well-established baselines on unseen testing environments by a large margin. Mixreg is simple, effective and general. It can be applied to both policy-based and value-based RL algorithms. Code is available at https://github.com/kaixin96/mixreg .

READ FULL TEXT
research
06/23/2020

Automatic Data Augmentation for Generalization in Deep Reinforcement Learning

Deep reinforcement learning (RL) agents often fail to generalize to unse...
research
08/05/2022

Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter

Reinforcement learning agents must generalize beyond their training expe...
research
04/26/2023

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

The safe application of reinforcement learning (RL) requires generalizat...
research
09/14/2020

Decoupling Representation Learning from Reinforcement Learning

In an effort to overcome limitations of reward-driven feature learning i...
research
04/30/2020

Reinforcement Learning with Augmented Data

Learning from visual observations is a fundamental yet challenging probl...
research
10/11/2019

A Simple Randomization Technique for Generalization in Deep Reinforcement Learning

Deep reinforcement learning (RL) agents often fail to generalize to unse...
research
02/26/2021

Robust Deep Reinforcement Learning via Multi-View Information Bottleneck

Deep reinforcement learning (DRL) agents are often sensitive to visual c...

Please sign up or login with your details

Forgot password? Click here to reset