Time Matters in Using Data Augmentation for Vision-based Deep Reinforcement Learning

02/17/2021
by   Byungchan Ko, et al.
0

Data augmentation technique from computer vision has been widely considered as a regularization method to improve data efficiency and generalization performance in vision-based reinforcement learning. We variate the timing of using augmentation, which is, in turn, critical depending on tasks to be solved in training and testing. According to our experiments on Open AI Procgen Benchmark, if the regularization imposed by augmentation is helpful only in testing, it is better to procrastinate the augmentation after training than to use it during training in terms of sample and computation complexity. We note that some of such augmentations can disturb the training process. Conversely, an augmentation providing regularization useful in training needs to be used during the whole training period to fully utilize its benefit in terms of not only generalization but also data efficiency. These phenomena suggest a useful timing control of data augmentation in reinforcement learning.

READ FULL TEXT

page 4

page 13

page 19

research
06/01/2022

Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

In deep reinforcement learning (RL), data augmentation is widely conside...
research
12/06/2018

Quantifying Generalization in Reinforcement Learning

In this paper, we investigate the problem of overfitting in deep reinfor...
research
07/18/2022

Research Trends and Applications of Data Augmentation Algorithms

In the Machine Learning research community, there is a consensus regardi...
research
10/19/2019

Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation

Deep reinforcement learning (DRL) is a promising approach for adaptive r...
research
05/27/2021

Drawing Multiple Augmentation Samples Per Image During Training Efficiently Decreases Test Error

In computer vision, it is standard practice to draw a single sample from...
research
10/13/2022

Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning

This paper proposes an advantage estimation approach based on data augme...
research
05/03/2022

Better plain ViT baselines for ImageNet-1k

It is commonly accepted that the Vision Transformer model requires sophi...

Please sign up or login with your details

Forgot password? Click here to reset