Observation Space Matters: Benchmark and Optimization Algorithm

11/02/2020
by   Joanne Taery Kim, et al.
0

Recent advances in deep reinforcement learning (deep RL) enable researchers to solve challenging control problems, from simulated environments to real-world robotic tasks. However, deep RL algorithms are known to be sensitive to the problem formulation, including observation spaces, action spaces, and reward functions. There exist numerous choices for observation spaces but they are often designed solely based on prior knowledge due to the lack of established principles. In this work, we conduct benchmark experiments to verify common design choices for observation spaces, such as Cartesian transformation, binary contact flags, a short history, or global positions. Then we propose a search algorithm to find the optimal observation spaces, which examines various candidate observation spaces and removes unnecessary observation channels with a Dropout-Permutation test. We demonstrate that our algorithm significantly improves learning speed compared to manually designed observation spaces. We also analyze the proposed algorithm by evaluating different hyperparameters.

READ FULL TEXT
research
10/09/2020

Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning

Learning to locomote is one of the most common tasks in physics-based an...
research
01/01/2022

Transfer RL across Observation Feature Spaces via Model-Based Regularization

In many reinforcement learning (RL) applications, the observation space ...
research
12/21/2020

Offline Reinforcement Learning from Images with Latent Space Models

Offline reinforcement learning (RL) refers to the problem of learning po...
research
09/12/2022

Partial Observability during DRL for Robot Control

Deep Reinforcement Learning (DRL) has made tremendous advances in both s...
research
08/24/2018

Defining the problem of Observation Learning

This article defines and formulates the problem of observation learning ...
research
03/08/2019

Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes

Recent advances in on-policy reinforcement learning (RL) methods enabled...
research
09/19/2022

Understanding reinforcement learned crowds

Simulating trajectories of virtual crowds is a commonly encountered task...

Please sign up or login with your details

Forgot password? Click here to reset