Low Dimensional State Representation Learning with Reward-shaped Priors

07/29/2020
by   Nicolò Botteghi, et al.
4

Reinforcement Learning has been able to solve many complicated robotics tasks without any need for feature engineering in an end-to-end fashion. However, learning the optimal policy directly from the sensory inputs, i.e the observations, often requires processing and storage of a huge amount of data. In the context of robotics, the cost of data from real robotics hardware is usually very high, thus solutions that achieve high sample-efficiency are needed. We propose a method that aims at learning a mapping from the observations into a lower-dimensional state space. This mapping is learned with unsupervised learning using loss functions shaped to incorporate prior knowledge of the environment and the task. Using the samples from the state space, the optimal policy is quickly and efficiently learned. We test the method on several mobile robot navigation tasks in a simulation environment and also on a real robot.

READ FULL TEXT
research
07/04/2021

Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces

Autonomous robots require high degrees of cognitive and motoric intellig...
research
09/15/2017

Unsupervised state representation learning with robotic priors: a robustness benchmark

Our understanding of the world depends highly on our capacity to produce...
research
01/24/2019

Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

Scaling end-to-end reinforcement learning to control real robots from vi...
research
06/23/2020

Environment Shaping in Reinforcement Learning using State Abstraction

One of the central challenges faced by a reinforcement learning (RL) age...
research
10/28/2021

Equivariant Q Learning in Spatial Action Spaces

Recently, a variety of new equivariant neural network model architecture...
research
01/31/2023

Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning

We investigate policy transfer using image-to-semantics translation to m...
research
10/28/2019

Biomimetic Ultra-Broadband Perfect Absorbers Optimised with Reinforcement Learning

By learning the optimal policy with a double deep Q-learning network, we...

Please sign up or login with your details

Forgot password? Click here to reset