OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World

09/22/2017
by   Tu-Hoa Pham, et al.
0

While deep reinforcement learning techniques have recently produced considerable achievements on many decision-making problems, their use in robotics has largely been limited to simulated worlds or restricted motions, since unconstrained trial-and-error interactions in the real world can have undesirable consequences for the robot or its environment. To overcome such limitations, we propose a novel reinforcement learning architecture, OptLayer, that takes as inputs possibly unsafe actions predicted by a neural network and outputs the closest actions that satisfy chosen constraints. While learning control policies often requires carefully crafted rewards and penalties while exploring the range of possible actions, OptLayer ensures that only safe actions are actually executed and unsafe predictions are penalized during training. We demonstrate the effectiveness of our approach on robot reaching tasks, both simulated and in the real world.

READ FULL TEXT

page 1

page 7

page 8

research
01/30/2023

Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning

Many real-world domains require safe decision making in the presence of ...
research
12/26/2018

Learning to Walk via Deep Reinforcement Learning

Deep reinforcement learning suggests the promise of fully automated lear...
research
06/17/2021

Cat-like Jumping and Landing of Legged Robots in Low-gravity Using Deep Reinforcement Learning

In this article, we show that learned policies can be applied to solve l...
research
09/17/2022

Reinforcement Learning for Self-exploration in Narrow Spaces

In narrow spaces, motion planning based on the traditional hierarchical ...
research
06/28/2022

DayDreamer: World Models for Physical Robot Learning

To solve tasks in complex environments, robots need to learn from experi...
research
06/02/2018

DAQN: Deep Auto-encoder and Q-Network

The deep reinforcement learning method usually requires a large number o...
research
04/18/2019

When is a Prediction Knowledge?

Within Reinforcement Learning, there is a growing collection of research...

Please sign up or login with your details

Forgot password? Click here to reset