Thinking While Moving: Deep Reinforcement Learning with Concurrent Control

04/13/2020
by   Ted Xiao, et al.
17

We study reinforcement learning in settings where sampling an action from the policy must be done concurrently with the time evolution of the controlled system, such as when a robot must decide on the next action while still performing the previous action. Much like a person or an animal, the robot must think and move at the same time, deciding on its next action before the previous one has completed. In order to develop an algorithmic framework for such concurrent control problems, we start with a continuous-time formulation of the Bellman equations, and then discretize them in a way that is aware of system delays. We instantiate this new class of approximate dynamic programming methods via a simple architectural extension to existing value-based deep reinforcement learning algorithms. We evaluate our methods on simulated benchmark tasks and a large-scale robotic grasping task where the robot must "think while moving".

READ FULL TEXT

page 7

page 17

research
10/15/2018

Using Deep Reinforcement Learning for the Continuous Control of Robotic Arms

Deep reinforcement learning enables algorithms to learn complex behavior...
research
09/07/2022

A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Platform

With the development of industry, drones are appearing in various field....
research
02/28/2018

Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods

In this paper, we explore deep reinforcement learning algorithms for vis...
research
04/19/2023

Torque-based Deep Reinforcement Learning for Task-and-Robot Agnostic Learning on Bipedal Robots Using Sim-to-Real Transfer

In this paper, we review the question of which action space is best suit...
research
08/16/2019

Performing Deep Recurrent Double Q-Learning for Atari Games

Currently, many applications in Machine Learning are based on define new...
research
10/27/2022

Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection

In this study, we leverage the deliberate and systematic fault-injection...
research
12/02/2020

Coinbot: Intelligent Robotic Coin Bag Manipulation Using Deep Reinforcement Learning And Machine Teaching

Given the laborious difficulty of moving heavy bags of physical currency...

Please sign up or login with your details

Forgot password? Click here to reset