Deep Reinforcement Learning Based Robot Arm Manipulation with Efficient Training Data through Simulation

07/16/2019
by   Xiaowei Xing, et al.
0

Deep reinforcement learning trains neural networks using experiences sampled from the replay buffer, which is commonly updated at each time step. In this paper, we propose a method to update the replay buffer adaptively and selectively to train a robot arm to accomplish a suction task in simulation. The response time of the agent is thoroughly taken into account. The state transitions that remain stuck at the boundary of constraint are not stored. The policy trained with our method works better than the one with the common replay buffer update method. The result is demonstrated both by simulation and by experiment with a real robot arm.

READ FULL TEXT
research
06/26/2022

Analysis of Stochastic Processes through Replay Buffers

Replay buffers are a key component in many reinforcement learning scheme...
research
09/17/2018

Muscle Excitation Estimation in Biomechanical Simulation Using NAF Reinforcement Learning

Motor control is a set of time-varying muscle excitations which generate...
research
10/01/2021

Sim and Real: Better Together

Simulation is used extensively in autonomous systems, particularly in ro...
research
07/14/2021

Mixing Human Demonstrations with Self-Exploration in Experience Replay for Deep Reinforcement Learning

We investigate the effect of using human demonstration data in the repla...
research
09/06/2018

ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay

Experience replay is an important technique for addressing sample-ineffi...
research
05/31/2018

Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update

We propose Episodic Backward Update - a new algorithm to boost the perfo...
research
10/18/2017

The Effects of Memory Replay in Reinforcement Learning

Experience replay is a key technique behind many recent advances in deep...

Please sign up or login with your details

Forgot password? Click here to reset