Deep Reinforcement Learning with Stage Incentive Mechanism for Robotic Trajectory Planning

09/25/2020
by   Jin Yang, et al.

This paper aims to improve the efficiency of deep reinforcement learning (DRL) based methods for robot manipulator trajectory planning in random working environments. In contrast to the traditional sparse reward function, we present three dense reward functions. First, a posture reward function is proposed that models distance and direction constraints; it accelerates learning and yields more reasonable trajectories by reducing the blindness of exploration. Second, to improve stability, a stride reward function is proposed that models constraints on the distance and on the movement distance of the joints, which makes the learning process more stable. Finally, to further improve learning efficiency, we draw on the cognitive process of human behavior and propose a stage incentive mechanism, comprising a hard stage incentive reward function and a soft stage incentive reward function. Extensive experiments show that the proposed soft stage incentive reward function improves the convergence rate by up to 46.9% relative to the compared methods. The convergence mean reward increases by 4.4%, and the standard deviation decreases by 21.9%. In the evaluation, the success rate of trajectory planning for the robot manipulator is up to 99.6%.
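
The abstract names the reward terms but does not give their functional forms. The Python sketch below is only an illustration of how such dense terms could be composed, not the authors' formulation. Every function name, weight, threshold, and formula in it is an assumption: the posture term combines a distance penalty with a direction (cosine) term, the stride term penalizes distance and joint movement, the "hard" stage bonus is a step function of the remaining distance, and the "soft" stage bonus is a smoothed version of the same steps.

```python
import math

# All forms, weights, and thresholds below are hypothetical placeholders;
# the abstract does not specify them.

def posture_reward(to_target, heading, w_dist=1.0, w_dir=0.5):
    """Distance penalty plus a direction term (cosine between the
    end-effector motion and the vector pointing to the target)."""
    dist = math.sqrt(sum(c * c for c in to_target))
    speed = math.sqrt(sum(c * c for c in heading))
    if dist == 0.0 or speed == 0.0:
        cos_angle = 0.0  # direction is undefined, so contribute nothing
    else:
        dot = sum(a * b for a, b in zip(to_target, heading))
        cos_angle = dot / (dist * speed)
    return -w_dist * dist + w_dir * cos_angle

def stride_reward(dist, joint_deltas, w_dist=1.0, w_move=0.1):
    """Penalize remaining distance and the total joint movement of one step."""
    return -w_dist * dist - w_move * sum(abs(dq) for dq in joint_deltas)

def hard_stage_bonus(dist, thresholds=(0.5, 0.2, 0.05), bonus=1.0):
    """Step-wise incentive: one bonus unit per distance threshold passed."""
    return bonus * sum(1 for t in thresholds if dist < t)

def soft_stage_bonus(dist, thresholds=(0.5, 0.2, 0.05), bonus=1.0,
                     sharpness=20.0):
    """Smooth incentive: each step is replaced by a sigmoid, written with
    tanh so that large distances cannot overflow."""
    return bonus * sum(
        0.5 * (1.0 + math.tanh(0.5 * sharpness * (t - dist)))
        for t in thresholds
    )
```

In this sketch, a per-step reward could be formed by adding one of the stage bonuses to the posture or stride term. The soft variant changes continuously with the remaining distance, whereas the hard variant jumps at each threshold.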


