Reinforcement Learning for Robotic Time-optimal Path Tracking Using Prior Knowledge

06/30/2019
by   Jiadong Xiao, et al.
0

Time-optimal path tracking, as a significant tool for industrial robots, has attracted the attention of numerous researchers. In most time-optimal path tracking problems, the actuator torque constraints are assumed to be conservative, which ignores the motor characteristic; i.e., the actuator torque constraints are velocity-dependent, and the relationship between torque and velocity is piecewise linear. However, considering that the motor characteristics increase the solving difficulty, in this study, an improved Q-learning algorithm for robotic time-optimal path tracking using prior knowledge is proposed. After considering the limitations of the Q-learning algorithm, an improved action-value function is proposed to improve the convergence rate. The proposed algorithms use the idea of reward and penalty, rewarding the actions that satisfy constraint conditions and penalizing the actions that break constraint conditions, to finally obtain a time-optimal trajectory that satisfies the constraint conditions. The effectiveness of the algorithms is verified by experiments.

READ FULL TEXT
research
07/02/2019

Time-Optimal Path Tracking for Industrial Robots: A Dynamic Model-Free Reinforcement Learning Approach

In pursuit of the time-optimal path tracking (TOPT) trajectory of a robo...
research
07/02/2019

Time-optimal path tracking for industrial robot: A model-free reinforcement approach

In pursuit of the time-optimal motion of a robot manipulator along a pre...
research
12/22/2020

Dynamic penalty function approach for constraints handling in reinforcement learning

Reinforcement learning (RL) is attracting attentions as an effective way...
research
09/27/2022

DCE: Offline Reinforcement Learning With Double Conservative Estimates

Offline Reinforcement Learning has attracted much interest in solving th...
research
04/01/2021

Trajectory Tracking of Underactuated Sea Vessels With Uncertain Dynamics: An Integral Reinforcement Learning Approach

Underactuated systems like sea vessels have degrees of motion that are i...
research
02/21/2021

Learning Efficient Navigation in Vortical Flow Fields

Efficient point-to-point navigation in the presence of a background flow...
research
08/12/2020

An Intelligent Prediction System for Mobile Source Localization Using Time Delay Measurements

In this paper, we introduce an intelligent prediction system for mobile ...

Please sign up or login with your details

Forgot password? Click here to reset