A Self-adaptive LSAC-PID Approach based on Lyapunov Reward Shaping for Mobile Robots

11/03/2021
by   Xinyi Yu, et al.
0

To solve the coupling problem of control loops and the adaptive parameter tuning problem in the multi-input multi-output (MIMO) PID control system, a self-adaptive LSAC-PID algorithm is proposed based on deep reinforcement learning (RL) and Lyapunov-based reward shaping in this paper. For complex and unknown mobile robot control environment, an RL-based MIMO PID hybrid control strategy is firstly presented. According to the dynamic information and environmental feedback of the mobile robot, the RL agent can output the optimal MIMO PID parameters in real time, without knowing mathematical model and decoupling multiple control loops. Then, to improve the convergence speed of RL and the stability of mobile robots, a Lyapunov-based reward shaping soft actor-critic (LSAC) algorithm is proposed based on Lyapunov theory and potential-based reward shaping method. The convergence and optimality of the algorithm are proved in terms of the policy evaluation and improvement step of soft policy iteration. In addition, for line-following robots, the region growing method is improved to adapt to the influence of forks and environmental interference. Through comparison, test and cross-validation, the simulation and real-environment experimental results all show good performance of the proposed LSAC-PID tuning algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 8

page 9

page 11

research
03/19/2021

A Self-adaptive SAC-PID Control Approach based on Reinforcement Learning for Mobile Robots

Proportional-integral-derivative (PID) control is the most widely used i...
research
12/01/2021

Homotopy Based Reinforcement Learning with Maximum Entropy for Autonomous Air Combat

The Intelligent decision of the unmanned combat aerial vehicle (UCAV) ha...
research
12/02/2020

Pareto Deterministic Policy Gradients and Its Application in 5G Massive MIMO Networks

In this paper, we consider jointly optimizing cell load balance and netw...
research
01/13/2020

Learning to Locomote with Deep Neural-Network and CPG-based Control in a Soft Snake Robot

In this paper, we present a new locomotion control method for soft robot...
research
03/13/2023

Reinforcement Learning-based Wavefront Sensorless Adaptive Optics Approaches for Satellite-to-Ground Laser Communication

Optical satellite-to-ground communication (OSGC) has the potential to im...
research
10/09/2020

MIMO ILC for Precision SEA robots using Input-weighted Complex-Kernel Regression

This work improves the positioning precision of lightweight robots with ...
research
07/19/2021

Reinforcement learning based closed‐loop reference model adaptive flight control system design

In this study, we present a reinforcement learning (RL)-based flight con...

Please sign up or login with your details

Forgot password? Click here to reset