DeepAI AI Chat
Log In Sign Up

Recurrent Model Predictive Control

by   Zhengyu Liu, et al.

This paper proposes an off-line algorithm, called Recurrent Model Predictive Control (RMPC), to solve general nonlinear finite-horizon optimal control problems. Unlike traditional Model Predictive Control (MPC) algorithms, it can make full use of the current computing resources and adaptively select the longest model prediction horizon. Our algorithm employs a recurrent function to approximate the optimal policy, which maps the system states and reference values directly to the control inputs. The number of prediction steps is equal to the number of recurrent cycles of the learned policy function. With an arbitrary initial policy function, the proposed RMPC algorithm can converge to the optimal policy by directly minimizing the designed loss function. We further prove the convergence and optimality of the RMPC algorithm thorough Bellman optimality principle, and demonstrate its generality and efficiency using two numerical examples.


Generalized Policy Iteration for Optimal Control in Continuous Time

This paper proposes the Deep Generalized Policy Iteration (DGPI) algorit...

MPC-Net: A First Principles Guided Policy Search

We present an Imitation Learning approach for the control of dynamical s...

Policy Learning for Nonlinear Model Predictive Control with Application to USVs

The unaffordable computation load of nonlinear model predictive control ...

Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function

The exploration/exploitation trade-off is an inherent challenge in data-...

Variational Inference MPC using Tsallis Divergence

In this paper, we provide a generalized framework for Variational Infere...

Safe and Near-Optimal Policy Learning for Model Predictive Control using Primal-Dual Neural Networks

In this paper, we propose a novel framework for approximating the explic...

ARES: Adaptive Receding-Horizon Synthesis of Optimal Plans

We introduce ARES, an efficient approximation algorithm for generating o...