Reinforcement Learning for Mixed-Integer Problems Based on MPC

04/03/2020
by Sebastien Gros, et al.

Model Predictive Control (MPC) has recently been proposed as a policy approximation for Reinforcement Learning (RL), offering a path towards safe and explainable RL. This approach has been investigated for Q-learning and actor-critic methods, in the context of both nominal Economic MPC and Robust (N)MPC, with very promising results. In that context, actor-critic methods seem to be the most reliable approach. Many applications include a mixture of continuous and integer inputs, for which the classical actor-critic methods need to be adapted. In this paper, we present a policy approximation based on mixed-integer MPC schemes and propose a computationally inexpensive technique for generating exploration in the mixed-integer input space that ensures constraint satisfaction. We then propose a simple compatible advantage function approximation for the proposed policy, which allows one to build the gradient of the mixed-integer MPC-based policy.
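The abstract does not spell out how exploration is generated, so the following is only a minimal illustrative sketch of one generic way to explore in a mixed-integer input space without leaving the feasible set: perturb the MPC cost with random terms and re-solve over the unchanged constraints, instead of adding noise to the action itself. The toy one-step model, the bounds, the cost weights, and the names `solve_mi_mpc` and `explore` are all hypothetical and are not taken from the paper.

```python
import random

# All numbers below are hypothetical and chosen only for illustration.
X_MIN, X_MAX = -2.0, 2.0      # bounds on the next state
U_MIN, U_MAX = -1.0, 1.0      # bounds on the continuous input u
INTEGER_INPUTS = (0, 1, 2)    # admissible values of the integer input i
X_REF, W_U, C_I = 1.5, 0.1, 0.5


def solve_mi_mpc(x, d_u=0.0, d_i=(0.0, 0.0, 0.0)):
    """One-step mixed-integer MPC for the toy model x_next = x + u + i.

    Minimizes (x_next - X_REF)^2 + W_U*u^2 + C_I*i + d_u*u + d_i[k]
    subject to the input and state bounds, by enumerating the integer
    input and solving the remaining scalar quadratic in closed form.
    Returns the pair (i, u).
    """
    best = None
    for k, i in enumerate(INTEGER_INPUTS):
        # Feasible interval for u given this integer choice.
        lo = max(U_MIN, X_MIN - x - i)
        hi = min(U_MAX, X_MAX - x - i)
        if lo > hi:
            continue  # this integer input admits no feasible u
        # Unconstrained minimizer of the convex quadratic in u,
        # then clipped to the feasible interval.
        u = (2.0 * (X_REF - x - i) - d_u) / (2.0 * (1.0 + W_U))
        u = min(max(u, lo), hi)
        cost = ((x + u + i - X_REF) ** 2 + W_U * u ** 2
                + C_I * i + d_u * u + d_i[k])
        if best is None or cost < best[0]:
            best = (cost, i, u)
    if best is None:
        raise ValueError("no feasible input for this state")
    return best[1], best[2]


def explore(x, rng, sigma=1.0):
    """Sample an exploratory action by randomly perturbing the cost.

    The constraints are untouched, so the returned action is feasible
    by construction (assumption: this mirrors the idea only loosely).
    """
    d_u = rng.gauss(0.0, sigma)
    d_i = tuple(rng.gauss(0.0, sigma) for _ in INTEGER_INPUTS)
    return solve_mi_mpc(x, d_u, d_i)


if __name__ == "__main__":
    rng = random.Random(0)
    print("nominal action:", solve_mi_mpc(0.0))
    print("exploratory actions:", [explore(0.0, rng) for _ in range(3)])
```

Because the perturbation enters only through the objective, every sampled action is the solution of a problem with the original constraints, which is what keeps exploration feasible in this sketch. Enumerating the integer input is viable only for tiny problems; a realistic scheme would hand the perturbed problem to a mixed-integer solver.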

research
09/17/2021

Soft Actor-Critic With Integer Actions

Reinforcement learning is well-studied under discrete actions. Integer a...
research
11/26/2020

Learning from Simulation, Racing in Reality

We present a reinforcement learning-based solution to autonomously race ...
research
07/23/2023

Robust explicit model predictive control for hybrid linear systems with parameter uncertainties

Explicit model-predictive control (MPC) is a widely used control design ...
research
08/07/2023

Optimizing the switching operation in monoclonal antibody production: Economic MPC and reinforcement learning

Monoclonal antibodies (mAbs) have emerged as indispensable assets in med...
research
06/16/2023

Actor-Critic Model Predictive Control

Despite its success, Model Predictive Control (MPC) often requires inten...
research
06/14/2023

Integrating machine learning paradigms and mixed-integer model predictive control for irrigation scheduling

The agricultural sector currently faces significant challenges in water ...
research
06/27/2022

Stability Verification of Neural Network Controllers using Mixed-Integer Programming

We propose a framework for the stability verification of Mixed-Integer L...
