Infusing model predictive control into meta-reinforcement learning for mobile robots in dynamic environments

09/15/2021
by Jaeuk Shin, et al.

The successful operation of mobile robots requires them to rapidly adapt to environmental changes. Toward developing an adaptive decision-making tool for mobile robots, we propose combining meta-reinforcement learning (meta-RL) with model predictive control (MPC). The key idea of our method is to switch between a meta-learned policy and an MPC controller in an event-triggered fashion. Our method uses an off-policy meta-RL algorithm as a baseline to train a policy using transition samples generated by MPC. The MPC module of our algorithm is carefully designed to infer the movements of obstacles via Gaussian process regression (GPR) and to avoid collisions via conditional value-at-risk (CVaR) constraints. Due to its design, our method benefits from the two complementary tools. First, high-performance action samples generated by the MPC controller enhance the learning performance and stability of the meta-RL algorithm. Second, through the use of the meta-learned policy, the MPC controller is infrequently activated, thereby significantly reducing computation time. The results of our simulations on a restaurant service robot show that our algorithm outperforms both of the baseline methods.
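The two ingredients named in the abstract, a CVaR-based safety measure and an event-triggered switch between the meta-learned policy and the MPC controller, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names `empirical_cvar` and `select_controller`, the sample-average CVaR estimator, the distance-based safety loss, and the threshold rule used as the trigger. In the paper, obstacle movements are inferred via GPR and the CVaR constraints sit inside the MPC problem; here the obstacle positions are simply given as samples.

```python
import numpy as np


def empirical_cvar(losses, alpha):
    """Sample estimate of CVaR: mean of the worst (1 - alpha) fraction of losses.

    This tail-average estimator is an illustrative assumption, not the
    formulation used in the paper.
    """
    losses = np.sort(np.asarray(losses, dtype=float))
    # Number of samples in the upper tail (at least one).
    k = max(int(np.ceil((1.0 - alpha) * losses.size)), 1)
    return losses[-k:].mean()


def select_controller(robot_pos, obstacle_samples, safe_dist, alpha=0.9, tol=0.0):
    """Hypothetical event trigger: return "mpc" when the risk is too high.

    obstacle_samples: array of shape (n, 2) holding sampled obstacle positions
    (in the paper these would come from a predictive model; here they are inputs).
    """
    dists = np.linalg.norm(np.asarray(obstacle_samples) - np.asarray(robot_pos), axis=1)
    # Safety loss is positive when a sampled obstacle is closer than safe_dist.
    loss = safe_dist - dists
    return "mpc" if empirical_cvar(loss, alpha) > tol else "policy"
```

Under this assumed rule, the cheap learned policy acts whenever the tail risk of coming within `safe_dist` of an obstacle is below the tolerance, and the more expensive MPC controller is invoked only otherwise, which mirrors the computational saving described above.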

research
11/07/2021

Optimization of the Model Predictive Control Meta-Parameters Through Reinforcement Learning

Model predictive control (MPC) is increasingly being considered for cont...
research
08/10/2021

An experimental study of two predictive reinforcement learning methods and comparison with model-predictive control

Reinforcement learning (RL) has been successfully used in various simula...
research
11/09/2018

Sample-Efficient Policy Learning based on Completely Behavior Cloning

Direct policy search is one of the most important algorithms of reinforce...
research
08/29/2023

On the improvement of model-predictive controllers

This article investigates synthetic model-predictive control (MPC) probl...
research
05/18/2022

Bridging the gap between QP-based and MPC-based RL

Reinforcement learning methods typically use Deep Neural Networks to app...
research
04/17/2023

TreeC: a method to generate interpretable energy management systems using a metaheuristic algorithm

Energy management systems (EMS) have classically been implemented based ...
research
09/26/2022

Training Efficient Controllers via Analytic Policy Gradient

Control design for robotic systems is complex and often requires solving...
