Stein Variational Guided Model Predictive Path Integral Control: Proposal and Experiments with Fast Maneuvering Vehicles

09/20/2023
by Kohei Honda, et al.

This paper presents a novel Stochastic Optimal Control (SOC) method based on Model Predictive Path Integral control (MPPI), named Stein Variational Guided MPPI (SVG-MPPI), designed to handle rapidly shifting multimodal optimal action distributions. MPPI can find a Gaussian approximation of the optimal action distribution in closed form, i.e., without iterative solution updates. However, it struggles when the optimal distribution is multimodal, as happens with non-convex constraints for obstacle avoidance, because a single Gaussian cannot represent several modes. To overcome this limitation, our method identifies a target mode of the optimal distribution and guides the solution to converge to it. In the proposed method, the target mode is roughly estimated using a modified Stein Variational Gradient Descent (SVGD) method and embedded into the MPPI algorithm to find a closed-form "mode-seeking" solution that covers only the target mode, thus preserving the fast convergence property of MPPI. Our simulation and real-world experimental results demonstrate that SVG-MPPI outperforms both the original MPPI and other state-of-the-art sampling-based SOC algorithms in terms of path-tracking and obstacle-avoidance capabilities. Source code: https://github.com/kohonda/proj-svg_mppi
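The multimodality issue described above can be illustrated with a minimal one-dimensional sketch. This is not the authors' implementation: `mppi_update`, `bimodal_cost`, the temperature value, and the sampling parameters are all hypothetical names and choices made for illustration. The standard closed-form MPPI update is an exponentially weighted mean of sampled actions; when the cost has two equally good modes, that mean falls between them. Sampling around a hand-picked point near one mode (a stand-in for a mode estimate, not the paper's SVGD step) yields a mean close to that mode.

```python
import math
import random


def mppi_update(samples, costs, temperature):
    """Closed-form MPPI-style update: softmax-weighted mean of sampled actions."""
    min_cost = min(costs)  # subtract the minimum cost for numerical stability
    weights = [math.exp(-(c - min_cost) / temperature) for c in costs]
    total = sum(weights)
    return sum(w * u for w, u in zip(weights, samples)) / total


def bimodal_cost(u):
    # Hypothetical cost with two equally good actions, u = -1 and u = +1,
    # e.g. passing an obstacle on the left or on the right.
    return (u * u - 1.0) ** 2


rng = random.Random(0)

# Samples centred between the two modes: the weighted mean averages them out.
samples = [rng.gauss(0.0, 1.0) for _ in range(5000)]
costs = [bimodal_cost(u) for u in samples]
mean_action = mppi_update(samples, costs, temperature=0.1)

# Samples centred near one mode (assumed given here): the weighted mean
# stays close to that mode.
guided_samples = [rng.gauss(0.8, 0.3) for _ in range(5000)]
guided_costs = [bimodal_cost(u) for u in guided_samples]
guided_action = mppi_update(guided_samples, guided_costs, temperature=0.1)

print("unguided mean action:", mean_action)
print("guided mean action:", guided_action)
```

In this toy setting the unguided mean lands near zero, where the cost is high, while the guided mean lands near the mode at +1. How the target mode is actually estimated and embedded into MPPI is described in the full paper and the linked source code.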

