Dual Online Stein Variational Inference for Control and Dynamics

by   Lucas Barcelos, et al.

Model predictive control (MPC) schemes have a proven track record for delivering aggressive and robust performance in many challenging control tasks, coping with nonlinear system dynamics, constraints, and observational noise. Despite their success, these methods often rely on simple control distributions, which can limit their performance in highly uncertain and complex environments. MPC frameworks must be able to accommodate changing distributions over system parameters, based on the most recent measurements. In this paper, we devise an implicit variational inference algorithm able to estimate distributions over model parameters and control inputs on-the-fly. The method incorporates Stein Variational gradient descent to approximate the target distributions as a collection of particles, and performs updates based on a Bayesian formulation. This enables the approximation of complex multi-modal posterior distributions, typically occurring in challenging and realistic robot navigation tasks. We demonstrate our approach on both simulated and real-world experiments requiring real-time execution in the face of dynamically changing environments.


page 1

page 2


Stein Variational Model Predictive Control

Decision making under uncertainty is critical to real-world, autonomous ...

Variational Inference MPC using Normalizing Flows and Out-of-Distribution Projection

We propose a Model Predictive Control (MPC) method for collision-free na...

Bayesian Optimisation for Robust Model Predictive Control under Model Parameter Uncertainty

We propose an adaptive optimisation approach for tuning stochastic model...

Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation

To accurately reproduce measurements from the real world, simulators nee...

Annealed Stein Variational Gradient Descent

Particle based optimization algorithms have recently been developed as s...

Trust-Region Variational Inference with Gaussian Mixture Models

Many methods for machine learning rely on approximate inference from int...

Reactive Navigation under Non-Parametric Uncertainty through Hilbert Space Embedding of Probabilistic Velocity Obstacles

The probabilistic velocity obstacle (PVO) extends the concept of velocit...