Advancing Trajectory Optimization with Approximate Inference: Exploration, Covariance Control and Adaptive Risk

03/10/2021
by Joe Watson, et al.

Discrete-time stochastic optimal control remains a challenging problem for general nonlinear systems under significant uncertainty, and practical solvers typically rely on the certainty equivalence assumption, replanning, extensive regularization, or a combination of these. Control as inference frames stochastic control as an equivalent inference problem and has demonstrated desirable qualities over existing methods, namely in exploration and regularization. We look specifically at the input inference for control (i2c) algorithm and derive three key characteristics that enable advanced trajectory optimization: an 'expert' linear Gaussian controller that combines the benefits of open-loop optima and closed-loop variance reduction when optimizing for nonlinear systems, inherent adaptive risk sensitivity arising from the inference formulation, and covariance control functionality that requires only a minor algorithmic adjustment.

04/17/2019
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
This paper addresses the problem of learning the optimal control policy ...

05/17/2021
Stochastic Control through Approximate Bayesian Input Inference
Optimal control under uncertainty is a prevailing challenge in control, ...

10/07/2019
Stochastic Optimal Control as Approximate Input Inference
Optimal control of stochastic nonlinear dynamical systems is a major cha...

03/09/2021
Combining Gaussian processes and polynomial chaos expansions for stochastic nonlinear model predictive control
Model predictive control is an advanced control approach for multivariab...

10/16/2020
RAT iLQR: A Risk Auto-Tuning Controller to Optimally Account for Stochastic Model Mismatch
Successful robotic operation in stochastic environments relies on accura...

11/26/2020
Regret Bounds for Adaptive Nonlinear Control
We study the problem of adaptively controlling a known discrete-time non...