Inferring Smooth Control: Monte Carlo Posterior Policy Iteration with Gaussian Processes

10/07/2022
by   Joe Watson, et al.
0

Monte Carlo methods have become increasingly relevant for control of non-differentiable systems, approximate dynamics models and learning from data. These methods scale to high-dimensional spaces and are effective at the non-convex optimizations often seen in robot learning. We look at sample-based methods from the perspective of inference-based control, specifically posterior policy iteration. From this perspective, we highlight how Gaussian noise priors produce rough control actions that are unsuitable for physical robot deployment. Considering smoother Gaussian process priors, as used in episodic reinforcement learning and motion planning, we demonstrate how smoother model predictive control can be achieved using online sequential inference. This inference is realized through an efficient factorization of the action distribution and a novel means of optimizing the likelihood temperature to improve importance sampling accuracy. We evaluate this approach on several high-dimensional robot control tasks, matching the sample efficiency of prior heuristic methods while also ensuring smoothness. Simulation results can be seen at https://monte-carlo-ppi.github.io/.

READ FULL TEXT

page 1

page 28

page 30

page 31

research
12/12/2017

Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference

An important feature of Bayesian statistics is the possibility to do seq...
research
12/15/2021

IID Sampling from Doubly Intractable Distributions

Intractable posterior distributions of parameters with intractable norma...
research
09/02/2023

A Unifying Variational Framework for Gaussian Process Motion Planning

To control how a robot moves, motion planning algorithms must compute pa...
research
03/16/2021

Gradient-Based Markov Chain Monte Carlo for Bayesian Inference With Non-Differentiable Priors

The use of non-differentiable priors in Bayesian statistics has become i...
research
11/02/2020

Sample-efficient reinforcement learning using deep Gaussian processes

Reinforcement learning provides a framework for learning to control whic...
research
05/30/2022

Critic Sequential Monte Carlo

We introduce CriticSMC, a new algorithm for planning as inference built ...
research
10/19/2012

Monte Carlo Matrix Inversion Policy Evaluation

In 1950, Forsythe and Leibler (1950) introduced a statistical technique ...

Please sign up or login with your details

Forgot password? Click here to reset