GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control

02/28/2022
by   Abdolreza Taheri, et al.
0

The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research. However, the computational complexity of GPs has made policy search a highly time and memory consuming process that has not been able to scale to larger problems. In this work, we develop a policy optimization method by leveraging fast predictive sampling methods to process batches of trajectories in every forward pass, and compute gradient updates over policy parameters by automatic differentiation of Monte Carlo evaluations, all on GPU. We demonstrate the effectiveness of our approach in training policies on a set of reference-tracking control experiments with a heavy-duty machine. Benchmark results show a significant speedup over exact methods and showcase the scalability of our method to larger policy networks, longer horizons, and up to thousands of trajectories with a sublinear drop in speed.

READ FULL TEXT
research
03/19/2019

Exact Gaussian Processes on a Million Data Points

Gaussian processes (GPs) are flexible models with state-of-the-art perfo...
research
01/21/2021

Model-based Policy Search for Partially Measurable Systems

In this paper, we propose a Model-Based Reinforcement Learning (MBRL) al...
research
02/21/2020

Efficiently sampling functions from Gaussian process posteriors

Gaussian processes are the gold standard for many real-world modeling pr...
research
04/22/2017

Asynchronous Distributed Variational Gaussian Processes for Regression

Gaussian processes (GPs) are powerful non-parametric function estimators...
research
04/08/2019

Spatio-Temporal Multisensor Calibration Based on Gaussian Processes Moving Object Tracking

Perception is one of the key abilities of autonomous mobile robotic syst...
research
10/08/2018

A Hybrid Approach for Trajectory Control Design

This work presents a methodology to design trajectory tracking feedback ...
research
05/25/2022

Gradient-based explanations for Gaussian Process regression and classification models

Gaussian Processes (GPs) have proven themselves as a reliable and effect...

Please sign up or login with your details

Forgot password? Click here to reset