Parallel Gaussian Process Regression with Low-Rank Covariance Matrix Approximations

08/09/2014
by   Jie Chen, et al.
0

Gaussian processes (GP) are Bayesian non-parametric models that are widely used for probabilistic regression. Unfortunately, it cannot scale well with large data nor perform real-time predictions due to its cubic time cost in the data size. This paper presents two parallel GP regression methods that exploit low-rank covariance matrix approximations for distributing the computational load among parallel machines to achieve time efficiency and scalability. We theoretically guarantee the predictive performances of our proposed parallel GPs to be equivalent to that of some centralized approximate GP regression methods: The computation of their centralized counterparts can be distributed among parallel machines, hence achieving greater time efficiency and scalability. We analytically compare the properties of our parallel GPs such as time, space, and communication complexity. Empirical evaluation on two real-world datasets in a cluster of 20 computing nodes shows that our parallel GPs are significantly more time-efficient and scalable than their centralized counterparts and exact/full GP while achieving predictive performances comparable to full GP.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/17/2014

Parallel Gaussian Process Regression for Big Data: Low-Rank Representation Meets Markov Approximation

The expressive power of a Gaussian process (GP) model comes at a cost of...
research
02/27/2017

Embarrassingly parallel inference for Gaussian processes

Training Gaussian process (GP)-based models typically involves an O(N^3...
research
11/03/2018

Understanding and Comparing Scalable Gaussian Process Regression for Big Data

As a non-parametric Bayesian model which produces informative predictive...
research
09/18/2012

Scaling Multidimensional Inference for Structured Gaussian Processes

Exact Gaussian Process (GP) regression has O(N^3) runtime for data size ...
research
06/05/2020

A conditional one-output likelihood formulation for multitask Gaussian processes

Multitask Gaussian processes (MTGP) are the Gaussian process (GP) framew...
research
07/03/2018

When Gaussian Process Meets Big Data: A Review of Scalable GPs

The vast quantity of information brought by big data as well as the evol...
research
08/27/2017

Local Gaussian Processes for Efficient Fine-Grained Traffic Speed Prediction

Traffic speed is a key indicator for the efficiency of an urban transpor...

Please sign up or login with your details

Forgot password? Click here to reset