Splitting Gaussian Process Regression for Streaming Data

10/06/2020
by   Nick Terry, et al.
0

Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models make them generally unsuitable for application to streaming data. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature permits application to streaming data. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to achieve linear memory complexity. Theoretical continuity properties of the model are proven. We demonstrate the efficacy of the resulting model on multi-dimensional regression tasks for streaming data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/09/2018

Ensemble Kalman Filtering for Online Gaussian Process Regression and Learning

Gaussian process regression is a machine learning approach which has bee...
research
06/16/2020

Real-Time Regression with Dividing Local Gaussian Processes

The increased demand for online prediction and the growing availability ...
research
09/17/2015

Fast Gaussian Process Regression for Big Data

Gaussian Processes are widely used for regression tasks. A known limitat...
research
10/18/2022

Locally Smoothed Gaussian Process Regression

We develop a novel framework to accelerate Gaussian process regression (...
research
05/11/2021

Resource-aware Distributed Gaussian Process Regression for Real-time Machine Learning

We study the problem where a group of agents aim to collaboratively lear...
research
01/05/2017

Overlapping Cover Local Regression Machines

We present the Overlapping Domain Cover (ODC) notion for kernel machines...
research
09/14/2023

Scalable Model-Based Gaussian Process Clustering

Gaussian process is an indispensable tool in clustering functional data,...

Please sign up or login with your details

Forgot password? Click here to reset