Efficient Multiscale Gaussian Process Regression using Hierarchical Clustering

11/06/2015
by   Z. Zhang, et al.
0

Standard Gaussian Process (GP) regression, a powerful machine learning tool, is computationally expensive when it is applied to large datasets, and potentially inaccurate when data points are sparsely distributed in a high-dimensional feature space. To address these challenges, a new multiscale, sparsified GP algorithm is formulated, with the goal of application to large scientific computing datasets. In this approach, the data is partitioned into clusters and the cluster centers are used to define a reduced training set, resulting in an improvement over standard GPs in terms of training and evaluation costs. Further, a hierarchical technique is used to adaptively map the local covariance representation to the underlying sparsity of the feature space, leading to improved prediction accuracy when the data distribution is highly non-uniform. A theoretical investigation of the computational complexity of the algorithm is presented. The efficacy of this method is then demonstrated on smooth and discontinuous analytical functions and on data from a direct numerical simulation of turbulent combustion.

READ FULL TEXT
research
02/24/2014

Manifold Gaussian Processes for Regression

Off-the-shelf Gaussian Process (GP) covariance functions encode smoothne...
research
03/15/2012

Sparse-posterior Gaussian Processes for general likelihoods

Gaussian processes (GPs) provide a probabilistic nonparametric represent...
research
11/17/2018

A Greedy approximation scheme for Sparse Gaussian process regression

In their standard form Gaussian processes (GPs) provide a powerful non-p...
research
07/09/2021

Gaussian Process Subspace Regression for Model Reduction

Subspace-valued functions arise in a wide range of problems, including p...
research
02/26/2019

Multiscale Gaussian Process Level Set Estimation

In this paper, the problem of estimating the level set of a black-box fu...
research
12/30/2022

An Entropy-Based Model for Hierarchical Learning

Machine learning is the dominant approach to artificial intelligence, th...
research
08/14/2021

Active Assessment of Prediction Services as Accuracy Surface Over Attribute Combinations

Our goal is to evaluate the accuracy of a black-box classification model...

Please sign up or login with your details

Forgot password? Click here to reset