Multi-fidelity classification using Gaussian processes: accelerating the prediction of large-scale computational models

Machine learning techniques typically rely on large datasets to create accurate classifiers. However, there are situations when data is scarce and expensive to acquire. This is the case of studies that rely on state-of-the-art computational models which typically take days to run, thus hindering the potential of machine learning tools. In this work, we present a novel classifier that takes advantage of lower fidelity models and inexpensive approximations to predict the binary output of expensive computer simulations. We postulate an autoregressive model between the different levels of fidelity with Gaussian process priors. We adopt a fully Bayesian treatment for the hyper-parameters and use Markov Chain Mont Carlo samplers. We take advantage of the probabilistic nature of the classifier to implement active learning strategies. We also introduce a sparse approximation to enhance the ability of themulti-fidelity classifier to handle large datasets. We test these multi-fidelity classifiers against their single-fidelity counterpart with synthetic data, showing a median computational cost reduction of 23 target accuracy of 90 multi-fidelity classifier achieves an F1 score, the harmonic mean of precision and recall, of 99.6 both are trained with 50 samples. In general, our results show that the multi-fidelity classifiers outperform their single-fidelity counterpart in terms of accuracy in all cases. We envision that this new tool will enable researchers to study classification problems that would otherwise be prohibitively expensive. Source code is available at https://github.com/fsahli/MFclass.

READ FULL TEXT

page 7

page 8

page 10

page 11

research
06/29/2020

Multi-fidelity modeling with different input domain definitions using Deep Gaussian Processes

Multi-fidelity approaches combine different models built on a scarce but...
research
07/31/2021

A graphical Gaussian process model for multi-fidelity emulation of expensive computer codes

We present a novel Graphical Multi-fidelity Gaussian Process (GMGP) mode...
research
04/08/2021

Residual Gaussian Process: A Tractable Nonparametric Bayesian Emulator for Multi-fidelity Simulations

Challenges in multi-fidelity modeling relate to accuracy, uncertainty es...
research
10/16/2020

Multi-fidelity data fusion for the approximation of scalar functions with low intrinsic dimensionality through active subspaces

Gaussian processes are employed for non-parametric regression in a Bayes...
research
06/25/2020

Green Machine Learning via Augmented Gaussian Processes and Multi-Information Source Optimization

Searching for accurate Machine and Deep Learning models is a computation...
research
10/27/2021

Multi-fidelity data fusion through parameter space reduction with applications to automotive engineering

Multi-fidelity models are of great importance due to their capability of...
research
09/20/2017

Integrating hyper-parameter uncertainties in a multi-fidelity Bayesian model for the estimation of a probability of failure

A multi-fidelity simulator is a numerical model, in which one of the inp...

Please sign up or login with your details

Forgot password? Click here to reset