Mondrian Forests for Large-Scale Regression when Uncertainty Matters

06/11/2015
by   Balaji Lakshminarayanan, et al.
0

Many real-world regression problems demand a measure of the uncertainty associated with each prediction. Standard decision forests deliver efficient state-of-the-art predictive performance, but high-quality uncertainty estimates are lacking. Gaussian processes (GPs) deliver uncertainty estimates, but scaling GPs to large-scale data sets comes at the cost of approximating the uncertainty estimates. We extend Mondrian forests, first proposed by Lakshminarayanan et al. (2014) for classification problems, to the large-scale non-parametric regression setting. Using a novel hierarchical Gaussian prior that dovetails with the Mondrian forest framework, we obtain principled uncertainty estimates, while still retaining the computational advantages of decision forests. Through a combination of illustrative examples, real-world large-scale datasets, and Bayesian optimization benchmarks, we demonstrate that Mondrian forests outperform approximate GPs on large-scale regression tasks and deliver better-calibrated uncertainty assessments than decision-forest-based methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2016

Deep Gaussian Processes for Regression using Approximate Expectation Propagation

Deep Gaussian processes (DGPs) are multi-layer hierarchical generalisati...
research
04/22/2017

Asynchronous Distributed Variational Gaussian Processes for Regression

Gaussian processes (GPs) are powerful non-parametric function estimators...
research
01/26/2022

Flexible domain prediction using mixed effects random forests

This paper promotes the use of random forests as versatile tools for est...
research
05/29/2022

Modeling Disagreement in Automatic Data Labelling for Semi-Supervised Learning in Clinical Natural Language Processing

Computational models providing accurate estimates of their uncertainty a...
research
04/10/2022

Gaussian Processes for Missing Value Imputation

Missing values are common in many real-life datasets. However, most of t...
research
11/11/2015

Training Deep Gaussian Processes using Stochastic Expectation Propagation and Probabilistic Backpropagation

Deep Gaussian processes (DGPs) are multi-layer hierarchical generalisati...

Please sign up or login with your details

Forgot password? Click here to reset