On Uncertainty Estimation by Tree-based Surrogate Models in Sequential Model-based Optimization

02/22/2022
by   Jungtaek Kim, et al.
0

Sequential model-based optimization sequentially selects a candidate point by constructing a surrogate model with the history of evaluations, to solve a black-box optimization problem. Gaussian process (GP) regression is a popular choice as a surrogate model, because of its capability of calculating prediction uncertainty analytically. On the other hand, an ensemble of randomized trees is another option and has practical merits over GPs due to its scalability and easiness of handling continuous/discrete mixed variables. In this paper we revisit various ensembles of randomized trees to investigate their behavior in the perspective of prediction uncertainty estimation. Then, we propose a new way of constructing an ensemble of randomized trees, referred to as BwO forest, where bagging with oversampling is employed to construct bootstrapped samples that are used to build randomized trees with random splitting. Experimental results demonstrate the validity and good performance of BwO forest over existing tree-based models in various circumstances.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/15/2023

Unboxing Tree Ensembles for interpretability: a hierarchical visualization tool and a multivariate optimal re-built tree

The interpretability of models has become a crucial issue in Machine Lea...
research
02/06/2023

Uncertainty estimation for time series forecasting via Gaussian process regression surrogates

Machine learning models are widely used to solve real-world problems in ...
research
04/04/2021

Urysohn Forest for Aleatoric Uncertainty Quantification

The terms tree and forest are normally associated with an ensemble of cl...
research
03/09/2017

mlrMBO: A Modular Framework for Model-Based Optimization of Expensive Black-Box Functions

We present mlrMBO, a flexible and comprehensive R toolbox for model-base...
research
10/19/2021

Optimal randomized classification trees

Classification and Regression Trees (CARTs) are off-the-shelf techniques...
research
04/08/2017

Interactive Graphics for Visually Diagnosing Forest Classifiers in R

This paper describes structuring data and constructing plots to explore ...
research
09/10/2019

Surrogate-based Optimization using Mutual Information for Computer Experiments (optim-MICE)

The computational burden of running a complex computer model can make op...

Please sign up or login with your details

Forgot password? Click here to reset