Optimizing model-agnostic Random Subspace ensembles

09/07/2021
by   Vân Anh Huynh-Thu, et al.
0

This paper presents a model-agnostic ensemble approach for supervised learning. The proposed approach alternates between (1) learning an ensemble of models using a parametric version of the Random Subspace approach, in which feature subsets are sampled according to Bernoulli distributions, and (2) identifying the parameters of the Bernoulli distributions that minimize the generalization error of the ensemble model. Parameter optimization is rendered tractable by using an importance sampling approach able to estimate the expected model output for any given parameter set, without the need to learn new models. While the degree of randomization is controlled by a hyper-parameter in standard Random Subspace, it has the advantage to be automatically tuned in our parametric version. Furthermore, model-agnostic feature importance scores can be easily derived from the trained ensemble model. We show the good performance of the proposed approach, both in terms of prediction and feature ranking, on simulated and real-world datasets. We also show that our approach can be successfully used for the reconstruction of gene regulatory networks.

READ FULL TEXT
research
01/20/2022

Bayesian Nonparametric Mixtures of Exponential Random Graph Models for Ensembles of Networks

Ensembles of networks arise in various fields where multiple independent...
research
01/25/2015

Prediction Error Reduction Function as a Variable Importance Score

This paper introduces and develops a novel variable importance score fun...
research
06/16/2020

RaSE: Random Subspace Ensemble Classification

We propose a new model-free ensemble classification framework, Random Su...
research
02/04/2014

Sequential Model-Based Ensemble Optimization

One of the most tedious tasks in the application of machine learning is ...
research
06/08/2015

Learning Mixtures of Ising Models using Pseudolikelihood

Maximum pseudolikelihood method has been among the most important method...
research
03/26/2019

A layered multiple importance sampling scheme for focused optimal Bayesian experimental design

We develop a new computational approach for "focused" optimal Bayesian e...
research
02/11/2021

Neural BRDF Representation and Importance Sampling

Controlled capture of real-world material appearance yields tabulated se...

Please sign up or login with your details

Forgot password? Click here to reset