Best-scored Random Forest Density Estimation

05/09/2019
by   Hanyuan Hang, et al.
0

This paper presents a brand new nonparametric density estimation strategy named the best-scored random forest density estimation whose effectiveness is supported by both solid theoretical analysis and significant experimental performance. The terminology best-scored stands for selecting one density tree with the best estimation performance out of a certain number of purely random density tree candidates and we then name the selected one the best-scored random density tree. In this manner, the ensemble of these selected trees that is the best-scored random density forest can achieve even better estimation results than simply integrating trees without selection. From the theoretical perspective, by decomposing the error term into two, we are able to carry out the following analysis: First of all, we establish the consistency of the best-scored random density trees under L_1-norm. Secondly, we provide the convergence rates of them under L_1-norm concerning with three different tail assumptions, respectively. Thirdly, the convergence rates under L_∞-norm is presented. Last but not least, we also achieve the above convergence rates analysis for the best-scored random density forest. When conducting comparative experiments with other state-of-the-art density estimation approaches on both synthetic and real data sets, it turns out that our algorithm has not only significant advantages in terms of estimation accuracy over other methods, but also stronger resistance to the curse of dimensionality.

READ FULL TEXT
research
05/27/2019

Best-scored Random Forest Classification

We propose an algorithm named best-scored random forest for binary class...
research
06/24/2019

Density-based Clustering with Best-scored Random Forest

Single-level density-based approach has long been widely acknowledged to...
research
11/24/2019

Histogram Transform Ensembles for Density Estimation

We investigate an algorithm named histogram transform ensembles (HTE) de...
research
05/09/2019

Two-stage Best-scored Random Forest for Large-scale Regression

We propose a novel method designed for large-scale regression problems, ...
research
12/29/2020

Random Planted Forest: a directly interpretable tree ensemble

We introduce a novel interpretable and tree-based algorithm for predicti...
research
03/22/2018

Boosted Density Estimation Remastered

There has recently been a steadily increase in the iterative approaches ...
research
10/23/2020

Smoothing and adaptation of shifted Pólya Tree ensembles

Recently, S. Arlot and R. Genuer have shown that a model of random fores...

Please sign up or login with your details

Forgot password? Click here to reset