Random Planted Forest: a directly interpretable tree ensemble

12/29/2020
by   Munir Hiabu, et al.
0

We introduce a novel interpretable and tree-based algorithm for prediction in a regression setting in which each tree in a classical random forest is replaced by a family of planted trees that grow simultaneously. The motivation for our algorithm is to estimate the unknown regression function from a functional ANOVA decomposition perspective, where each tree corresponds to a function within that decomposition. Therefore, planted trees are limited in the number of interaction terms. The maximal order of approximation in the ANOVA decomposition can be specified or left unlimited. If a first order approximation is chosen, the result is an additive model. In the other extreme case, if the order of approximation is not limited, the resulting model puts no restrictions on the form of the regression function. In a simulation study we find encouraging prediction and visualisation properties of our random planted forest method. We also develop theory for an idealised version of random planted forests in the case of an underlying additive model. We show that in the additive case, the idealised version achieves up to a logarithmic factor asymptotically optimal one-dimensional convergence rates of order n^-2/5.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2018

Complete Analysis of a Random Forest Model

Random forests have become an important tool for improving accuracy in r...
research
10/19/2022

Subtractive random forests

Motivated by online recommendation systems, we study a family of random ...
research
03/02/2021

Slow-Growing Trees

Random Forest's performance can be matched by a single slow-growing tree...
research
05/09/2019

Best-scored Random Forest Density Estimation

This paper presents a brand new nonparametric density estimation strateg...
research
01/18/2019

A Random Forest Approach for Modeling Bounded Outcomes

Random forests have become an established tool for classification and re...
research
10/23/2020

Smoothing and adaptation of shifted Pólya Tree ensembles

Recently, S. Arlot and R. Genuer have shown that a model of random fores...
research
05/09/2019

Two-stage Best-scored Random Forest for Large-scale Regression

We propose a novel method designed for large-scale regression problems, ...

Please sign up or login with your details

Forgot password? Click here to reset