Minimax optimal rates for Mondrian trees and forests

03/15/2018
by Jaouad Mourtada, et al.
Introduced by Breiman (2001), Random Forests are widely used classification and regression algorithms. Although they were initially designed as batch algorithms, several variants have been proposed to handle online learning. One particular instance of such forests is the Mondrian Forest, whose trees are built using the so-called Mondrian process, which allows them to be updated easily in a streaming fashion. In this paper, we study Mondrian Forests in a batch setting and prove their consistency assuming a proper tuning of the lifetime sequence. A thorough theoretical study of Mondrian partitions allows us to derive an upper bound for the risk of Mondrian Forests, which turns out to be the minimax optimal rate for both Lipschitz and twice differentiable regression functions. These results are the first to show that some particular random forests achieve minimax rates in arbitrary dimension, paving the way to a refined theoretical analysis and thus a deeper understanding of these black box algorithms.
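
For readers unfamiliar with the Mondrian process mentioned above, the following is a minimal sketch of how one random partition of a box can be sampled. It is not taken from the paper: the function name `sample_mondrian` and its parameters are hypothetical, and the code assumes the standard description of the process (exponential waiting time with rate equal to the sum of the cell's side lengths, split dimension chosen proportionally to side length, uniform cut position, recursion stopped once the lifetime is exceeded).

```python
import random


def sample_mondrian(box, lifetime, birth=0.0, rng=random):
    """Return the leaf cells of one Mondrian partition of `box`.

    box      : list of (low, high) intervals, one per dimension (hypothetical format)
    lifetime : the lifetime parameter; larger values give finer partitions
    birth    : time at which the current cell was created
    rng      : the `random` module or a `random.Random` instance
    """
    lengths = [high - low for low, high in box]
    # Waiting time before the next cut: exponential with rate = sum of side lengths.
    split_time = birth + rng.expovariate(sum(lengths))
    if split_time > lifetime:
        # No cut happens before the lifetime: this cell is a leaf.
        return [box]
    # Pick the split dimension with probability proportional to its side length.
    dim = rng.choices(range(len(box)), weights=lengths)[0]
    low, high = box[dim]
    # The cut position is uniform along the chosen side.
    cut = rng.uniform(low, high)
    left, right = list(box), list(box)
    left[dim] = (low, cut)
    right[dim] = (cut, high)
    # Both children continue independently from the time of the cut.
    return (sample_mondrian(left, lifetime, split_time, rng)
            + sample_mondrian(right, lifetime, split_time, rng))


if __name__ == "__main__":
    cells = sample_mondrian([(0.0, 1.0), (0.0, 1.0)], lifetime=3.0)
    print(len(cells), "cells")
```

In this sketch the lifetime controls the resolution of the partition, which is why the abstract speaks of tuning a lifetime sequence as the sample size grows.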

research
09/22/2021

Minimax Rates for STIT and Poisson Hyperplane Random Forests

In [12], Mourtada, Gaïffas and Scornet showed that, under proper tuning ...
research
11/08/2017

Universal consistency and minimax rates for online Mondrian Forests

We establish the consistency of an algorithm of Mondrian Forests, a rand...
research
03/18/2020

Enumeration of Unordered Forests

Reverse search is a convenient method for enumerating structured objects...
research
08/06/2020

Modeling of time series using random forests: theoretical developments

In this paper we study asymptotic properties of random forests within th...
research
10/16/2021

Streaming Decision Trees and Forests

Machine learning has successfully leveraged modern data and provided com...
research
06/16/2016

The Mondrian Kernel

We introduce the Mondrian kernel, a fast random feature approximation to...
research
10/31/2022

HARRIS: Hybrid Ranking and Regression Forests for Algorithm Selection

It is well known that different algorithms perform differently well on a...
