Subtractive random forests

10/19/2022
by   Nicolas Broutin, et al.
0

Motivated by online recommendation systems, we study a family of random forests. The vertices of the forest are labeled by integers. Each non-positive integer i≤ 0 is the root of a tree. Vertices labeled by positive integers n ≥ 1 are attached sequentially such that the parent of vertex n is n-Z_n, where the Z_n are i.i.d. random variables taking values in ℕ. We study several characteristics of the resulting random forest. In particular, we establish bounds for the expected tree sizes, the number of trees in the forest, the number of leaves, the maximum degree, and the height of the forest. We show that for all distributions of the Z_n, the forest contains at most one infinite tree, almost surely. If 𝔼 Z_n < ∞, then there is a unique infinite tree and the total size of the remaining trees is finite, with finite expected value if 𝔼Z_n^2 < ∞. If 𝔼 Z_n = ∞ then almost surely all trees are finite.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2010

A Generalization of the Chow-Liu Algorithm and its Application to Statistical Learning

We extend the Chow-Liu algorithm for general random variables while the ...
research
05/23/2022

The Tree-Forest Ratio

The number of rooted spanning forests divided by the number of spanning ...
research
12/29/2020

Random Planted Forest: a directly interpretable tree ensemble

We introduce a novel interpretable and tree-based algorithm for predicti...
research
12/04/2018

Proving that a Tree Language is not First-Order Definable

We explore from an algebraic viewpoint the properties of the tree langua...
research
10/16/2022

Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

The need to learn from positive and unlabeled data, or PU learning, aris...
research
03/18/2020

Enumeration of Unordered Forests

Reverse search is a convenient method for enumerating structured objects...
research
08/01/2018

Forest Learning Universal Coding

This paper considers structure learning from data with n samples of p va...

Please sign up or login with your details

Forgot password? Click here to reset