Breadth-first, Depth-next Training of Random Forests

10/15/2019
by   Andreea Anghel, et al.
0

In this paper we analyze, evaluate, and improve the performance of training Random Forest (RF) models on modern CPU architectures. An exact, state-of-the-art binary decision tree building algorithm is used as the basis of this study. Firstly, we investigate the trade-offs between using different tree building algorithms, namely breadth-first-search (BFS) and depth-search-first (DFS). We design a novel, dynamic, hybrid BFS-DFS algorithm and demonstrate that it performs better than both BFS and DFS, and is more robust in the presence of workloads with different characteristics. Secondly, we identify CPU performance bottlenecks when generating trees using this approach, and propose optimizations to alleviate them. The proposed hybrid tree building algorithm for RF is implemented in the Snap Machine Learning framework, and speeds up the training of RFs by 7.8x on average when compared to state-of-the-art RF solvers (sklearn, H2O, and xgboost) on a range of datasets, RF configurations, and multi-core CPU architectures.

READ FULL TEXT
research
04/19/2018

A Dynamic Boosted Ensemble Learning Based on Random Forest

We propose Dynamic Boosted Random Forest (DBRF), a novel ensemble algori...
research
04/19/2018

A Dynamic Boosted Ensemble Learning Method Based on Random Forest

We propose a dynamic boosted ensemble learning method based on random fo...
research
11/08/2021

There is no Double-Descent in Random Forests

Random Forests (RFs) are among the state-of-the-art in machine learning ...
research
06/29/2020

Random Partitioning Forest for Point-Wise and Collective Anomaly Detection – Application to Intrusion Detection

In this paper, we propose DiFF-RF, an ensemble approach composed of rand...
research
11/05/2018

Parallel training of linear models without compromising convergence

In this paper we analyze, evaluate, and improve the performance of train...
research
10/26/2017

Statistical Inference on Tree Swallow Migrations, Using Random Forests

Species migratory patterns have typically been studied through individua...
research
03/02/2021

Slow-Growing Trees

Random Forest's performance can be matched by a single slow-growing tree...

Please sign up or login with your details

Forgot password? Click here to reset