ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R

by   Marvin N. Wright, et al.

We introduce the C++ application and R package ranger. The software is a fast implementation of random forests for high dimensional data. Ensembles of classification, regression and survival trees are supported. We describe the implementation, provide examples, validate the package with a reference implementation, and compare runtime and memory usage with other implementations. The new software proves to scale best with the number of features, samples, trees, and features tried for splitting. Finally, we show that ranger is the fastest and most memory efficient implementation of random forests to analyze data on the scale of a genome-wide association study.



page 7

page 8

page 11

page 15


Fréchet random forests

Random forests are a statistical learning method widely used in many are...

bartMachine: Machine Learning with Bayesian Additive Regression Trees

We present a new package in R implementing Bayesian additive regression ...

Random forests for survival analysis using maximally selected rank statistics

The most popular approach for analyzing survival data is the Cox regress...

Forest Packing: Fast, Parallel Decision Forests

Machine learning has an emerging critical role in high-performance compu...

Training Big Random Forests with Little Resources

Without access to large compute clusters, building random forests on lar...

Fuzzy Forests For Feature Selection in High-Dimensional Survey Data: An Application to the 2020 U.S. Presidential Election

An increasingly common methodological issue in the field of social scien...

Ordinal Trees and Random Forests: Score-Free Recursive Partitioning and Improved Ensembles

Existing ordinal trees and random forests typically use scores that are ...

Code Repositories


R package "missRanger" for fast imputation of missing values by random forests.

view repo


:exclamation: This is a read-only mirror of the CRAN R package repository. missRanger — Fast Imputation of Missing Values

view repo
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.