Forest Floor Visualizations of Random Forests

05/30/2016
by   Soeren H. Welling, et al.
0

We propose a novel methodology, forest floor, to visualize and interpret random forest (RF) models. RF is a popular and useful tool for non-linear multi-variate classification and regression, which yields a good trade-off between robustness (low variance) and adaptiveness (low bias). Direct interpretation of a RF model is difficult, as the explicit ensemble model of hundreds of deep trees is complex. Nonetheless, it is possible to visualize a RF model fit by its mapping from feature space to prediction space. Hereby the user is first presented with the overall geometrical shape of the model structure, and when needed one can zoom in on local details. Dimensional reduction by projection is used to visualize high dimensional shapes. The traditional method to visualize RF model structure, partial dependence plots, achieve this by averaging multiple parallel projections. We suggest to first use feature contributions, a method to decompose trees by splitting features, and then subsequently perform projections. The advantages of forest floor over partial dependence plots is that interactions are not masked by averaging. As a consequence, it is possible to locate interactions, which are not visualized in a given projection. Furthermore, we introduce: a goodness-of-visualization measure, use of colour gradients to identify interactions and an out-of-bag cross validated variant of feature contributions.

READ FULL TEXT

page 3

page 7

page 9

page 14

page 15

page 19

page 20

page 22

research
10/26/2022

Ensemble Projection Pursuit for General Nonparametric Regression

The projection pursuit regression (PPR) has played an important role in ...
research
05/17/2023

Optimal Weighted Random Forests

The random forest (RF) algorithm has become a very popular prediction me...
research
01/28/2015

ggRandomForests: Visually Exploring a Random Forest for Regression

Random Forests [Breiman:2001] (RF) are a fully non-parametric statistica...
research
01/14/2020

Interpretation and Simplification of Deep Forest

This paper proposes a new method for interpreting and simplifying a blac...
research
06/19/2018

Myocardial Segmentation of Contrast Echocardiograms Using Random Forests Guided by Shape Model

Myocardial Contrast Echocardiography (MCE) with micro-bubble contrast ag...
research
02/23/2021

Provable Boolean Interaction Recovery from Tree Ensemble obtained via Random Forests

Random Forests (RF) are at the cutting edge of supervised machine learni...
research
06/26/2017

Iterative Random Forests to detect predictive and stable high-order interactions

Genomics has revolutionized biology, enabling the interrogation of whole...

Please sign up or login with your details

Forgot password? Click here to reset