Very fast, approximate counterfactual explanations for decision forests

We consider finding a counterfactual explanation for a classification or regression forest, such as a random forest. This requires solving an optimization problem to find the closest input instance to a given instance for which the forest outputs a desired value. Finding an exact solution has a cost that is exponential on the number of leaves in the forest. We propose a simple but very effective approach: we constrain the optimization to only those input space regions defined by the forest that are populated by actual data points. The problem reduces to a form of nearest-neighbor search using a certain distance on a certain dataset. This has two advantages: first, the solution can be found very quickly, scaling to large forests and high-dimensional data, and enabling interactive use. Second, the solution found is more likely to be realistic in that it is guided towards high-density areas of input space.

READ FULL TEXT

page 6

page 12

page 20

research
04/06/2016

Comments on: "A Random Forest Guided Tour" by G. Biau and E. Scornet

This paper is a comment on the survey paper by Biau and Scornet (2016) a...
research
04/29/2020

Asymptotic Properties of High-Dimensional Random Forests

As a flexible nonparametric learning tool, random forest has been widely...
research
03/01/2021

Counterfactual Explanations for Oblique Decision Trees: Exact, Efficient Algorithms

We consider counterfactual explanations, the problem of minimally adjust...
research
01/04/2013

The Sum-over-Forests density index: identifying dense regions in a graph

This work introduces a novel nonparametric density index defined on grap...
research
01/06/2018

Spatial Factor Models for High-Dimensional and Large Spatial Data: An Application in Forest Variable Mapping

Gathering information about forest variables is an expensive and arduous...
research
12/14/2022

Simplification of Forest Classifiers and Regressors

We study the problem of sharing as many branching conditions of a given ...
research
09/16/2023

Inverse classification with logistic and softmax classifiers: efficient optimization

In recent years, a certain type of problems have become of interest wher...

Please sign up or login with your details

Forgot password? Click here to reset