Variable importance for causal forests: breaking down the heterogeneity of treatment effects

08/07/2023
by Clément Bénard, et al.

Causal random forests provide efficient estimates of heterogeneous treatment effects. However, forest algorithms are also well known for their black-box nature and therefore do not characterize how input variables are involved in treatment effect heterogeneity, which is a strong practical limitation. In this article, we develop a new variable importance algorithm for causal forests to quantify the impact of each input on the heterogeneity of treatment effects. The proposed approach is inspired by the drop-and-relearn principle, widely used for regression problems. Importantly, we show how to retrain the forest without a confounding variable. If the confounder is not involved in the treatment effect heterogeneity, the local centering step enforces consistency of the importance measure. Otherwise, when a confounder also impacts heterogeneity, we introduce a corrective term in the retrained causal forest to recover consistency. Additionally, experiments on simulated, semi-synthetic, and real data show the good performance of our importance measure, which outperforms competitors on several test cases. Experiments also show that our approach can be efficiently extended to groups of variables, providing key insights in practice.
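The drop-and-relearn principle mentioned in the abstract can be illustrated in its plain regression form. The sketch below is not the causal forest algorithm of the article: it has no treatment, no local centering step, and no corrective term. As a simplifying assumption it uses an ordinary least-squares fit as a stand-in learner instead of a forest, and all function names and the simulated data are hypothetical. Each variable's importance is taken as the increase in test error when the model is retrained without that variable, normalized by the variance of the response.

```python
import random


def fit_ols(X, y):
    """Least-squares fit with intercept (stand-in learner, not a forest)."""
    rows = [[1.0] + list(r) for r in X]
    p = len(rows[0])
    # Augmented normal equations [A^T A | A^T y].
    A = [
        [sum(r[i] * r[j] for r in rows) for j in range(p)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]


def mse(beta, X, y):
    """Mean squared prediction error of the fitted coefficients."""
    preds = [beta[0] + sum(b * x for b, x in zip(beta[1:], r)) for r in X]
    return sum((p - t) ** 2 for p, t in zip(preds, y)) / len(y)


def drop_column(X, j):
    """Return a copy of X without column j."""
    return [[v for k, v in enumerate(r) if k != j] for r in X]


def drop_and_relearn_importance(X_train, y_train, X_test, y_test):
    """Importance of variable j = (error without j - full error) / Var(y)."""
    full_err = mse(fit_ols(X_train, y_train), X_test, y_test)
    mean_y = sum(y_test) / len(y_test)
    var_y = sum((t - mean_y) ** 2 for t in y_test) / len(y_test)
    scores = []
    for j in range(len(X_train[0])):
        # Drop variable j, relearn the model, and measure the error again.
        beta_j = fit_ols(drop_column(X_train, j), y_train)
        err_j = mse(beta_j, drop_column(X_test, j), y_test)
        scores.append((err_j - full_err) / var_y)
    return scores


# Hypothetical simulated data: y depends strongly on x0, weakly on x1,
# and not at all on x2.
rng = random.Random(0)


def simulate(n):
    X = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(n)]
    y = [3.0 * r[0] + 1.0 * r[1] + rng.gauss(0, 0.5) for r in X]
    return X, y


X_train, y_train = simulate(400)
X_test, y_test = simulate(400)
importances = drop_and_relearn_importance(X_train, y_train, X_test, y_test)
print(importances)
```

In this toy setting the three variables are independent, so retraining without one of them simply loses its contribution. The article addresses the harder causal case, where the dropped variable may be a confounder.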


