Estimating Heterogeneous Causal Mediation Effects with Bayesian Decision Tree Ensembles
The causal inference literature has increasingly recognized that explicitly targeting treatment effect heterogeneity can lead to improved scientific understanding and policy recommendations. Towards the same ends, studying the causal pathway connecting the treatment to the outcome can be also useful. This paper addresses these problems in the context of causal mediation analysis. We introduce a varying coefficient model based on Bayesian additive regression trees to identify and regularize heterogeneous causal mediation effects; analogously with linear structural equation models, these effects correspond to covariate-dependent products of coefficients. We show that, even on large datasets with few covariates, LSEMs can produce highly unstable estimates of the conditional average direct and indirect effects, while our Bayesian causal mediation forests model produces estimates that are stable. We find that our approach is conservative, with effect estimates “shrunk towards homogeneity.” We examine the salient properties of our method using both data from the Medical Expenditure Panel Survey and empirically-grounded simulated data. Finally, we show how our model can be combined with posterior summarization strategies to identify interesting subgroups and interpret the model fit.
READ FULL TEXT