A discriminative approach for finding and characterizing positivity violations using decision trees

07/18/2019
by Ehud Karavani, et al.

The positivity assumption in causal inference (also known as common support or covariate overlap) is necessary to obtain valid causal estimates. Confirming that it holds in a given dataset is therefore an important first step of any causal analysis. The most common methods to date are insufficient for discovering non-positivity: they either do not scale to modern high-dimensional covariate spaces or cannot pinpoint the subpopulation violating positivity. To overcome these issues, we propose using decision trees to detect violations. Because a decision tree divides the covariate space into mutually exclusive regions, each with maximized homogeneity of treatment groups, it can automatically detect subspaces that violate positivity. By augmenting the method with an additional random forest model, we can quantify the robustness of the violation within each subspace. This solution is scalable and provides an interpretable characterization of the subspaces in which violations occur. We provide a visualization of the stratification rules that define each subpopulation, combined with the severity of the positivity violation within it. We also provide an interactive version of the visualization that allows a deeper dive into the properties of each subspace.
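The core idea of the abstract, fitting a tree that predicts treatment from covariates and flagging leaves whose treated fraction is close to 0 or 1, can be illustrated with a minimal sketch. This is not the authors' implementation: the greedy Gini-based tree, the function names (`gini`, `best_split`, `find_violations`), the thresholds (`max_depth`, `min_leaf`, `eps`), and the synthetic dataset are all assumptions made for illustration, and the random forest robustness step is omitted.

```python
def gini(labels):
    """Gini impurity of a list of binary treatment labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def best_split(X, a, min_leaf):
    """Return the (feature, threshold) that most reduces weighted Gini, or None."""
    n = len(a)
    best, best_score = None, gini(a)
    for j in range(len(X[0])):
        values = sorted({row[j] for row in X})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0  # candidate threshold between adjacent values
            left = [a[i] for i in range(n) if X[i][j] <= t]
            right = [a[i] for i in range(n) if X[i][j] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score - 1e-12:  # require a strict improvement
                best, best_score = (j, t), score
    return best


def find_violations(X, a, rules=(), depth=0, max_depth=3, min_leaf=20, eps=0.05):
    """Partition the covariate space and return leaves that look non-positive.

    Each result is (rules, n_units, treated_fraction), where rules is a tuple
    of (feature_index, operator, threshold) describing the leaf.
    """
    split = best_split(X, a, min_leaf) if depth < max_depth else None
    if split is None:
        p = sum(a) / len(a)
        flagged = p <= eps or p >= 1.0 - eps  # almost nobody / everybody treated
        return [(rules, len(a), p)] if flagged else []
    j, t = split
    found = []
    for op, keep in (("<=", lambda v: v <= t), (">", lambda v: v > t)):
        idx = [i for i in range(len(a)) if keep(X[i][j])]
        found += find_violations([X[i] for i in idx], [a[i] for i in idx],
                                 rules + ((j, op, t),), depth + 1,
                                 max_depth, min_leaf, eps)
    return found


# Hypothetical synthetic data: covariates are (age, sex); treatment is balanced
# everywhere except for units with age >= 65 and sex == 1, who are never treated.
X, a = [], []
for age in range(20, 80):
    for sex in (0, 1):
        for rep in range(4):
            X.append((age, sex))
            a.append(0 if (age >= 65 and sex == 1) else rep % 2)

violations = find_violations(X, a)
for rules, n, p in violations:
    print(rules, n, p)
```

On this toy dataset the sketch flags a single leaf, defined by age > 64.5 and sex > 0.5, containing 60 units of which none are treated. The rule path leading to the leaf plays the role of the interpretable characterization described above, and the treated fraction is a crude stand-in for the severity of the violation.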


