An Automated Approach to Causal Inference in Discrete Settings

by Guilherme Duarte, et al.

When causal quantities cannot be point identified, researchers often pursue partial identification to quantify the range of possible values. However, the peculiarities of applied research conditions can make this analytically intractable. We present a general and automated approach to causal inference in discrete settings. We show that causal questions with discrete data reduce to polynomial programming problems, and we present an algorithm that automatically bounds causal effects using efficient dual relaxation and spatial branch-and-bound techniques. The user declares an estimand, states assumptions, and provides data (however incomplete or mismeasured). The algorithm then searches over admissible data-generating processes and outputs the most precise possible range consistent with available information – i.e., sharp bounds – including a point-identified solution if one exists. Because this search can be computationally intensive, our procedure reports and continually refines non-sharp ranges that are guaranteed to contain the truth at all times, even when the algorithm is not run to completion. Moreover, it offers an additional guarantee we refer to as ϵ-sharpness, characterizing the worst-case looseness of the incomplete bounds. Analytically validated simulations show that the algorithm accommodates classic obstacles, including confounding, selection, measurement error, noncompliance, and nonresponse.
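To make the idea of searching over admissible data-generating processes concrete, here is a minimal sketch for the simplest case: a binary treatment X and binary outcome Y with unrestricted confounding and no further assumptions. In this special case the program is linear, so the sharp bounds on the average treatment effect are attained at vertices, and enumerating those vertices suffices. This is not the authors' algorithm, which handles general polynomial programs through dual relaxation and spatial branch-and-bound. The function name `ate_bounds` and the input layout `p_obs` are hypothetical, chosen only for illustration.

```python
from itertools import product


def ate_bounds(p_obs):
    """Sharp no-assumption bounds on ATE = E[Y(1)] - E[Y(0)].

    p_obs maps (x, y) to P(X=x, Y=y) for binary x and y.
    Illustrative sketch only: hypothetical names, linear special case.
    """
    cells = [(0, 0), (0, 1), (1, 0), (1, 1)]
    if abs(sum(p_obs[c] for c in cells) - 1.0) > 1e-9:
        raise ValueError("observed probabilities must sum to 1")

    candidates = []
    # In each observed cell the factual potential outcome equals y, while the
    # counterfactual one is unobserved. The ATE is linear in how each cell's
    # mass is split between counterfactual values 0 and 1, so its extremes
    # occur when every cell puts all of its mass on a single value.
    for counterfactuals in product([0, 1], repeat=len(cells)):
        ate = 0.0
        for (x, y), y_cf in zip(cells, counterfactuals):
            # Treated units reveal Y(1); control units reveal Y(0).
            y1, y0 = (y, y_cf) if x == 1 else (y_cf, y)
            ate += p_obs[(x, y)] * (y1 - y0)
        candidates.append(ate)
    return min(candidates), max(candidates)


# Made-up observed distribution, for illustration only.
p = {(0, 0): 0.3, (0, 1): 0.2, (1, 0): 0.1, (1, 1): 0.4}
lower, upper = ate_bounds(p)
print(f"ATE lies in [{lower:.2f}, {upper:.2f}]")
```

Without assumptions, the resulting interval always has width 1, which is the classic motivation for adding assumptions (such as an instrument or monotonicity) as further constraints. Once constraints involve products of unknown parameters, the problem is no longer linear and vertex enumeration no longer applies.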




