Conceptualizing Treatment Leakage in Text-based Causal Inference

05/01/2022
by   Adel Daoud, et al.
0

Causal inference methods that control for text-based confounders are becoming increasingly important in the social sciences and other disciplines where text is readily available. However, these methods rely on a critical assumption that there is no treatment leakage: that is, the text only contains information about the confounder and no information about treatment assignment. When this assumption does not hold, methods that control for text to adjust for confounders face the problem of post-treatment (collider) bias. However, the assumption that there is no treatment leakage may be unrealistic in real-world situations involving text, as human language is rich and flexible. Language appearing in a public policy document or health records may refer to the future and the past simultaneously, and thereby reveal information about the treatment assignment. In this article, we define the treatment-leakage problem, and discuss the identification as well as the estimation challenges it raises. Second, we delineate the conditions under which leakage can be addressed by removing the treatment-related signal from the text in a pre-processing step we define as text distillation. Lastly, using simulation, we show how treatment leakage introduces a bias in estimates of the average treatment effect (ATE) and how text distillation can mitigate this bias.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2022

Causal inference from treatment-control studies having an additional factor with unknown assignment mechanism

Consider a situation with two treatments, the first of which is randomiz...
research
07/30/2019

Incremental causal effects

This is a draft. The ignorability assumption is a key assumption in caus...
research
01/03/2023

Continual Treatment Effect Estimation: Challenges and Opportunities

A further understanding of cause and effect within observational data is...
research
07/29/2021

Randomization does not imply unconfoundedness

A common assumption in causal inference is that random treatment assignm...
research
03/16/2017

Causal Inference through the Method of Direct Estimation

The intersection of causal inference and machine learning is a rapidly a...
research
09/10/2020

A note on post-treatment selection in studying racial discrimination in policing

We discuss some causal estimands used to study racial discrimination in ...
research
02/07/2022

Personalized Public Policy Analysis in Social Sciences using Causal-Graphical Normalizing Flows

Structural Equation/Causal Models (SEMs/SCMs) are widely used in epidemi...

Please sign up or login with your details

Forgot password? Click here to reset