Inference for Individual Mediation Effects and Interventional Effects in Sparse High-Dimensional Causal Graphical Models

09/27/2018
by Abhishek Chakrabortty, et al.

We consider the problem of identifying intermediate variables (or mediators) that regulate the effect of a treatment on a response variable. While there has been significant research on this topic, little work has been done when the set of potential mediators is high-dimensional and the mediators are interrelated. In particular, we assume that the causal structure of the treatment, the potential mediators and the response is a directed acyclic graph (DAG). High-dimensional DAG models have previously been used for the estimation of causal effects from observational data, and methods called IDA and joint-IDA have been developed for estimating the effects of single interventions and multiple simultaneous interventions, respectively. In this paper, we propose an IDA-type method called MIDA for estimating mediation effects from high-dimensional observational data. Although IDA and joint-IDA estimators have been shown to be consistent in certain sparse high-dimensional settings, their asymptotic properties, such as convergence in distribution, and inferential tools in such settings have remained unknown. We prove high-dimensional consistency of MIDA for linear structural equation models with sub-Gaussian errors. More importantly, we derive distributional convergence results for MIDA in similar high-dimensional settings, which are applicable to IDA and joint-IDA estimators as well. To the best of our knowledge, these are the first distributional convergence results facilitating inference for IDA-type estimators. These results build on our novel theoretical results regarding uniform bounds for linear regression estimators over varying subsets of high-dimensional covariates, which may be of independent interest. Finally, we empirically validate our asymptotic theory and demonstrate the usefulness of MIDA in identifying large mediation effects via simulations and an application to real data in genomics.
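
To make the idea of a mediation effect in a linear structural equation model concrete, here is a minimal sketch on a toy three-variable DAG (treatment X, one mediator M, response Y). Everything in it is an assumption made for illustration: the graph is taken as known, the coefficients 0.8, 0.5 and 0.3 are invented, the function names `ols`, `simulate` and `mediation_effect` are hypothetical, and the product-of-coefficients quantity is a textbook stand-in rather than the MIDA estimator itself. It does not attempt graph estimation or the high-dimensional setting the abstract addresses.

```python
import random


def ols(columns, y):
    """Least-squares coefficients (intercept first) via the normal equations."""
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    k = len(X[0])
    # Augmented system [X'X | X'y].
    A = [[sum(row[r] * row[c] for row in X) for c in range(k)]
         + [sum(row[r] * yi for row, yi in zip(X, y))] for r in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for i in range(k):
        p = max(range(i, k), key=lambda r: abs(A[r][i]))
        A[i], A[p] = A[p], A[i]
        for r in range(k):
            if r != i:
                f = A[r][i] / A[i][i]
                A[r] = [a - f * b for a, b in zip(A[r], A[i])]
    return [A[i][k] / A[i][i] for i in range(k)]


def simulate(n, a=0.8, b=0.5, c=0.3, seed=0):
    """Draw from the assumed toy linear SEM: X -> M -> Y and X -> Y."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    m = [a * xi + rng.gauss(0, 1) for xi in x]
    y = [b * mi + c * xi + rng.gauss(0, 1) for mi, xi in zip(m, x)]
    return x, m, y


def mediation_effect(x, m, y):
    """Product of (effect of X on M) and (effect of M on Y, adjusting for X)."""
    effect_x_on_m = ols([x], m)[1]      # X has no parents, so no adjustment set
    effect_m_on_y = ols([m, x], y)[1]   # adjust for X, the parent of M
    return effect_x_on_m * effect_m_on_y


x, m, y = simulate(20000)
estimate = mediation_effect(x, m, y)
print(round(estimate, 3))  # the assumed true value is 0.8 * 0.5 = 0.4
```

Each regression adjusts for the parents of the variable being intervened on, which is the covariate-adjustment idea behind IDA-type estimators when the graph is known. With many interrelated mediators and an unknown graph, the adjustment sets themselves have to be estimated.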
