Cross-Validated Decision Trees with Targeted Maximum Likelihood Estimation for Nonparametric Causal Mixtures Analysis

02/15/2023
by   David McCoy, et al.

Exposure to mixtures of chemicals, such as drugs, pollutants, and nutrients, is common in real-world exposure and treatment scenarios. To understand the impact of these exposures on health outcomes, an interpretable and important approach is to estimate the causal effect of the exposure regions that are most associated with a health outcome. This requires a statistical estimator that can both identify these exposure regions and provide an unbiased estimate of a causal target parameter given the region. In this work, we present a methodology that uses decision trees to determine exposure regions data-adaptively and employs cross-validated targeted maximum likelihood estimation to obtain an unbiased estimate of the average regional-exposure effect (ARE). The result is a plug-in estimator with an asymptotically normal distribution and minimum variance, from which confidence intervals can be derived. The methodology is implemented in CVtreeMLE, an open-source R package. Analysts supply a vector of exposures, covariates, and an outcome, and the package returns tables of exposure regions, such as lead > 2.1 and arsenic > 1.4, each with an associated ARE. The ARE represents the difference in mean outcome if all individuals were exposed to the region compared with if none were exposed to it. CVtreeMLE enables researchers to discover interpretable exposure regions in mixed-exposure scenarios and provides robust statistical inference for the impact of these regions. The resulting quantities offer interpretable thresholds that can inform public health policies, such as pollutant regulations, or aid medical decision-making, such as identifying the most effective drug combinations.
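To make the ARE concrete, the following is a minimal sketch in Python, not the CVtreeMLE implementation or its API. It assumes the exposure region (lead > 2.1 and arsenic > 1.4, borrowed from the example above) is already known rather than discovered by decision trees, and it uses a simple regression-based plug-in (g-computation) estimate in place of cross-validated targeted maximum likelihood estimation. The simulated data, the true effect of 2.0, and the function names simulate and plug_in_are are all hypothetical choices made for illustration.

```python
import numpy as np


def simulate(n=5000, seed=0):
    """Simulate a covariate, two exposures, and an outcome (all hypothetical)."""
    rng = np.random.default_rng(seed)
    w = rng.normal(size=n)  # baseline covariate that confounds exposure and outcome
    lead = np.exp(0.5 + 0.3 * w + 0.5 * rng.normal(size=n))
    arsenic = np.exp(0.2 + 0.3 * w + 0.5 * rng.normal(size=n))
    in_region = ((lead > 2.1) & (arsenic > 1.4)).astype(float)
    # Being inside the region shifts the mean outcome by 2.0 (the true ARE here).
    y = 1.0 + 2.0 * in_region + 0.5 * w + rng.normal(size=n)
    return w, lead, arsenic, y


def plug_in_are(w, in_region, y):
    """Plug-in ARE: mean predicted outcome with everyone inside the region
    minus mean predicted outcome with no one inside it."""

    def design(a):
        # Outcome model: intercept, region indicator, covariate, interaction.
        return np.column_stack([np.ones_like(w), a, w, a * w])

    beta, *_ = np.linalg.lstsq(design(in_region), y, rcond=None)
    q1 = design(np.ones_like(w)) @ beta   # predictions if all were exposed
    q0 = design(np.zeros_like(w)) @ beta  # predictions if none were exposed
    return float(np.mean(q1 - q0))


w, lead, arsenic, y = simulate()
in_region = ((lead > 2.1) & (arsenic > 1.4)).astype(float)
are_hat = plug_in_are(w, in_region, y)
print(f"Estimated ARE: {are_hat:.2f} (simulated truth: 2.00)")
```

In this sketch the outcome regression is correctly specified by construction, so the plug-in estimate alone recovers the simulated effect. The methodology described above differs in two important ways: the region is learned from the data by decision trees, and the estimate is obtained with cross-validated targeted maximum likelihood estimation, which is what supports the asymptotic normality and confidence intervals the abstract refers to.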


