Educational Note: Paradoxical Collider Effect in the Analysis of Non-Communicable Disease Epidemiological Data: a reproducible illustration and web application

Classical epidemiology has focused on the control of confounding but it is only recently that epidemiologists have started to focus on the bias produced by colliders. A collider for a certain pair of variables (e.g., an outcome Y and an exposure A) is a third variable (C) that is caused by both. In DAGs terminology, a collider is the variable in the middle of an inverted fork (i.e., the variable C in A -> C <- Y). Controlling for, or conditioning an analysis on a collider (i.e., through stratification or regression) can introduce a spurious association between its causes. This potentially explains many paradoxical findings in the medical literature, where established risk factors for a particular outcome appear protective. We used an example from non-communicable disease epidemiology to contextualize and explain the effect of conditioning on a collider. We generated a dataset with 1,000 observations and ran Monte-Carlo simulations to estimate the effect of 24-hour dietary sodium intake on systolic blood pressure, controlling for age, which acts as a confounder, and 24-hour urinary protein excretion, which acts as a collider. We illustrate how adding a collider to a regression model introduces bias. Thus, to prevent paradoxical associations, epidemiologists estimating causal effects should be wary of conditioning on colliders. We provide R-code in easy-to-read boxes throughout the manuscript and a GitHub repository (https://github.com/migariane/ColliderApp) for the reader to reproduce our example. We also provide an educational web application allowing real-time interaction to visualize the paradoxical effect of conditioning on a collider http://watzilei.com/shiny/collider/.

READ FULL TEXT

page 2

page 4

page 5

page 7

page 8

page 11

page 15

page 16

research
04/21/2018

On Associative Confounder Bias

Conditioning on some set of confounders that causally affect both treatm...
research
08/01/2023

Relationship between Collider Bias and Interactions on the Log-Additive Scale

Collider bias occurs when conditioning on a common effect (collider) of ...
research
03/19/2019

Semiparametric Methods for Exposure Misclassification in Propensity Score-Based Time-to-Event Data Analysis

In epidemiology, identifying the effect of exposure variables in relatio...
research
09/19/2022

Inference of nonlinear causal effects with GWAS summary data

Large-scale genome-wide association studies (GWAS) have offered an excit...
research
11/11/2020

A Framework for Mediation Analysis with Multiple Exposures, Multivariate Mediators, and Non-Linear Response Models

Mediation analysis seeks to identify and quantify the paths by which an ...
research
11/10/2014

Bounding the Probability of Causation in Mediation Analysis

Given empirical evidence for the dependence of an outcome variable on an...

Please sign up or login with your details

Forgot password? Click here to reset