Network Dependence and Confounding by Network Structure Lead to Invalid Inference

07/30/2019
by   Youjin Lee, et al.
0

Researchers across the health and social sciences generally assume that observations are independent, even while relying on convenience samples that draw subjects from one or a small number of communities, schools, hospitals, etc. A paradigmatic example of this is the Framingham Heart Study (FHS). Many of the limitations of such samples are well-known, but the issue of statistical dependence due to social network ties has not previously been addressed. We show that, along with anticonservative variance estimation, this network dependence can result in confounding by network structure that biases associations away from the null. Using a statistical test that we adapted from one developed for spatial autocorrelation, we test for network dependence and for possible confounding by network structure in several of the thousands of influential papers published using FHS data. Results suggest that some of the many decades of research on coronary heart disease, other health outcomes, and peer influence using FHS data may be biased and anticonservative due to unacknowledged network dependence. We conclude that these issues are not unique to the FHS; as researchers in psychology, medicine, and beyond grapple with replication failures, this unacknowledged source of invalid statistical inference should be part of the conversation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2017

Testing for Network Dependence in the Framingham Heart Study

Empirical research in public health and the social sciences often rely o...
research
06/29/2019

Causal Inference Under Interference And Network Uncertainty

Classical causal and statistical inference methods typically assume the ...
research
06/22/2020

When social influence promotes the wisdom of crowds

Whether, and under what conditions, groups exhibit “crowd wisdom” has be...
research
02/04/2019

Identification and Estimation of Causal Effects from Dependent Data

The assumption that data samples are independent and identically distrib...
research
11/13/2015

Seeing the Unseen Network: Inferring Hidden Social Ties from Respondent-Driven Sampling

Learning about the social structure of hidden and hard-to-reach populati...
research
08/23/2019

Dyadic Regression

Dyadic data, where outcomes reflecting pairwise interaction among sample...
research
07/01/2020

Linear regression and its inference on noisy network-linked data

Linear regression on a set of observations linked by a network has been ...

Please sign up or login with your details

Forgot password? Click here to reset