Seeing the Unseen Network: Inferring Hidden Social Ties from Respondent-Driven Sampling

11/13/2015
by   Lin Chen, et al.
0

Learning about the social structure of hidden and hard-to-reach populations --- such as drug users and sex workers --- is a major goal of epidemiological and public health research on risk behaviors and disease prevention. Respondent-driven sampling (RDS) is a peer-referral process widely used by many health organizations, where research subjects recruit other subjects from their social network. In such surveys, researchers observe who recruited whom, along with the time of recruitment and the total number of acquaintances (network degree) of respondents. However, due to privacy concerns, the identities of acquaintances are not disclosed. In this work, we show how to reconstruct the underlying network structure through which the subjects are recruited. We formulate the dynamics of RDS as a continuous-time diffusion process over the underlying graph and derive the likelihood for the recruitment time series under an arbitrary recruitment time distribution. We develop an efficient stochastic optimization algorithm called RENDER (REspoNdent-Driven nEtwork Reconstruction) that finds the network that best explains the collected data. We support our analytical results through an exhaustive set of experiments on both synthetic and real data.

READ FULL TEXT
research
08/08/2020

Clustering Network Tree Data From Respondent-driven sampling with application to opioid users in New York City

There is great interest in finding meaningful subgroups of attributed ne...
research
05/12/2014

Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm

Information spreads across social and technological networks, but often ...
research
01/18/2020

Inference for Network Structure and Dynamics from Time Series Data via Graph Neural Network

Network structures in various backgrounds play important roles in social...
research
04/12/2018

Fast approaches for Bayesian estimation of size of hard-to-reach populations using Network Scale-up

The Network scale-up method is commonly used to overcome difficulties in...
research
07/30/2019

Network Dependence and Confounding by Network Structure Lead to Invalid Inference

Researchers across the health and social sciences generally assume that ...
research
10/09/2017

Testing for Network Dependence in the Framingham Heart Study

Empirical research in public health and the social sciences often rely o...
research
12/01/2020

General Regression Methods for Respondent-Driven Sampling Data

Respondent-Driven Sampling (RDS) is a variant of link-tracing sampling t...

Please sign up or login with your details

Forgot password? Click here to reset