A Tale of Two Datasets: Representativeness and Generalisability of Inference for Samples of Networks

by Pavel N. Krivitsky, et al.

The last two decades have witnessed considerable progress on foundational aspects of statistical network analysis, but less attention has been paid to the complex statistical issues arising in real-world applications. Here, we consider two samples of within-household contact networks in Belgium generated by different but complementary sampling designs: one smaller but with all contacts in each household observed, the other larger and more representative but recording contacts of only one person per household. We wish to combine their strengths to learn the social forces that shape household contact formation and facilitate simulation for prediction of disease spread, while generalising to the population of households in the region. To accomplish this, we introduce a flexible framework for specifying multi-network models in the exponential family class and identify the requirements for inference and prediction under this framework to be consistent, identifiable, and generalisable, even when data are incomplete; explore how these requirements may be violated in practice; and develop a suite of quantitative and graphical diagnostics for detecting violations and suggesting improvements to a candidate model. We report on the effects of network size, geography, and household roles on household contact patterns (activity, heterogeneity in activity, and triadic closure).




