The blessing of transitivity in sparse and stochastic networks

07/08/2013
by   Karl Rohe, et al.
0

The interaction between transitivity and sparsity, two common features in empirical networks, implies that there are local regions of large sparse networks that are dense. We call this the blessing of transitivity and it has consequences for both modeling and inference. Extant research suggests that statistical inference for the Stochastic Blockmodel is more difficult when the edges are sparse. However, this conclusion is confounded by the fact that the asymptotic limit in all of the previous studies is not merely sparse, but also non-transitive. To retain transitivity, the blocks cannot grow faster than the expected degree. Thus, in sparse models, the blocks must remain asymptotically small. Previous algorithmic research demonstrates that small "local" clusters are more amenable to computation, visualization, and interpretation when compared to "global" graph partitions. This paper provides the first statistical results that demonstrate how these small transitive clusters are also more amenable to statistical estimation. Theorem 2 shows that a "local" clustering algorithm can, with high probability, detect a transitive stochastic block of a fixed size (e.g. 30 nodes) embedded in a large graph. The only constraint on the ambient graph is that it is large and sparse--it could be generated at random or by an adversary--suggesting a theoretical explanation for the robust empirical performance of local clustering algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2017

Fast and reliable inference algorithm for hierarchical stochastic block models

Network clustering reveals the organization of a network or correspondin...
research
05/21/2020

Computationally efficient sparse clustering

We study statistical and computational limits of clustering when the mea...
research
09/08/2016

Non-Backtracking Spectrum of Degree-Corrected Stochastic Block Models

Motivated by community detection, we characterise the spectrum of the no...
research
02/02/2023

High-dimensional variable clustering based on sub-asymptotic maxima of a weakly dependent random process

We propose a new class of models for variable clustering called Asymptot...
research
02/06/2014

Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices

We consider two closely related problems: planted clustering and submatr...
research
07/09/2018

Sampling and Inference for Beta Neutral-to-the-Left Models of Sparse Networks

Empirical evidence suggests that heavy-tailed degree distributions occur...
research
01/30/2018

A Dynamic Process Interpretation of the Sparse ERGM Reference Model

Exponential family random graph models (ERGMs) can be understood in term...

Please sign up or login with your details

Forgot password? Click here to reset