Tractably Modelling Dependence in Networks Beyond Exchangeability

07/28/2020
by Weichi Wu, et al.

We propose a general framework for modelling network data that is designed to describe aspects of non-exchangeable networks. Conditional on latent (unobserved) variables, the edges of the network are generated by their finite growth history (with latent orders), while the marginal probabilities of the adjacency matrix are modelled by a generalization of a graph limit function (or graphon). In particular, we study estimation, clustering and degree behavior of the network in our setting. We (i) determine the minimax estimator of a composite graphon with respect to squared error loss; (ii) show that, under additional conditions, spectral clustering consistently detects the latent membership when the composite graphon is block-wise constant; and (iii) construct models with heavy-tailed empirical degrees under specific scenarios and parameter choices. These results explore why, and under which general conditions, non-exchangeable network data can be described by a stochastic block model. The new modelling framework is able to capture empirically important characteristics of network data, such as sparsity combined with a heavy-tailed degree distribution, and adds understanding as to which generative mechanisms make them arise.

Keywords: statistical network analysis, exchangeable arrays, stochastic block model, nonlinear stochastic processes.
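As a rough illustration of the spectral clustering result mentioned in the abstract, the sketch below samples a plain two-block stochastic block model and recovers the block labels from the adjacency matrix. It is a minimal sketch under stated assumptions, not the paper's method: the model here is an ordinary exchangeable SBM rather than the composite graphon framework, and the function names (`sample_sbm`, `spectral_two_blocks`), the block probabilities and the network size are all hypothetical choices made for illustration.

```python
import numpy as np


def sample_sbm(labels, B, rng):
    """Sample a symmetric, loop-free adjacency matrix from a stochastic block model.

    labels: integer block label per node; B: block-by-block edge probabilities.
    (Illustrative helper, not taken from the paper.)
    """
    n = len(labels)
    P = B[np.ix_(labels, labels)]                  # per-pair edge probabilities
    upper = np.triu(rng.random((n, n)) < P, k=1)   # independent edges above the diagonal
    return (upper | upper.T).astype(float)         # symmetrize; diagonal stays zero


def spectral_two_blocks(A):
    """Split nodes into two groups by the sign of the second leading eigenvector."""
    vals, vecs = np.linalg.eigh(A)                 # A is symmetric, so eigh applies
    order = np.argsort(-np.abs(vals))              # sort eigenvalues by magnitude
    v = vecs[:, order[1]]                          # eigenvector of second-largest magnitude
    return (v > 0).astype(int)


rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 100)                    # assumed: two blocks of 100 nodes each
B = np.array([[0.5, 0.1],
              [0.1, 0.5]])                         # assumed: dense within, sparse between
A = sample_sbm(labels, B, rng)
est = spectral_two_blocks(A)

# Labels are identifiable only up to swapping the two blocks.
agreement = max(np.mean(est == labels), np.mean(est != labels))
print(f"fraction of nodes correctly clustered: {agreement:.2f}")
```

With more than two blocks, the sign split would typically be replaced by k-means on the leading eigenvectors; the two-block case is used here only to keep the sketch dependency-free.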

research
12/07/2013

Consistency of spectral clustering in stochastic block models

We analyze the performance of spectral clustering for community extracti...
research
05/31/2012

Oriented and Degree-generated Block Models: Generating and Inferring Communities with Inhomogeneous Degree Distributions

The stochastic block model is a powerful tool for inferring community st...
research
07/30/2021

Impact of regularization on spectral clustering under the mixed membership stochastic block model

Mixed membership community detection is a challenge problem in network a...
research
07/16/2020

Understanding Implicit Regularization in Over-Parameterized Nonlinear Statistical Model

We study the implicit regularization phenomenon induced by simple optimi...
research
03/22/2022

A new class of composite GBII regression models with varying threshold for modelling heavy-tailed data

The four-parameter generalized beta distribution of the second kind (GBI...
research
02/07/2020

Statistical Inference in Heterogeneous Block Model

There exist various types of network block models such as the Stochastic...
research
09/19/2022

A Dynamic Stochastic Block Model for Multi-Layer Networks

We propose a flexible stochastic block model for multi-layer networks, w...
