On identifying unobserved heterogeneity in stochastic blockmodel graphs with vertex covariates

07/04/2020
by   Cong Mu, et al.
0

Both observed and unobserved vertex heterogeneity can influence block structure in graphs. To assess these effects on block recovery, we present a comparative analysis of two model-based spectral algorithms for clustering vertices in stochastic blockmodel graphs with vertex covariates. The first algorithm directly estimates the induced block assignments by investigating the estimated block connectivity probability matrix including the vertex covariate effect. The second algorithm estimates the vertex covariate effect and then estimates the induced block assignments after accounting for this effect. We employ Chernoff information to analytically compare the algorithms' performance and derive the Chernoff ratio formula for some special models of interest. Analytic results and simulations suggest that, in general, the second algorithm is preferred: we can better estimate the induced block assignments by first estimating the vertex covariate effect. In addition, real data experiments on a diffusion MRI connectome data set indicate that the second algorithm has the advantages of revealing underlying block structure and taking observed vertex heterogeneity into account in real applications. Our findings emphasize the importance of distinguishing between observed and unobserved factors that can affect block structure in graphs.

READ FULL TEXT

page 1

page 7

research
01/03/2018

Accounting for unobserved covariates with varying degrees of estimability in high dimensional experimental data

An important phenomenon in high dimensional biological data is the prese...
research
01/03/2018

Accounting for unobserved covariates with varying degrees of estimability in high dimensional biological data

An important phenomenon in high dimensional biological data is the prese...
research
08/18/2019

Spectral inference for large Stochastic Blockmodels with nodal covariates

In many applications of network analysis, it is important to distinguish...
research
07/10/2018

Pairwise Covariates-adjusted Block Model for Community Detection

One of the most fundamental problems in network study is community detec...
research
01/12/2023

A Generalized Estimating Equation Approach to Network Regression

Regression models applied to network data where node attributes are the ...
research
02/14/2018

Vertex nomination: The canonical sampling and the extended spectral nomination schemes

Suppose that one particular block in a stochastic block model is deemed ...
research
12/10/2013

Vertex nomination schemes for membership prediction

Suppose that a graph is realized from a stochastic block model where one...

Please sign up or login with your details

Forgot password? Click here to reset