Testing Changes in Communities for the Stochastic Block Model

11/29/2018
by   Aditya Gangrade, et al.
0

We introduce the problems of goodness-of-fit and two-sample testing of the latent community structure in a 2-community, symmetric, stochastic block model (SBM), in the regime where recovery of the structure is difficult. The latter problem may be described as follows: let x,y be two latent community partitions. Given graphs G,H drawn according to SBMs with partitions x,y, respectively, we wish to test the hypothesis x = y against d(x,y) > s, for a given Hamming distortion parameter s ≪ n. Prior work showed that `partial' recovery of these partitions up to distortion s with vanishing error probability requires that the signal-to-noise ratio (SNR) is ≳ C (n/s). We prove by constructing simple schemes that if s ≫√(n n), then these testing problems can be solved even if SNR = O(1). For s = o(√(n)), and constant order degrees, we show via an information-theoretic lower bound that both testing problems require SNR = Ω((n)), and thus at this scale the naïve scheme of learning the communities and comparing them is minimax optimal up to constant factors. These results are augmented by simulations of goodness-of-fit and two-sample testing for standard SBMs as well as for Gaussian Markov random fields with underlying SBM structure.

READ FULL TEXT

page 12

page 14

page 32

page 35

page 36

research
04/09/2020

Inference in the Stochastic Block Model with a Markovian assignment of the communities

We tackle the community detection problem in the Stochastic Block Model ...
research
10/28/2017

Lower Bounds for Two-Sample Structural Change Detection in Ising and Gaussian Models

The change detection problem is to determine if the Markov network struc...
research
10/28/2020

Combinatorial-Probabilistic Trade-Off: Community Properties Test in the Stochastic Block Models

In this paper, we propose an inferential framework testing the general c...
research
11/06/2014

A Generic Sample Splitting Approach for Refined Community Recovery in Stochastic Block Models

We propose and analyze a generic method for community recovery in stocha...
research
09/19/2020

Estimating the number of communities by Stepwise Goodness-of-fit

Given a symmetric network with n nodes, how to estimate the number of co...
research
07/19/2018

Partial recovery bounds for clustering with the relaxed Kmeans

We investigate the clustering performances of the relaxed Kmeans in the ...
research
03/22/2016

Inference via Message Passing on Partially Labeled Stochastic Block Models

We study the community detection and recovery problem in partially-label...

Please sign up or login with your details

Forgot password? Click here to reset