Percolated stochastic block model via EM algorithm and belief propagation with non-backtracking spectra
Whereas Laplacian and modularity based spectral clustering is apt to dense graphs, recent results show that for sparse ones, the non-backtracking spectrum is the best candidate to find assortative clusters of nodes. Here belief propagation in the sparse stochastic block model is derived with arbitrary given model parameters that results in a non-linear system of equations; with linear approximation, the spectrum of the non-backtracking matrix is able to specify the number k of clusters. Then the model parameters themselves can be estimated by the EM algorithm. Bond percolation in the assortative model is considered in the following two senses: the within- and between-cluster edge probabilities decrease with the number of nodes and edges coming into existence in this way are retained with probability β. As a consequence, the optimal k is the number of the structural real eigenvalues (greater than √(c), where c is the average degree) of the non-backtracking matrix of the graph. Assuming, these eigenvalues μ_1 >… > μ_k are distinct, the multiple phase transitions obtained for β are β_i =c/μ_i^2; further, at β_i the number of detectable clusters is i, for i=1,… ,k. Inflation-deflation techniques are also discussed to classify the nodes themselves, which can be the base of the sparse spectral clustering.
READ FULL TEXT