Optimizing Generalized PageRank Methods for Seed-Expansion Community Detection

05/26/2019
by   Pan Li, et al.
0

Landing probabilities (LP) of random walks (RW) over graphs encode rich information regarding graph topology. Generalized PageRanks (GPR), which represent weighted sums of LPs of RWs, utilize the discriminative power of LP features to enable many graph-based learning studies. Previous work in the area has mostly focused on evaluating suitable weights for GPRs, and only a few studies so far have attempted to derive the optimal weights of GRPs for a given application. We take a fundamental step forward in this direction by using random graph models to better our understanding of the behavior of GPRs. In this context, we provide a rigorous non-asymptotic analysis for the convergence of LPs and GPRs to their mean-field values on edge-independent random graphs. Although our theoretical results apply to many problem settings, we focus on the task of seed-expansion community detection over stochastic block models. There, we find that the predictive power of LPs decreases significantly slower than previously reported based on asymptotic findings. Given this result, we propose a new GPR, termed Inverse PR (IPR), with LP weights that increase for the initial few steps of the walks. Extensive experiments on both synthetic and real, large-scale networks illustrate the superiority of IPR compared to other GPRs for seeded community detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2019

Efficient Distributed Community Detection in the Stochastic Block Model

Designing effective algorithms for community detection is an important a...
research
10/20/2019

Landing Probabilities of Random Walks for Seed-Set Expansion in Hypergraphs

We describe the first known mean-field study of landing probabilities fo...
research
09/09/2021

Popularity Adjusted Block Models are Generalized Random Dot Product Graphs

We connect two random graph models, the Popularity Adjusted Block Model ...
research
01/21/2021

Synwalk – Community Detection via Random Walk Modelling

Complex systems, abstractly represented as networks, are ubiquitous in e...
research
06/05/2020

Linear Programming and Community Detection

The problem of community detection with two equal-sized communities is c...
research
05/16/2019

When random initializations help: a study of variational inference for community detection

Variational approximation has been widely used in large-scale Bayesian i...
research
12/20/2021

Measuring Segregation via Analysis on Graphs

In this paper, we use analysis on graphs to study quantitative measures ...

Please sign up or login with your details

Forgot password? Click here to reset