A Bayesian Framework for Community Detection Integrating Content and Link

05/09/2012
by   Tianbao Yang, et al.
0

This paper addresses the problem of community detection in networked data that combines link and content analysis. Most existing work combines link and content information by a generative model. There are two major shortcomings with the existing approaches. First, they assume that the probability of creating a link between two nodes is determined only by the community memberships of the nodes; however other factors (e.g. popularity) could also affect the link pattern. Second, they use generative models to model the content of individual nodes, whereas these generative models are vulnerable to the content attributes that are irrelevant to communities. We propose a Bayesian framework for combining link and content information for community detection that explicitly addresses these shortcomings. A new link model is presented that introduces a random variable to capture the node popularity when deciding the link between two nodes; a discriminative model is used to determine the community membership of a node by its content. An approximate inference algorithm is presented for efficient Bayesian inference. Our empirical study shows that the proposed framework outperforms several state-of-theart approaches in combining link and content information for community detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/18/2019

vGraph: A Generative Model for Joint Community Detection and Node Representation Learning

This paper focuses on two fundamental tasks of graph analysis: community...
research
01/13/2021

Overlapping Community Detection in Temporal Text Networks

Analyzing the groups in the network based on same attributes, functions ...
research
07/16/2020

GRADE: Graph Dynamic Embedding

Representation learning of static and more recently dynamically evolving...
research
03/04/2022

Bayesian community detection for networks with covariates

The increasing prevalence of network data in a vast variety of fields an...
research
07/19/2016

Combining Random Walks and Nonparametric Bayesian Topic Model for Community Detection

Community detection has been an active research area for decades. Among ...
research
04/24/2018

Block-Structure Based Time-Series Models For Graph Sequences

Although the computational and statistical trade-off for modeling single...
research
03/06/2020

Novel Edge and Density Metrics for Link Cohesion

We present a new metric of link cohesion for measuring the strength of e...

Please sign up or login with your details

Forgot password? Click here to reset