Inferring community characteristics in labelled networks

05/28/2021
by   Ioannis Kontoyiannis, et al.
0

Labelled networks form a very common and important class of data, naturally appearing in numerous applications in science and engineering. A typical inference goal is to determine how the vertex labels(or features) affect the network's graph structure. A standard approach has been to partition the network into blocks grouped by distinct values of the feature of interest. A block-based random graph model – typically a variant of the stochastic block model – is then used to test for evidence of asymmetric behaviour within these feature-based communities. Nevertheless, the resulting communities often do not produce a natural partition of the graph. In this work, we introduce a new generative model, the feature-first block model (FFBM), which is more effective at describing vertex-labelled undirected graphs and also facilitates the use of richer queries on labelled networks. We develop a Bayesian framework for inference with this model, and we present a method to efficiently sample from the posterior distribution of the FFBM parameters. The FFBM's structure is kept deliberately simple to retain easy interpretability of the parameter values. We apply the proposed methods to a variety of network data to extract the most important features along which the vertices are partitioned. The main advantages of the proposed approach are that the whole feature-space is used automatically, and features can be rank-ordered implicitly according to impact. Any features that do not significantly impact the high-level structure can be discarded to reduce the problem dimension. In cases where the vertex features available do not readily explain the community structure in the resulting network, the approach detects this and is protected against over-fitting. Results on several real-world datasets illustrate the performance of the proposed methods.

READ FULL TEXT
research
05/31/2012

Oriented and Degree-generated Block Models: Generating and Inferring Communities with Inhomogeneous Degree Distributions

The stochastic block model is a powerful tool for inferring community st...
research
02/05/2016

Exchangeable Random Measures for Sparse and Modular Graphs with Overlapping Communities

We propose a novel statistical model for sparse networks with overlappin...
research
06/28/2018

Graphs without 2-community structures

In the context of community structure detection, we study the existence ...
research
01/20/2022

Classic Graph Structural Features Outperform Factorization-Based Graph Embedding Methods on Community Labeling

Graph representation learning (also called graph embeddings) is a popula...
research
08/10/2015

Local Algorithms for Block Models with Side Information

There has been a recent interest in understanding the power of local alg...
research
05/18/2021

A sparse stochastic block model with two unequal communities

We show posterior convergence for the community structure in the planted...
research
04/15/2019

Modeling Network Populations via Graph Distances

This article introduces a new class of models for multiple networks. The...

Please sign up or login with your details

Forgot password? Click here to reset