Clustering in Partially Labeled Stochastic Block Models via Total Variation Minimization

11/03/2019
by   Alexander Jung, et al.
6

A main task in data analysis is to organize data points into coherent groups or clusters. The stochastic block model is a probabilistic model for the cluster structure. This model prescribes different probabilities for the presence of edges within a cluster and between different clusters. We assume that the cluster assignments are known for at least one data point in each cluster. In such a partially labeled stochastic block model, clustering amounts to estimating the cluster assignments of the remaining data points. We study total variation minimization as a method for this clustering task. We implement the resulting clustering algorithm as a highly scalable message passing protocol. We also provide a condition on the model parameters such that total variation minimization allows for accurate clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2019

Semi-supervised Learning in Network-Structured Data via Total Variation Minimization

We propose and analyze a method for semi-supervised learning from partia...
research
03/26/2019

Classifying Partially Labeled Networked Data via Logistic Network Lasso

We apply the network Lasso to classify partially labeled data points whi...
research
01/23/2021

Unsupervised clustering of series using dynamic programming

We are interested in clustering parts of a given single multi-variate se...
research
11/04/2022

Model-based graph clustering of a collection of networks using an agglomerative algorithm

Graph clustering is the task of partitioning a collection of observed ne...
research
11/11/2022

Clustering with Total Variation Graph Neural Networks

Graph Neural Networks (GNNs) are deep learning models designed to proces...
research
05/19/2020

k-sums: another side of k-means

In this paper, the decades-old clustering method k-means is revisited. T...
research
06/09/2015

Clustering by transitive propagation

We present a global optimization algorithm for clustering data given the...

Please sign up or login with your details

Forgot password? Click here to reset