Latent Random Steps as Relaxations of Max-Cut, Min-Cut, and More

08/12/2023
by   Sudhanshu Chanpuriya, et al.
0

Algorithms for node clustering typically focus on finding homophilous structure in graphs. That is, they find sets of similar nodes with many edges within, rather than across, the clusters. However, graphs often also exhibit heterophilous structure, as exemplified by (nearly) bipartite and tripartite graphs, where most edges occur across the clusters. Grappling with such structure is typically left to the task of graph simplification. We present a probabilistic model based on non-negative matrix factorization which unifies clustering and simplification, and provides a framework for modeling arbitrary graph structure. Our model is based on factorizing the process of taking a random walk on the graph. It permits an unconstrained parametrization, allowing for optimization via simple gradient descent. By relaxing the hard clustering to a soft clustering, our algorithm relaxes potentially hard clustering problems to a tractable ones. We illustrate our algorithm's capabilities on a synthetic graph, as well as simple unsupervised learning tasks involving bipartite and tripartite clustering of orthographic and phonological data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2020

Biclustering and Boolean Matrix Factorization in Data Streams

We study the clustering of bipartite graphs and Boolean matrix factoriza...
research
03/09/2016

Bipartite Correlation Clustering -- Maximizing Agreements

In Bipartite Correlation Clustering (BCC) we are given a complete bipart...
research
07/22/2014

Resolution-limit-free and local Non-negative Matrix Factorization quality functions for graph clustering

Many graph clustering quality functions suffer from a resolution limit, ...
research
05/12/2023

One-step Bipartite Graph Cut: A Normalized Formulation and Its Application to Scalable Subspace Clustering

The bipartite graph structure has shown its promising ability in facilit...
research
10/26/2017

Simple Distributed Graph Clustering using Modularity and Map Equation

We study large-scale, distributed graph clustering. Given an undirected,...
research
07/01/2020

A simple bipartite graph projection model for clustering in networks

Graph datasets are frequently constructed by a projection of a bipartite...
research
04/27/2020

Hierarchical clustering of bipartite data sets based on the statistical significance of coincidences

When a set 'entities' are related by the 'features' they share they are ...

Please sign up or login with your details

Forgot password? Click here to reset