Resolution-limit-free and local Non-negative Matrix Factorization quality functions for graph clustering

07/22/2014
by   Twan van Laarhoven, et al.
0

Many graph clustering quality functions suffer from a resolution limit, the inability to find small clusters in large graphs. So called resolution-limit-free quality functions do not have this limit. This property was previously introduced for hard clustering, that is, graph partitioning. We investigate the resolution-limit-free property in the context of Non-negative Matrix Factorization (NMF) for hard and soft graph clustering. To use NMF in the hard clustering setting, a common approach is to assign each node to its highest membership cluster. We show that in this case symmetric NMF is not resolution-limit-free, but that it becomes so when hardness constraints are used as part of the optimization. The resulting function is strongly linked to the Constant Potts Model. In soft clustering, nodes can belong to more than one cluster, with varying degrees of membership. In this setting resolution-limit-free turns out to be too strong a property. Therefore we introduce locality, which roughly states that changing one part of the graph does not affect the clustering of other parts of the graph. We argue that this is a desirable property, provide conditions under which NMF quality functions are local, and propose a novel class of local probabilistic NMF quality functions for soft graph clustering.

READ FULL TEXT
research
06/09/2022

Applying separative non-negative matrix factorization to extra-financial data

We present here an original application of the non-negative matrix facto...
research
09/15/2016

Context Aware Nonnegative Matrix Factorization Clustering

In this article we propose a method to refine the clustering results obt...
research
07/27/2019

Alternative Blockmodelling

Many approaches have been proposed to discover clusters within networks....
research
08/12/2023

Latent Random Steps as Relaxations of Max-Cut, Min-Cut, and More

Algorithms for node clustering typically focus on finding homophilous st...
research
08/15/2013

Axioms for graph clustering quality functions

We investigate properties that intuitively ought to be satisfied by grap...
research
05/26/2022

SymNMF-Net for The Symmetric NMF Problem

Recently, many works have demonstrated that Symmetric Non-negative Matri...
research
09/07/2020

Improving Problem Identification via Automated Log Clustering using Dimensionality Reduction

Goal: We consider the problem of automatically grouping logs of runs tha...

Please sign up or login with your details

Forgot password? Click here to reset