Learning Resolution Parameters for Graph Clustering

03/12/2019
by   Nate Veldt, et al.
0

Finding clusters of well-connected nodes in a graph is an extensively studied problem in graph-based data analysis. Because of its many applications, a large number of distinct graph clustering objective functions and algorithms have already been proposed and analyzed. To aid practitioners in determining the best clustering approach to use in different applications, we present new techniques for automatically learning how to set clustering resolution parameters. These parameters control the size and structure of communities that are formed by optimizing a generalized objective function. We begin by formalizing the notion of a parameter fitness function, which measures how well a fixed input clustering approximately solves a generalized clustering objective for a specific resolution parameter value. Under reasonable assumptions, which suit two key graph clustering applications, such a parameter fitness function can be efficiently minimized using a bisection-like method, yielding a resolution parameter that fits well with the example clustering. We view our framework as a type of single-shot hyperparameter tuning, as we are able to learn a good resolution parameter with just a single example. Our general approach can be applied to learn resolution parameters for both local and global graph clustering objectives. We demonstrate its utility in several experiments on real-world data where it is helpful to learn resolution parameters from a given example clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2017

Unifying Sparsest Cut, Cluster Deletion, and Modularity Clustering Objectives with Correlation Clustering

We present and analyze a new framework for graph clustering based on a s...
research
10/14/2019

Graph Clustering in All Parameter Regimes

Resolution parameters in graph clustering represent a size and quality t...
research
03/02/2019

Lexicographically Ordered Multi-Objective Clustering

We introduce a rich model for multi-objective clustering with lexicograp...
research
12/29/2022

On Learning the Structure of Clusters in Graphs

Graph clustering is a fundamental problem in unsupervised learning, with...
research
12/12/2021

Graph-based hierarchical record clustering for unsupervised entity resolution

Here we study the problem of matched record clustering in unsupervised e...
research
02/20/2018

Memetic Graph Clustering

It is common knowledge that there is no single best strategy for graph c...

Please sign up or login with your details

Forgot password? Click here to reset