The Automatic Quasi-clique Merger algorithm (AQCM)

03/06/2021
by   Scott Payne, et al.
0

The Automatic Quasi-clique Merger algorithm is a new algorithm adapted from early work published under the name QCM (quasi-clique merger) [Ou2006, Ou2007, Zhao2011, Qi2014]. The AQCM algorithm performs hierarchical clustering in any data set for which there is an associated similarity measure quantifying the similarity of any data i and data j. Importantly, the method exhibits two valuable performance properties: 1) the ability to automatically return either a larger or smaller number of clusters depending on the inherent properties of the data rather than on a parameter 2) the ability to return a very large number of relatively small clusters automatically when such clusters are reasonably well defined in a data set. In this work we present the general idea of a quasi-clique agglomerative approach, provide the full details of the mathematical steps of the AQCM algorithm, and explain some of the motivation behind the new methodology. The main achievement of the new methodology is that the agglomerative process now unfolds adaptively according to the inherent structure unique to a given data set, and this happens without the time-costly parameter adjustment that drove the previous QCM algorithm. For this reason we call the new algorithm automatic. We provide a demonstration of the algorithm's performance at the task of community detection in a social media network of 22,900 nodes.

READ FULL TEXT

page 9

page 24

page 27

research
02/08/2012

Automatic Clustering with Single Optimal Solution

Determining optimal number of clusters in a dataset is a challenging tas...
research
01/19/2020

The Power of Pivoting for Exact Clique Counting

Clique counting is a fundamental task in network analysis, and even the ...
research
08/28/2018

Enumerating Top-k Quasi-Cliques

Quasi-cliques are dense incomplete subgraphs of a graph that generalize ...
research
09/20/2021

Parallel Algorithms for Finding Large Cliques in Sparse Graphs

We present a parallel k-clique listing algorithm with improved work boun...
research
10/09/2009

Scaling Analysis of Affinity Propagation

We analyze and exploit some scaling properties of the Affinity Propagati...
research
09/27/2017

Diversified Coherent Core Search on Multi-Layer Graphs

Mining dense subgraphs on multi-layer graphs is an interesting problem, ...

Please sign up or login with your details

Forgot password? Click here to reset