Overlapping and Robust Edge-Colored Clustering in Hypergraphs

05/28/2023
by Alex Crane, et al.

A recent trend in data mining has explored (hyper)graph clustering algorithms for data with categorical relationship types. Such algorithms have applications in the analysis of social, co-authorship, and protein interaction networks, among others. Many of these applications naturally involve some overlap between clusters, a nuance that current combinatorial models do not capture. Existing models also lack a mechanism for handling noise in datasets. We address these concerns by generalizing Edge-Colored Clustering, a recent framework for categorical clustering of hypergraphs. Our generalizations allow for a budgeted number of either (a) overlapping cluster assignments or (b) node deletions. For each new model we present a greedy algorithm that approximately minimizes an edge mistake objective, as well as bicriteria approximations in which the second approximation factor applies to the budget. Additionally, we address the parameterized complexity of each problem, providing FPT algorithms and hardness results.
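
To make the edge mistake objective concrete, here is a minimal Python sketch. It assumes the usual Edge-Colored Clustering setup, in which each hyperedge carries one color and counts as a mistake unless every node in it holds that color. The function names `edge_mistakes` and `majority_vote`, the toy hypergraph, and the majority-vote baseline are all illustrative assumptions; this is not the greedy algorithm from the paper. Allowing a node to hold more than one color stands in for the overlapping variant.

```python
from collections import Counter, defaultdict

def majority_vote(edges):
    """Baseline (an assumption, not the paper's method): give each node
    the color that occurs most often among its incident hyperedges."""
    counts = defaultdict(Counter)
    for nodes, color in edges:
        for v in nodes:
            counts[v][color] += 1
    # Break ties by the smallest color label so the result is deterministic.
    return {v: {min(c, key=lambda col: (-c[col], col))}
            for v, c in counts.items()}

def edge_mistakes(edges, assignment):
    """Count hyperedges in which some node lacks the edge's color.
    `assignment` maps each node to a set of colors, so a node holding
    several colors models an overlapping cluster assignment."""
    return sum(
        1 for nodes, color in edges
        if any(color not in assignment.get(v, set()) for v in nodes)
    )

# Toy hypergraph: each entry is (set of nodes, color of the hyperedge).
edges = [
    ({"a", "b"}, "red"),
    ({"b", "c"}, "blue"),
    ({"a", "b", "c"}, "red"),
    ({"c", "d"}, "blue"),
]

assignment = majority_vote(edges)
baseline = edge_mistakes(edges, assignment)

# Spend an overlap budget of one extra color on node "c".
overlapping = {v: set(colors) for v, colors in assignment.items()}
overlapping["c"].add("red")
with_overlap = edge_mistakes(edges, overlapping)
```

In this toy instance, letting node `c` hold a second color satisfies the three-node red hyperedge, which illustrates how a small overlap budget can reduce the number of edge mistakes.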

research
10/22/2019

Hypergraph clustering with categorical edge labels

Graphs and networks are a standard model for describing data or systems ...
research
04/24/2020

Non-Exhaustive, Overlapping Co-Clustering: An Extended Analysis

The goal of co-clustering is to simultaneously identify a clustering of ...
research
02/20/2021

nTreeClus: a Tree-based Sequence Encoder for Clustering Categorical Series

The overwhelming presence of categorical/sequential data in diverse doma...
research
12/09/2018

A matching based clustering algorithm for categorical data

Cluster analysis is one of the essential tasks in data mining and knowle...
research
09/19/2019

DAOC: Stable Clustering of Large Networks

Clustering is a crucial component of many data mining systems involving ...
research
10/30/2018

Enhanced Ensemble Clustering via Fast Propagation of Cluster-wise Similarities

Ensemble clustering has been a popular research topic in data mining and...
