FISHDBC: Flexible, Incremental, Scalable, Hierarchical Density-Based Clustering for Arbitrary Data and Distance

10/16/2019
by   Matteo Dell'Amico, et al.
0

FISHDBC is a flexible, incremental, scalable, and hierarchical density-based clustering algorithm. It is flexible because it empowers users to work on arbitrary data, skipping the feature extraction step that usually transforms raw data in numeric arrays letting users define an arbitrary distance function instead. It is incremental and scalable: it avoids the O(n^2) performance of other approaches in non-metric spaces and requires only lightweight computation to update the clustering when few items are added. It is hierarchical: it produces a "flat" clustering which can be expanded to a tree structure, so that users can group and/or divide clusters in sub- or super-clusters when data exploration requires so. It is density-based and approximates HDBSCAN*, an evolution of DBSCAN.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2020

Scalable Bottom-Up Hierarchical Clustering

Bottom-up algorithms such as the classic hierarchical agglomerative clus...
research
06/13/2023

PaVa: a novel Path-based Valley-seeking clustering algorithm

Clustering methods are being applied to a wider range of scenarios invol...
research
12/31/2019

Scalable Hierarchical Clustering with Tree Grafting

We introduce Grinch, a new algorithm for large-scale, non-greedy hierarc...
research
02/16/2022

IPD:An Incremental Prototype based DBSCAN for large-scale data with cluster representatives

DBSCAN is a fundamental density-based clustering technique that identifi...
research
04/10/2023

FINEX: A Fast Index for Exact Flexible Density-Based Clustering (Extended Version with Proofs)*

Density-based clustering aims to find groups of similar objects (i.e., c...
research
03/16/2022

Tangles and Hierarchical Clustering

We establish a connection between tangles, a concept from structural gra...
research
09/13/2017

Efficient Computation of Multiple Density-Based Clustering Hierarchies

HDBSCAN*, a state-of-the-art density-based hierarchical clustering metho...

Please sign up or login with your details

Forgot password? Click here to reset