A shortest-path based clustering algorithm for joint human-machine analysis of complex datasets

12/31/2018
by   Diego Ulisse Pizzagalli, et al.
18

Clustering is a technique for the analysis of datasets obtained by empirical studies in several disciplines with a major application for biomedical research. Essentially, clustering algorithms are executed by machines aiming at finding groups of related points in a dataset. However, the result of grouping depends on both metrics for point-to-point similarity and rules for point-to-group association. Indeed, non-appropriate metrics and rules can lead to undesirable clustering artifacts. This is especially relevant for datasets, where groups with heterogeneous structures co-exist. In this work, we propose an algorithm that achieves clustering by exploring the paths between points. This allows both, to evaluate the properties of the path (such as gaps, density variations, etc.), and expressing the preference for certain paths. Moreover, our algorithm supports the integration of existing knowledge about admissible and non-admissible clusters by training a path classifier. We demonstrate the accuracy of the proposed method on challenging datasets including points from synthetic shapes in publicly available benchmarks and microscopy data.

READ FULL TEXT

page 9

page 10

page 11

page 12

page 13

research
12/04/2018

Multiple Manifold Clustering Using Curvature Constrained Path

The problem of multiple surface clustering is a challenging task, partic...
research
04/25/2020

Clustering by Constructing Hyper-Planes

As a kind of basic machine learning method, clustering algorithms group ...
research
08/21/2020

ConiVAT: Cluster Tendency Assessment and Clustering with Partial Background Knowledge

The VAT method is a visual technique for determining the potential clust...
research
05/02/2019

Efficient Contour Computation of Group-based Skyline

Skyline, aiming at finding a Pareto optimal subset of points in a multi-...
research
05/30/2022

GraphWalks: Efficient Shape Agnostic Geodesic Shortest Path Estimation

Geodesic paths and distances are among the most popular intrinsic proper...
research
03/28/2020

Single-Point Visibility Constraint Minimum Link Paths in Simple Polygons

We address the following problem: Given a simple polygon P with n vertic...
research
01/08/2021

When does the Physarum Solver Distinguish the Shortest Path from other Paths: the Transition Point and its Applications

Physarum solver, also called the physarum polycephalum inspired algorith...

Please sign up or login with your details

Forgot password? Click here to reset