k-Means Clustering for Persistent Homology

10/18/2022
by   Prudence Leung, et al.
0

Persistent homology is a fundamental methodology from topological data analysis that summarizes the lifetimes of topological features within a dataset as a persistence diagram; it has recently gained much popularity from its myriad successful applications to many domains. However, a significant challenge to its widespread implementation, especially in statistical methodology and machine learning algorithms, is the format of the persistence diagram as a multiset of half-open intervals. In this paper, we comprehensively study k-means clustering where the input is various embeddings of persistence diagrams, as well as persistence diagrams themselves and their generalizations as persistence measures. We show that the clustering performance directly on persistence diagrams and measures far outperform their vectorized representations, despite their more complex representations. Moreover, we prove convergence of the algorithm on persistence diagram space and establish theoretical properties of the solution to the optimization problem in the Karush–Kuhn–Tucker framework.

READ FULL TEXT
research
12/11/2018

A flat persistence diagram for improved visualization of topological features in persistent homology

Visualization in the emerging field of topological data analysis has pro...
research
04/19/2022

Approximating Persistent Homology for Large Datasets

Persistent homology is an important methodology from topological data an...
research
01/10/2019

Understanding the Topology and the Geometry of the Persistence Diagram Space via Optimal Partial Transport

We consider a generalization of persistence diagrams, namely Radon measu...
research
05/25/2021

A Domain-Oblivious Approach for Learning Concise Representations of Filtered Topological Spaces for Clustering

Persistence diagrams have been widely used to quantify the underlying fe...
research
04/27/2019

Learning metrics for persistence-based summaries and applications for graph classification

Recently a new feature representation and data analysis methodology base...
research
06/04/2020

Fuzzy c-Means Clustering for Persistence Diagrams

Persistence diagrams, a key tool in the field of Topological Data Analys...
research
07/08/2022

A Geometric Condition for Uniqueness of Fréchet Means of Persistence Diagrams

The Fréchet mean is an important statistical summary and measure of cent...

Please sign up or login with your details

Forgot password? Click here to reset