DeepAI AI Chat
Log In Sign Up

k-Means Clustering for Persistent Homology

by   Prudence Leung, et al.
Imperial College London

Persistent homology is a fundamental methodology from topological data analysis that summarizes the lifetimes of topological features within a dataset as a persistence diagram; it has recently gained much popularity from its myriad successful applications to many domains. However, a significant challenge to its widespread implementation, especially in statistical methodology and machine learning algorithms, is the format of the persistence diagram as a multiset of half-open intervals. In this paper, we comprehensively study k-means clustering where the input is various embeddings of persistence diagrams, as well as persistence diagrams themselves and their generalizations as persistence measures. We show that the clustering performance directly on persistence diagrams and measures far outperform their vectorized representations, despite their more complex representations. Moreover, we prove convergence of the algorithm on persistence diagram space and establish theoretical properties of the solution to the optimization problem in the Karush–Kuhn–Tucker framework.


A flat persistence diagram for improved visualization of topological features in persistent homology

Visualization in the emerging field of topological data analysis has pro...

Approximating Persistent Homology for Large Datasets

Persistent homology is an important methodology from topological data an...

Understanding the Topology and the Geometry of the Persistence Diagram Space via Optimal Partial Transport

We consider a generalization of persistence diagrams, namely Radon measu...

A Domain-Oblivious Approach for Learning Concise Representations of Filtered Topological Spaces for Clustering

Persistence diagrams have been widely used to quantify the underlying fe...

Learning metrics for persistence-based summaries and applications for graph classification

Recently a new feature representation and data analysis methodology base...

Fuzzy c-Means Clustering for Persistence Diagrams

Persistence diagrams, a key tool in the field of Topological Data Analys...

A Geometric Condition for Uniqueness of Fréchet Means of Persistence Diagrams

The Fréchet mean is an important statistical summary and measure of cent...