LUMÁWIG: An Efficient Algorithm for Dimension Zero Bottleneck Distance Computation in Topological Data Analysis

10/01/2020
by   Paul Samuel Ignacio, et al.
0

Stability of persistence diagrams under slight perturbations is a key characteristic behind the validity and growing popularity of topological data analysis in exploring real-world data. Central to this stability is the use of Bottleneck distance which entails matching points between diagrams. Use of this metric in practical studies has, however, been few and sparingly because of the computational obstruction, especially in dimension zero where the computational cost explodes with the growth of data size. We present LUMÁWIG, a novel efficient algorithm to compute dimension zero bottleneck distance between two persistent diagrams which runs significantly faster and provides significantly sharper approximates with respect to the output of the original algorithm than any other available algorithm. We bypass the overwhelming matching problem in previous implementations of the bottleneck distance, and prove that the zero dimensional bottleneck distance can be recovered from a very small number of matching cases. We show that LUMÁWIG generally enjoys linear complexity as shown by empirical tests. We also present an application that leverages dimension zero persistence diagrams and the bottleneck distance to produce features for classification tasks.

READ FULL TEXT
research
06/04/2021

Bottleneck Profiles and Discrete Prokhorov Metrics for Persistence Diagrams

In topological data analysis (TDA), persistence diagrams have been a suc...
research
08/17/2021

Rectangular Approximation and Stability of 2-parameter Persistence Modules

One of the main reasons for topological persistence being useful in data...
research
10/08/2020

Intrinsic Hierarchical Clustering Behavior Recovers Higher Dimensional Shape Information

We show that specific higher dimensional shape information of point clou...
research
09/17/2018

Computing Wasserstein Distance for Persistence Diagrams on a Quantum Computer

Persistence diagrams are a useful tool from topological data analysis wh...
research
06/23/2023

Effective data reduction algorithm for topological data analysis

One of the most interesting tools that have recently entered the data sc...
research
02/03/2022

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

The use of topological descriptors in modern machine learning applicatio...
research
06/11/2023

A Normalized Bottleneck Distance on Persistence Diagrams and Homology Preservation under Dimension Reduction

Persistence diagrams are used as signatures of point cloud data assumed ...

Please sign up or login with your details

Forgot password? Click here to reset