A Domain-Oblivious Approach for Learning Concise Representations of Filtered Topological Spaces for Clustering

05/25/2021
by   Yu Qin, et al.
2

Persistence diagrams have been widely used to quantify the underlying features of filtered topological spaces in data visualization. In many applications, computing distances between diagrams is essential; however, computing these distances has been challenging due to the computational cost. In this paper, we propose a persistence diagram hashing framework that learns a binary code representation of persistence diagrams, which allows for fast computation of distances. This framework is built upon a generative adversarial network (GAN) with a diagram distance loss function to steer the learning process. Instead of using standard representations, we hash diagrams into binary codes, which have natural advantages in large-scale tasks. The training of this model is domain-oblivious in that it can be computed purely from synthetic, randomly created diagrams. As a consequence, our proposed method is directly applicable to various datasets without the need for retraining the model. These binary codes, when compared using fast Hamming distance, better maintain topological similarity properties between datasets than other vectorized representations. To evaluate this method, we apply our framework to the problem of diagram clustering and we compare the quality and performance of our approach to the state-of-the-art. In addition, we show the scalability of our approach on a dataset with 10k persistence diagrams, which is not possible with current techniques. Moreover, our experimental results demonstrate that our method is significantly faster with the potential of less memory usage, while retaining comparable or better quality comparisons.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 7

page 8

page 9

page 10

research
10/18/2022

k-Means Clustering for Persistent Homology

Persistent homology is a fundamental methodology from topological data a...
research
06/19/2018

On the Metric Distortion of Embedding Persistence Diagrams into Reproducing Kernel Hilbert Spaces

Persistence diagrams are important feature descriptors in Topological Da...
research
06/19/2018

On the Metric Distortion of Embedding Persistence Diagrams into separable Hilbert spaces

Persistence diagrams are important descriptors in Topological Data Analy...
research
03/16/2021

Learning Hyperbolic Representations of Topological Features

Learning task-specific representations of persistence diagrams is an imp...
research
05/22/2018

Large Scale computation of Means and Clusters for Persistence Diagrams using Optimal Transport

Persistence diagrams (PDs) are now routinely used to summarize the under...
research
11/08/2022

An Incremental Phase Mapping Approach for X-ray Diffraction Patterns using Binary Peak Representations

Despite the huge advancement in knowledge discovery and data mining tech...
research
07/28/2022

Topological Analysis of Ensembles of Hydrodynamic Turbulent Flows – An Experimental Study

This application paper presents a comprehensive experimental evaluation ...

Please sign up or login with your details

Forgot password? Click here to reset