Persistence Bag-of-Words for Topological Data Analysis

12/21/2018
by   Bartosz Zielinski, et al.
22

Persistent homology (PH) is a rigorous mathematical theory that provides a robust descriptor of data in the form of persistence diagrams (PDs). PDs are compact 2D representations formed by multisets of points. Their variable size makes them, however, difficult to combine with typical machine learning workflows. In this paper, we introduce persistence bag-of-words, which is a novel, expressive and discriminative vectorized representation of PDs for topological data analysis. It represents PDs in a convenient way for machine learning and statistical analysis and has a number of favorable practical and theoretical properties like 1-Wasserstein stability. We evaluate our representation on several heterogeneous datasets and show its high discriminative power. Our approach achieves state-of-the-art performance and even beyond in much less time than alternative approaches. Thereby, it facilitates the topological analysis of large-scale data sets in future.

READ FULL TEXT

page 1

page 5

page 6

page 8

page 10

page 11

page 13

page 20

research
02/13/2018

Persistence Codebooks for Topological Data Analysis

Topological data analysis, such as persistent homology has shown benefic...
research
09/19/2021

Instability of the Betti Sequence for Persistent Homology and a Stabilized Version of the Betti Sequence

Topological Data Analysis (TDA), a relatively new field of data analysis...
research
11/23/2020

The Interconnectivity Vector: A Finite-Dimensional Vector Representation of Persistent Homology

Persistent Homology (PH) is a useful tool to study the underlying struct...
research
03/10/2020

Topological Machine Learning for Mixed Numeric and Categorical Data

Topological data analysis is a relatively new branch of machine learning...
research
05/25/2022

Some equivalence relation between persistent homology and morphological dynamics

In Mathematical Morphology (MM), connected filters based on dynamics are...
research
02/28/2022

A probabilistic result on impulsive noise reduction in Topological Data Analysis through Group Equivariant Non-Expansive Operators

In recent years, group equivariant non-expansive operators (GENEOs) have...
research
06/03/2021

Homological Time Series Analysis of Sensor Signals from Power Plants

In this paper, we use topological data analysis techniques to construct ...

Please sign up or login with your details

Forgot password? Click here to reset