G-Mapper: Learning a Cover in the Mapper Construction

09/12/2023
by   Enrique Alvarado, et al.
0

The Mapper algorithm is a visualization technique in topological data analysis (TDA) that outputs a graph reflecting the structure of a given dataset. The Mapper algorithm requires tuning several parameters in order to generate a "nice" Mapper graph. The paper focuses on selecting the cover parameter. We present an algorithm that optimizes the cover of a Mapper graph by splitting a cover repeatedly according to a statistical test for normality. Our algorithm is based on G-means clustering which searches for the optimal number of clusters in k-means by conducting iteratively the Anderson-Darling test. Our splitting procedure employs a Gaussian mixture model in order to choose carefully the cover based on the distribution of a given data. Experiments for synthetic and real-world datasets demonstrate that our algorithm generates covers so that the Mapper graphs retain the essence of the datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/23/2018

Graph Laplacian mixture model

Graph learning methods have recently been receiving increasing interest ...
research
09/06/2011

An Automatic Clustering Technique for Optimal Clusters

This paper proposes a simple, automatic and efficient clustering algorit...
research
11/02/2022

Pop2Piano : Pop Audio-based Piano Cover Generation

The piano cover of pop music is widely enjoyed by people. However, the g...
research
05/28/2013

Dynamic Clustering via Asymptotics of the Dependent Dirichlet Process Mixture

This paper presents a novel algorithm, based upon the dependent Dirichle...
research
07/06/2022

Careful seeding for the k-medoids algorithm with incremental k++ cluster construction

The k-medoids algorithm is a popular variant of the k-means algorithm an...
research
02/22/2023

Distributionally Robust Recourse Action

A recourse action aims to explain a particular algorithmic decision by s...
research
08/28/2020

The UU-test for Statistical Modeling of Unimodal Data

Deciding on the unimodality of a dataset is an important problem in data...

Please sign up or login with your details

Forgot password? Click here to reset