ShapeVis: High-dimensional Data Visualization at Scale

01/15/2020
by Nupur Kumari, et al.

We present ShapeVis, a scalable visualization technique for point cloud data inspired by topological data analysis. Our method captures the underlying geometric and topological structure of the data in a compressed graphical representation. The data visualization technique Mapper, which discretely approximates the Reeb graph of a filter function on the data, has been reported to achieve much success. However, when standard dimensionality reduction algorithms are used as the filter function, Mapper incurs considerable computational cost, which makes it difficult to scale to high-dimensional data. Our proposed technique finds a subset of points, called landmarks, along the data manifold and constructs a weighted witness graph over them. This graph captures the structural characteristics of the point cloud, and its weights are determined using a finite Markov chain. We further compress this graph by applying induced maps from standard community detection algorithms. Using techniques borrowed from manifold tearing, we prune and reinstate edges in the induced graph based on their modularity to summarize the shape of the data. We empirically demonstrate how our technique captures the structural characteristics of real and synthetic data sets. Further, we compare our approach with Mapper using various filter functions such as t-SNE, UMAP, and LargeVis, and show that our algorithm scales to millions of data points while preserving the quality of data visualization.
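
To make the landmark and witness-graph idea concrete, here is a minimal sketch and not the authors' implementation. The abstract does not say how landmarks are chosen or how a witness is defined, so the sketch makes two assumptions: landmarks are picked by greedy farthest-point (maxmin) sampling, and a data point "witnesses" an edge between its two nearest landmarks. Edge weights here are plain witness counts, a stand-in for the finite Markov chain weights described above. The function names `maxmin_landmarks`, `witness_graph`, and `sample_circle` are hypothetical.

```python
import math
import random
from collections import Counter


def maxmin_landmarks(points, k, seed=0):
    """Pick k landmark indices by greedy farthest-point (maxmin) sampling.

    Assumption: the abstract does not specify the landmark scheme.
    """
    rng = random.Random(seed)
    first = rng.randrange(len(points))
    landmarks = [first]
    # dist_to_set[i] = distance from point i to its nearest chosen landmark
    dist_to_set = [math.dist(p, points[first]) for p in points]
    while len(landmarks) < k:
        # The next landmark is the point farthest from all current landmarks.
        nxt = max(range(len(points)), key=dist_to_set.__getitem__)
        landmarks.append(nxt)
        for i, p in enumerate(points):
            d = math.dist(p, points[nxt])
            if d < dist_to_set[i]:
                dist_to_set[i] = d
    return landmarks


def witness_graph(points, landmarks):
    """Build a weighted graph over landmarks (requires at least 2 landmarks).

    Assumption: a point witnesses the edge between its two nearest
    landmarks; the edge weight is the number of such witnesses
    (a simple count, not the Markov chain weighting).
    """
    edges = Counter()
    for p in points:
        ranked = sorted(landmarks, key=lambda l: math.dist(p, points[l]))
        a, b = ranked[0], ranked[1]
        edges[(min(a, b), max(a, b))] += 1
    return dict(edges)


def sample_circle(n):
    """Return n evenly spaced points on the unit circle (toy data)."""
    return [
        (math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n))
        for i in range(n)
    ]


points = sample_circle(200)
landmarks = maxmin_landmarks(points, k=8)
graph = witness_graph(points, landmarks)
```

On this toy circle, the weighted graph over eight landmarks should roughly trace a closed loop, which is the kind of shape summary the abstract describes. The later steps (community detection and modularity-based pruning) are not sketched here.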


research
06/14/2019

Topological Data Analysis with ε-net Induced Lazy Witness Complex

Topological data analysis computes and analyses topological features of ...
research
11/30/2020

Contagion Dynamics for Manifold Learning

Contagion maps exploit activation times in threshold contagions to assig...
research
07/19/2019

Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications

With the rapid adoption of machine learning techniques for large-scale a...
research
11/06/2020

Mapper Interactive: A Scalable, Extendable, and Interactive Toolbox for the Visual Exploration of High-Dimensional Data

The mapper algorithm is a popular tool from topological data analysis fo...
research
01/15/2013

Barnes-Hut-SNE

The paper presents an O(N log N)-implementation of t-SNE -- an embedding...
research
01/30/2020

NCVis: Noise Contrastive Approach for Scalable Visualization

Modern methods for data visualization via dimensionality reduction, such...
