Improving the Projection of Global Structures in Data through Spanning Trees

07/12/2019
by   Daniel Alcaide, et al.
0

The connection of edges in a graph generates a structure that is independent of a coordinate system. This visual metaphor allows creating a more flexible representation of data than a two-dimensional scatterplot. In this work, we present STAD (Spanning Trees as Approximation of Data), a dimensionality reduction method to approximate the high-dimensional structure into a graph with or without formulating prior hypotheses. STAD generates an abstract representation of high-dimensional data by giving each data point a location in a graph which preserves the distances in the original high-dimensional space. The STAD graph is built upon the Minimum Spanning Tree (MST) to which new edges are added until the correlation between the distances from the graph and the original dataset is maximized. Additionally, STAD supports the inclusion of additional functions to focus the exploration and allow the analysis of data from new perspectives, emphasizing traits in data which otherwise would remain hidden. We demonstrate the effectiveness of our method by applying it to two real-world datasets: traffic density in Barcelona and temporal measurements of air quality in Castile and León in Spain.

READ FULL TEXT

page 3

page 9

research
10/01/2021

Visual Cluster Separation Using High-Dimensional Sharpened Dimensionality Reduction

Applying dimensionality reduction (DR) to large, high-dimensional data s...
research
11/02/2021

UnProjection: Leveraging Inverse-Projections for Visual Analytics of High-Dimensional Data

Projection techniques are often used to visualize high-dimensional data,...
research
10/04/2022

Rainbow spanning trees in randomly coloured G_k-out

Given a graph G=(V,E) on n vertices and an assignment of colours to its ...
research
01/30/2023

Circular Coordinates for Density-Robust Analysis

Dimensionality reduction is a crucial technique in data analysis, as it ...
research
10/16/2016

Probabilistic Dimensionality Reduction via Structure Learning

We propose a novel probabilistic dimensionality reduction framework that...
research
08/16/2019

Visualization of Very Large High-Dimensional Data Sets as Minimum Spanning Trees

Here, we introduce a new data visualization and exploration method, TMAP...
research
02/11/2021

Visualizing hierarchies in scRNA-seq data using a density tree-biased autoencoder

Single cell RNA sequencing (scRNA-seq) data makes studying the developme...

Please sign up or login with your details

Forgot password? Click here to reset