An Analytical Survey on Recent Trends in High Dimensional Data Visualization

by Alexander Kiefer, et al.

Data visualization is the process of transforming data of any size or dimensionality into an understandable lower-dimensional representation that people can more easily manipulate and interpret. The goal of our paper is to survey the performance of current high-dimensional data visualization techniques and to quantify their strengths and weaknesses through relevant quantitative measures: runtime, memory usage, clustering quality, separation quality, global structure preservation, and local structure preservation. To perform the analysis, we select a subset of state-of-the-art methods. Our work shows how the selected algorithms produce embeddings with unique qualities that lend themselves to certain tasks, and how each of these algorithms is constrained by compute resources.
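As a rough illustration of how three of the measures named above (runtime, memory usage, and local structure preservation) might be computed for a given embedding, here is a minimal sketch using only the Python standard library. It is not the paper's evaluation code: the function names, the k-nearest-neighbor overlap score, the synthetic Gaussian data, and the placeholder "embedding" that simply keeps the first two coordinates are all assumptions made for this example.

```python
import math
import random
import time
import tracemalloc


def knn_indices(points, k):
    """For each point, return the set of indices of its k nearest neighbors (Euclidean)."""
    neighbors = []
    for i, p in enumerate(points):
        dists = sorted(
            (math.dist(p, q), j) for j, q in enumerate(points) if j != i
        )
        neighbors.append({j for _, j in dists[:k]})
    return neighbors


def neighbor_preservation(high, low, k=5):
    """Mean fraction of each point's k nearest neighbors shared by both spaces."""
    nn_high = knn_indices(high, k)
    nn_low = knn_indices(low, k)
    overlaps = [len(a & b) / k for a, b in zip(nn_high, nn_low)]
    return sum(overlaps) / len(overlaps)


def profile(embed, data):
    """Run embed(data); return the result, wall-clock seconds, and peak traced bytes."""
    tracemalloc.start()
    start = time.perf_counter()
    embedding = embed(data)
    elapsed = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return embedding, elapsed, peak


def keep_first_two_axes(data):
    # Hypothetical stand-in for a real visualization method; NOT one of the
    # algorithms surveyed in the paper.
    return [row[:2] for row in data]


# Synthetic 10-dimensional data, assumed purely for illustration.
rng = random.Random(0)
data = [[rng.gauss(0, 1) for _ in range(10)] for _ in range(60)]

embedding, seconds, peak_bytes = profile(keep_first_two_axes, data)
score = neighbor_preservation(data, embedding, k=5)
print(f"runtime={seconds:.6f}s peak_memory={peak_bytes}B knn_preservation={score:.3f}")
```

A score of 1.0 would mean every point keeps exactly the same 5 nearest neighbors after embedding. Swapping `keep_first_two_axes` for a real dimensionality reduction routine would let the same harness compare methods on equal footing, although tracemalloc only tracks memory allocated through Python's allocator and may undercount native libraries.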


Extending Scatterplots to Scalar Fields

Embedding high-dimensional data into a 2D canvas is a popular strategy f...

Visualizing structure and transitions in high-dimensional biological data

The high-dimensional data created by high-throughput technologies requir...

Visualizing Data using GTSNE

We present a new method GTSNE to visualize high-dimensional data points ...

Burning sage: Reversing the curse of dimensionality in the visualization of high-dimensional data

In high-dimensional data analysis the curse of dimensionality reasons th...

Data Quality Measures and Efficient Evaluation Algorithms for Large-Scale High-Dimensional Data

Machine learning has been proven to be effective in various application ...

A Survey and Implementation of Performance Metrics for Self-Organized Maps

Self-Organizing Map algorithms have been used for almost 40 years across...