Visual Cluster Separation Using High-Dimensional Sharpened Dimensionality Reduction

10/01/2021
by   Youngjoo Kim, et al.
0

Applying dimensionality reduction (DR) to large, high-dimensional data sets can be challenging when distinguishing the underlying high-dimensional data clusters in a 2D projection for exploratory analysis. We address this problem by first sharpening the clusters in the original high-dimensional data prior to the DR step using Local Gradient Clustering (LGC). We then project the sharpened data from the high-dimensional space to 2D by a user-selected DR method. The sharpening step aids this method to preserve cluster separation in the resulting 2D projection. With our method, end-users can label each distinct cluster to further analyze an otherwise unlabeled data set. Our `High-Dimensional Sharpened DR' (HD-SDR) method, tested on both synthetic and real-world data sets, is favorable to DR methods with poor cluster separation and yields a better visual cluster separation than these DR methods with no sharpening. Our method achieves good quality (measured by quality metrics) and scales computationally well with large high-dimensional data. To illustrate its concrete applications, we further apply HD-SDR on a recent astronomical catalog.

READ FULL TEXT

page 12

page 15

research
02/23/2022

Human Motion Detection Using Sharpened Dimensionality Reduction and Clustering

Sharpened dimensionality reduction (SDR), which belongs to the class of ...
research
08/01/2023

Classes are not Clusters: Improving Label-based Evaluation of Dimensionality Reduction

A common way to evaluate the reliability of dimensionality reduction (DR...
research
10/06/2021

Revisiting Dimensionality Reduction Techniques for Visual Cluster Analysis: An Empirical Study

Dimensionality Reduction (DR) techniques can generate 2D projections and...
research
04/19/2018

Mathematical Analysis on Out-of-Sample Extensions

Let X=X∪Z be a data set in R^D, where X is the training set and Z is the...
research
11/13/2018

Interactive dimensionality reduction using similarity projections

Recent advances in machine learning allow us to analyze and describe the...
research
09/07/2022

Dimensionality Reduction using Elastic Measures

With the recent surge in big data analytics for hyper-dimensional data t...
research
07/12/2019

Improving the Projection of Global Structures in Data through Spanning Trees

The connection of edges in a graph generates a structure that is indepen...

Please sign up or login with your details

Forgot password? Click here to reset