Stochastic Cluster Embedding

08/18/2021
by   Zhirong Yang, et al.
21

Neighbor Embedding (NE) that aims to preserve pairwise similarities between data items has been shown to yield an effective principle for data visualization. However, even the currently best NE methods such as Stochastic Neighbor Embedding (SNE) may leave large-scale patterns such as clusters hidden despite of strong signals being present in the data. To address this, we propose a new cluster visualization method based on Neighbor Embedding. We first present a family of Neighbor Embedding methods which generalizes SNE by using non-normalized Kullback-Leibler divergence with a scale parameter. In this family, much better cluster visualizations often appear with a parameter value different from the one corresponding to SNE. We also develop an efficient software which employs asynchronous stochastic block coordinate descent to optimize the new family of objective functions. The experimental results demonstrate that our method consistently and substantially improves visualization of data clusters compared with the state-of-the-art NE approaches.

READ FULL TEXT

page 8

page 11

page 12

page 14

page 15

page 18

page 22

page 24

research
10/06/2021

T-SNE Is Not Optimized to Reveal Clusters in Data

Cluster visualization is an essential task for nonlinear dimensionality ...
research
02/09/2017

Stochastic Neighbor Embedding separates well-separated clusters

Stochastic Neighbor Embedding and its variants are widely used dimension...
research
05/03/2022

A unified view on Self-Organizing Maps (SOMs) and Stochastic Neighbor Embedding (SNE)

We propose a unified view on two widely used data visualization techniqu...
research
11/03/2018

Stochastic Neighbor Embedding under f-divergences

The t-distributed Stochastic Neighbor Embedding (t-SNE) is a powerful an...
research
05/02/2020

Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization

Multimodal relational data analysis has become of increasing importance ...
research
07/17/2020

A Unifying Perspective on Neighbor Embeddings along the Attraction-Repulsion Spectrum

Neighbor embeddings are a family of methods for visualizing complex high...
research
07/14/2023

Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations

Finding (bi-)clusters in bipartite graphs is a popular data analysis app...

Please sign up or login with your details

Forgot password? Click here to reset