A Unifying Perspective on Neighbor Embeddings along the Attraction-Repulsion Spectrum

07/17/2020
by   Jan Niklas Böhm, et al.

Neighbor embeddings are a family of methods for visualizing complex high-dimensional datasets using kNN graphs. To find the low-dimensional embedding, these algorithms combine an attractive force between neighboring pairs of points with a repulsive force between all points. One of the most popular examples of such algorithms is t-SNE. Here we show that changing the balance between the attractive and the repulsive forces in t-SNE yields a spectrum of embeddings, which is characterized by a simple trade-off: stronger attraction can better represent continuous manifold structures, while stronger repulsion can better represent discrete cluster structures. We show that UMAP embeddings correspond to t-SNE with increased attraction; this happens because the negative sampling optimisation strategy employed by UMAP strongly lowers the effective repulsion. Likewise, ForceAtlas2, commonly used for visualizing developmental single-cell transcriptomic data, yields embeddings corresponding to t-SNE with the attraction increased even more. At the extreme of this spectrum lies Laplacian Eigenmaps, corresponding to zero repulsion. Our results demonstrate that many prominent neighbor embedding algorithms can be placed onto this attraction-repulsion spectrum, and highlight the inherent trade-offs between them.
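The attraction-repulsion balance described in the abstract can be illustrated with a minimal NumPy sketch. It assumes the standard t-SNE gradient with a Cauchy kernel and an attraction multiplier applied to the attractive term. The function name `tsne_forces`, the parameter name `rho`, and the toy inputs are illustrative assumptions, not taken from the paper's code.

```python
import numpy as np


def tsne_forces(Y, P, rho=1.0):
    """Return (attraction, repulsion) forces acting on each embedding point.

    Y   : (n, d) array of low-dimensional positions.
    P   : (n, n) symmetric affinity matrix with zero diagonal, summing to 1.
    rho : hypothetical attraction multiplier; rho > 1 strengthens attraction
          relative to repulsion, rho = 1 is the plain t-SNE gradient.
    """
    diff = Y[:, None, :] - Y[None, :, :]           # diff[i, j] = y_i - y_j
    W = 1.0 / (1.0 + np.sum(diff ** 2, axis=-1))   # Cauchy kernel w_ij
    np.fill_diagonal(W, 0.0)                       # no self-interaction
    Q = W / W.sum()                                # normalised similarities q_ij

    # Forces are the negative gradient terms of the t-SNE loss:
    # attraction pulls y_i toward its neighbours, repulsion pushes it away
    # from every other point.
    attraction = -4.0 * rho * np.einsum("ij,ijk->ik", P * W, diff)
    repulsion = 4.0 * np.einsum("ij,ijk->ik", Q * W, diff)
    return attraction, repulsion


# Toy example: two points that are each other's only neighbour.
Y = np.array([[0.0, 0.0], [1.0, 0.0]])
P = np.array([[0.0, 0.5], [0.5, 0.0]])

att1, rep1 = tsne_forces(Y, P, rho=1.0)
att4, rep4 = tsne_forces(Y, P, rho=4.0)
```

In this toy case the two forces cancel at `rho=1.0` because the embedding similarities already match `P`. Raising `rho` scales only the attractive term, so the net force pulls the points together. Under the same assumptions, letting `rho` grow very large makes the repulsive term negligible, which is the zero-repulsion end of the spectrum that the abstract associates with Laplacian Eigenmaps.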

research
06/03/2022

Contrastive learning unifies t-SNE and UMAP

Neighbor embedding methods t-SNE and UMAP are the de facto standard for ...
research
02/24/2022

SQuadMDS: a lean Stochastic Quartet MDS improving global structure preservation in neighbor embedding like t-SNE and UMAP

Multidimensional scaling is a statistical process that aims to embed hig...
research
06/07/2022

On random embeddings and their application to optimisation

Random embeddings project high-dimensional spaces to low-dimensional one...
research
02/28/2022

Structure from Voltage

Effective resistance (ER) is an attractive way to interrogate the struct...
research
08/18/2021

Stochastic Cluster Embedding

Neighbor Embedding (NE) that aims to preserve pairwise similarities betw...
research
02/25/2021

t-SNE, Forceful Colorings and Mean Field Limits

t-SNE is one of the most commonly used force-based nonlinear dimensional...
research
06/09/2022

Principal Trade-off Analysis

This paper develops Principal Trade-off Analysis (PTA), a decomposition ...
