ActUp: Analyzing and Consolidating tSNE and UMAP

05/12/2023
by   Andrew Draganov, et al.
0

tSNE and UMAP are popular dimensionality reduction algorithms due to their speed and interpretable low-dimensional embeddings. Despite their popularity, however, little work has been done to study their full span of differences. We theoretically and experimentally evaluate the space of parameters in both tSNE and UMAP and observe that a single one – the normalization – is responsible for switching between them. This, in turn, implies that a majority of the algorithmic differences can be toggled without affecting the embeddings. We discuss the implications this has on several theoretic claims behind UMAP, as well as how to reconcile them with existing tSNE interpretations. Based on our analysis, we provide a method () that combines previously incompatible techniques from tSNE and UMAP and can replicate the results of either algorithm. This allows our method to incorporate further improvements, such as an acceleration that obtains either method's outputs faster than UMAP. We release improved versions of tSNE, UMAP, and that are fully plug-and-play with the traditional libraries at https://github.com/Andrew-Draganov/GiDR-DUN

READ FULL TEXT

page 3

page 7

page 14

research
06/20/2022

GiDR-DUN; Gradient Dimensionality Reduction – Differences and Unification

TSNE and UMAP are two of the most popular dimensionality reduction algor...
research
12/02/2021

Interactive Visualization of Spatial Omics Neighborhoods

Dimensionality reduction of spatial omic data can reveal shared, spatial...
research
12/30/2020

kōan: A Corrected CBOW Implementation

It is a common belief in the NLP community that continuous bag-of-words ...
research
03/14/2022

Accelerating Plug-and-Play Image Reconstruction via Multi-Stage Sketched Gradients

In this work we propose a new paradigm for designing fast plug-and-play ...
research
06/23/2023

Using persistent homology to understand dimensionality reduction in resting-state fMRI

Evaluating the success of a manifold learning method remains a challengi...
research
09/26/2021

An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces

Much recent work in bilingual lexicon induction (BLI) views word embeddi...
research
02/09/2023

Adap-τ: Adaptively Modulating Embedding Magnitude for Recommendation

Recent years have witnessed the great successes of embedding-based metho...

Please sign up or login with your details

Forgot password? Click here to reset