Using Dimensionality Reduction to Optimize t-SNE

12/02/2019
by Rikhav Shah et al.

t-SNE is a popular tool for embedding multi-dimensional datasets into two or three dimensions. However, it has a large computational cost, especially when the input data has many dimensions. To sidestep that cost, many practitioners apply t-SNE to the output of a neural network, which is generally of much lower dimension than the original data. Because this depends on a trained network, it limits the use of t-SNE in unsupervised scenarios. We propose using random projections to embed high-dimensional datasets into relatively few dimensions, and then using t-SNE to obtain a two-dimensional embedding. We show that random projections preserve the desirable clustering achieved by t-SNE while dramatically reducing the runtime of finding the embedding.
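
The random projection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian projection matrix, the toy two-cluster data, the target dimension of 50, and the helper names `random_projection` and `distance` are all assumptions chosen for illustration. The sketch only shows that pairwise distances, and hence cluster separation, roughly survive the projection; the projected points would then be handed to a t-SNE implementation of choice.

```python
import math
import random


def random_projection(points, target_dim, seed=0):
    """Project each point onto target_dim random Gaussian directions.

    Entries are drawn from N(0, 1/target_dim), so squared Euclidean
    distances are preserved in expectation.
    """
    rng = random.Random(seed)
    source_dim = len(points[0])
    scale = 1.0 / math.sqrt(target_dim)
    # One random direction per output coordinate.
    directions = [
        [rng.gauss(0.0, 1.0) * scale for _ in range(source_dim)]
        for _ in range(target_dim)
    ]
    return [
        [sum(d * x for d, x in zip(direction, point)) for direction in directions]
        for point in points
    ]


def distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical toy data: two well-separated clusters in 500 dimensions.
rng = random.Random(1)
cluster_a = [[rng.gauss(0.0, 1.0) for _ in range(500)] for _ in range(10)]
cluster_b = [[rng.gauss(3.0, 1.0) for _ in range(500)] for _ in range(10)]
data = cluster_a + cluster_b

# Reduce 500 dimensions to 50 before any t-SNE step.
projected = random_projection(data, target_dim=50)

# Compare a within-cluster and a between-cluster distance before and after.
within_ratio = distance(projected[0], projected[1]) / distance(data[0], data[1])
between_ratio = distance(projected[0], projected[10]) / distance(data[0], data[10])
print(f"within-cluster distance ratio:  {within_ratio:.2f}")
print(f"between-cluster distance ratio: {between_ratio:.2f}")
```

Both ratios should land near 1, which is the property that lets a neighbor-based method such as t-SNE run on the much smaller projected data. In practice a library routine would likely replace the hand-written projection; the pure-Python version is used here only to keep the sketch dependency-free.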


