Automatic Selection of t-SNE Perplexity

08/10/2017
by   Yanshuai Cao, et al.
0

t-Distributed Stochastic Neighbor Embedding (t-SNE) is one of the most widely used dimensionality reduction methods for data visualization, but it has a perplexity hyperparameter that requires manual selection. In practice, proper tuning of t-SNE perplexity requires users to understand the inner working of the method as well as to have hands-on experience. We propose a model selection objective for t-SNE perplexity that requires negligible extra computation beyond that of the t-SNE itself. We empirically validate that the perplexity settings found by our approach are consistent with preferences elicited from human experts across a number of datasets. The similarities of our approach to Bayesian information criteria (BIC) and minimum description length (MDL) are also analyzed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2016

Bayesian Model Selection of Stochastic Block Models

A central problem in analyzing networks is partitioning them into module...
research
02/09/2017

Stochastic Neighbor Embedding separates well-separated clusters

Stochastic Neighbor Embedding and its variants are widely used dimension...
research
06/01/2023

Efficient and Robust Bayesian Selection of Hyperparameters in Dimension Reduction for Visualization

We introduce an efficient and robust auto-tuning framework for hyperpara...
research
02/13/2019

Differential Description Length for Hyperparameter Selection in Machine Learning

This paper introduces a new method for model selection and more generall...
research
03/17/2018

Multi-device, Multi-tenant Model Selection with GP-EI

Bayesian optimization is the core technique behind the emergence of Auto...
research
08/18/2020

Word2vec Skip-gram Dimensionality Selection via Sequential Normalized Maximum Likelihood

In this paper, we propose a novel information criteria-based approach to...
research
05/03/2022

A unified view on Self-Organizing Maps (SOMs) and Stochastic Neighbor Embedding (SNE)

We propose a unified view on two widely used data visualization techniqu...

Please sign up or login with your details

Forgot password? Click here to reset