Deep Learning Multidimensional Projections

02/21/2019
by   Mateus Espadoto, et al.
0

Dimensionality reduction methods, also known as projections, are frequently used for exploring multidimensional data in machine learning, data science, and information visualization. Among these, t-SNE and its variants have become very popular for their ability to visually separate distinct data clusters. However, such methods are computationally expensive for large datasets, suffer from stability problems, and cannot directly handle out-of-sample data. We propose a learning approach to construct such projections. We train a deep neural network based on a collection of samples from a given data universe, and their corresponding projections, and next use the network to infer projections of data from the same, or similar, universes. Our approach generates projections with similar characteristics as the learned ones, is computationally two to three orders of magnitude faster than SNE-class methods, has no complex-to-set user parameters, handles out-of-sample data in a stable manner, and can be used to learn any projection technique. We demonstrate our proposal on several real-world high dimensional datasets from machine learning.

READ FULL TEXT

page 7

page 8

page 11

research
11/07/2018

SRP: Efficient class-aware embedding learning for large-scale data via supervised random projections

Supervised dimensionality reduction strategies have been of great intere...
research
11/08/2016

Accelerating the BSM interpretation of LHC data with machine learning

The interpretation of Large Hadron Collider (LHC) data in the framework ...
research
05/11/2023

Collection Space Navigator: An Interactive Visualization Interface for Multidimensional Datasets

We introduce the Collection Space Navigator (CSN), a browser-based visua...
research
09/13/2019

Multi-Perspective, Simultaneous Embedding

We describe a method for simultaneous visualization of multiple pairwise...
research
01/17/2022

Distortion-Aware Brushing for Interactive Cluster Analysis in Multidimensional Projections

Brushing is an everyday interaction in 2D scatterplots, which allows use...
research
02/17/2020

t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualizatio...
research
01/26/2021

Contrastive analysis for scatter plot-based representations of dimensionality reduction

Exploring multidimensional datasets is a ubiquitous part of the ones wor...

Please sign up or login with your details

Forgot password? Click here to reset